5 Commits

Author SHA1 Message Date
rasbt
089901db26 small figure update 2024-08-05 17:57:16 -05:00
Daniel Kleine
dcdf04e3bd minor DPO fixes (#298)
* fixed issues, updated .gitignore

* added closing paren

* fixed CEL spelling

* fixed more minor issues

* Update ch07/01_main-chapter-code/ch07.ipynb

* Update ch07/04_preference-tuning-with-dpo/dpo-from-scratch.ipynb

* Update ch07/04_preference-tuning-with-dpo/dpo-from-scratch.ipynb

* Update ch07/04_preference-tuning-with-dpo/dpo-from-scratch.ipynb

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-08-05 08:40:46 -05:00
rasbt
6030071e3f update model path 2024-08-05 07:36:08 -05:00
rasbt
f302f5e8d5 improve latex rendering in dpo notebook 2024-08-04 09:19:59 -05:00
Sebastian Raschka
09dc080cf3 Direct Preference Optimization from scratch (#294) 2024-08-04 08:57:36 -05:00