4 Commits

Author SHA1 Message Date
Daniel Kleine
8318d1f002
minor DPO fixes (#298)
* fixed issues, updated .gitignore

* added closing paren

* fixed CEL spelling

* fixed more minor issues

* Update ch07/01_main-chapter-code/ch07.ipynb

* Update ch07/04_preference-tuning-with-dpo/dpo-from-scratch.ipynb

* Update ch07/04_preference-tuning-with-dpo/dpo-from-scratch.ipynb

* Update ch07/04_preference-tuning-with-dpo/dpo-from-scratch.ipynb

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-08-05 08:40:46 -05:00
rasbt
36b9d5e0eb
update model path
2024-08-05 07:36:08 -05:00
rasbt
60aada801b
improve latex rendering in dpo notebook
2024-08-04 09:19:59 -05:00
Sebastian Raschka
52435804eb
Direct Preference Optimization from scratch (#294)
2024-08-04 08:57:36 -05:00