LLMs-from-scratch/ch07/04_preference-tuning-with-dpo
Sebastian Raschka ec062e1099 Dpo vocab size clarification (#628)
* Llama3 from scratch improvements

* vocab size should be 50257 not 50256

* restore
2025-04-18 17:20:56 -05:00
..

Chapter 7: Finetuning to Follow Instructions