LLMs-from-scratch/ch07/04_preference-tuning-with-dpo

Chapter 7: Finetuning to Follow Instructions