mirror of https://github.com/rasbt/LLMs-from-scratch.git synced 2025-08-29 11:00:55 +00:00

Sebastian Raschka d5eaa36416 Ch06 and Ch07 videos (#613 )

* Ch06 and Ch07 videos

* exclude google scholar from link checking

2025-04-12 14:51:02 -05:00

Chapter 7: Finetuning to Follow Instructions

Main Chapter Code

02_dataset-utilities contains utility code that can be used for preparing an instruction dataset
03_model-evaluation contains utility code for evaluating instruction responses using a local Llama 3 model and the GPT-4 API
04_preference-tuning-with-dpo implements code for preference finetuning with Direct Preference Optimization (DPO)
05_dataset-generation contains code to generate and improve synthetic datasets for instruction finetuning
06_user_interface implements an interactive user interface to interact with the pretrained LLM