# Chapter 7: Finetuning to Follow Instructions
- create-preference-data-ollama.ipynb: A notebook that creates a synthetic dataset for preference finetuning using Llama 3.1 and Ollama (see the request sketch after this list)
- dpo-from-scratch.ipynb: This notebook implements Direct Preference Optimization (DPO) for LLM alignment (see the loss sketch after this list)
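As a rough illustration of the kind of Ollama interaction the first notebook relies on, here is a minimal sketch, assuming a locally running Ollama server on its default port and a pulled `llama3.1` model. The helper name `query_ollama`, the sampling options, and the example prompt are illustrative, not taken from the notebook.

```python
import requests

def query_ollama(prompt, model="llama3.1", url="http://localhost:11434/api/chat"):
    # Send a single, non-streaming chat request to a local Ollama server
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,
        # Fixed seed and zero temperature for reproducible outputs (illustrative choice)
        "options": {"seed": 123, "temperature": 0.0},
    }
    response = requests.post(url, json=payload, timeout=120)
    response.raise_for_status()
    return response.json()["message"]["content"]

# Hypothetical example: rewrite an existing instruction-dataset response into a
# "politer" variant, which can then be paired with the original to form a
# chosen/rejected preference pair.
prompt = "Rewrite the following response so that it is more polite: 'No.'"
print(query_ollama(prompt))
```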
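The DPO notebook centers on the DPO loss. Below is a minimal sketch of that loss in PyTorch, assuming per-sequence log-probabilities have already been computed for both the trainable policy model and the frozen reference model; the function name, argument names, and default `beta` are assumptions for illustration, not the notebook's exact code.

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             reference_chosen_logps, reference_rejected_logps, beta=0.1):
    """Compute the DPO loss from per-sequence log-probabilities.

    Each argument is a 1D tensor of shape (batch_size,) holding the log-probability
    the respective model assigns to the chosen or rejected response. beta controls
    how strongly the policy is pushed away from the reference model.
    """
    # Log-ratios of policy vs. reference for chosen and rejected responses
    chosen_logratios = policy_chosen_logps - reference_chosen_logps
    rejected_logratios = policy_rejected_logps - reference_rejected_logps

    # DPO objective: increase the margin between chosen and rejected log-ratios
    logits = beta * (chosen_logratios - rejected_logratios)
    loss = -F.logsigmoid(logits).mean()

    # Implicit rewards are commonly tracked to monitor training progress
    chosen_rewards = beta * chosen_logratios.detach()
    rejected_rewards = beta * rejected_logratios.detach()
    return loss, chosen_rewards, rejected_rewards
```

In practice, the per-sequence log-probabilities are obtained by summing (or averaging) the token-level log-probabilities of each response under the corresponding model, and the loss is minimized with respect to the policy model only.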