yujunjun/LLMs-from-scratch

mirror of https://github.com/rasbt/LLMs-from-scratch.git synced 2025-10-10 15:40:08 +00:00

rasbt 06151a809e

note about logistic sigmoid

2024-08-06 19:48:30 -05:00

| File | Last commit | Date |
| --- | --- | --- |
| create-preference-data-ollama.ipynb | Fix 8-billion-parameter spelling | 2024-07-28 10:48:56 -05:00 |
| dpo-from-scratch.ipynb | note about logistic sigmoid | 2024-08-06 19:48:30 -05:00 |
| instruction-data-with-preference.json | Generate preference dataset with Llama 3.1 70B (#289) | 2024-07-27 09:44:04 -05:00 |
| previous_chapters.py | Direct Preference Optimization from scratch (#294) | 2024-08-04 08:57:36 -05:00 |
| README.md | Direct Preference Optimization from scratch (#294) | 2024-08-04 08:57:36 -05:00 |

README.md

Chapter 7: Finetuning to Follow Instructions

create-preference-data-ollama.ipynb: A notebook that creates a synthetic dataset for preference finetuning using Llama 3.1 and Ollama
dpo-from-scratch.ipynb: This notebook implements Direct Preference Optimization (DPO) for LLM alignment
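
For orientation, the core of DPO can be sketched as a loss on a single preference pair. This is a minimal pure-Python illustration of the standard DPO objective, not the notebook's own code: the function names (`log_sigmoid`, `dpo_loss`), the scalar log-probability inputs, and the default `beta=0.1` are all assumptions made for this example. The notebook itself may well work with batched tensors instead.

```python
import math


def log_sigmoid(x: float) -> float:
    """Numerically stable log of the logistic sigmoid, log(1 / (1 + exp(-x)))."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def dpo_loss(
    policy_chosen_logp: float,
    policy_rejected_logp: float,
    ref_chosen_logp: float,
    ref_rejected_logp: float,
    beta: float = 0.1,  # hypothetical default; controls deviation from the reference model
) -> float:
    """DPO loss for one (chosen, rejected) response pair.

    Each argument is the summed log-probability of a full response under
    either the policy being trained or the frozen reference model.
    """
    # How much more the policy prefers the chosen response over the rejected one
    policy_margin = policy_chosen_logp - policy_rejected_logp
    # The same preference margin under the reference model
    ref_margin = ref_chosen_logp - ref_rejected_logp
    # The loss shrinks as the policy's margin grows beyond the reference margin
    logits = beta * (policy_margin - ref_margin)
    return -log_sigmoid(logits)
```

When the policy and the reference model assign identical log-probabilities, the loss equals log(2). It falls below that value as the policy shifts probability toward the chosen response relative to the reference model, and rises above it when the policy shifts toward the rejected response.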