mirror of
https://github.com/rasbt/LLMs-from-scratch.git
synced 2025-11-03 03:10:21 +00:00
13 lines
692 B
Markdown
13 lines
692 B
Markdown
# Chapter 7: Finetuning to Follow Instructions
|
|
|
|
- [create-preference-data-ollama.ipynb](create-preference-data-ollama.ipynb): A notebook that creates a synthetic dataset for preference finetuning dataset using Llama 3.1 and Ollama
|
|
|
|
- In progress ...
|
|
|
|
|
|
|
|
In the meantime, also see
|
|
|
|
- LLM Training: RLHF and Its Alternatives, [https://magazine.sebastianraschka.com/p/llm-training-rlhf-and-its-alternatives](https://magazine.sebastianraschka.com/p/llm-training-rlhf-and-its-alternatives)
|
|
- Tips for LLM Pretraining and Evaluating Reward Models, [https://sebastianraschka.com/blog/2024/research-papers-in-march-2024.html](https://sebastianraschka.com/blog/2024/research-papers-in-march-2024.html)
|