Mirror of https://github.com/rasbt/LLMs-from-scratch.git (synced 2025-09-25 16:17:10 +00:00)
Chapter 7: Finetuning to Follow Instructions
This chapter is in progress.
In the meantime, see:
- LLM Training: RLHF and Its Alternatives, https://magazine.sebastianraschka.com/p/llm-training-rlhf-and-its-alternatives
- Tips for LLM Pretraining and Evaluating Reward Models, https://sebastianraschka.com/blog/2024/research-papers-in-march-2024.html