5 Commits

Author SHA1 Message Date
Sebastian Raschka
09dc080cf3 Direct Preference Optimization from scratch (#294) 2024-08-04 08:57:36 -05:00
Sebastian Raschka
2b7bd46a93 Generate preference dataset with Llama 3.1 70B (#289) 2024-07-27 09:44:04 -05:00
Sebastian Raschka
1002f3079d Update README.md 2024-06-23 08:25:01 -05:00
rasbt
b80e7804b3 add instruction dataset 2024-06-08 10:38:41 -05:00
rasbt
9f8c3f2b35 Ollama-based model evaluation 2024-06-05 08:21:28 -05:00