Daniel Kleine ff31b345b0
ch05/07 gpt_to_llama text improvements (#369)
* fixed typo

* fixed RMSnorm formula

* fixed SwiGLU formula

* temperature=0 for untrained model for reproducibility

* added extra info hf token
2024-09-24 18:45:49 -05:00
..
2024-09-23 08:56:16 -05:00
2024-09-23 07:34:06 -05:00
2024-09-23 07:34:06 -05:00
2024-09-23 07:34:06 -05:00

Converting GPT to Llama

This folder contains code for converting the GPT implementation from chapter 4 and 5 to Meta AI's Llama architecture: