casinca 152a087a37
removing unused RoPE parameters (#590)
* removing unused RoPE parameters

* remove redundant context_length in GQA

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2025-03-31 17:10:39 -05:00
..
2024-09-23 08:56:16 -05:00
2024-09-23 07:34:06 -05:00
2024-10-05 07:30:47 -05:00

Converting GPT to Llama

This folder contains code for converting the GPT implementation from chapter 4 and 5 to Meta AI's Llama architecture in the following recommended reading order: