2 Commits

Author SHA1 Message Date
casinca
152a087a37
removing unused RoPE parameters (#590)
* removing unused RoPE parameters

* remove redundant context_length in GQA

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2025-03-31 17:10:39 -05:00
Sebastian Raschka
0f6894f41e
Memory optimized Llama (#588)
* Memory optimized Llama

* re-add login
2025-03-30 15:18:12 -05:00