4 Commits

Author SHA1 Message Date
Sebastian Raschka
97a199e40b Disable mask saving as weight in Llama 3 model (#604)
* Disable mask saving as weight

* update pixi

* update pixi
2025-04-06 09:33:36 -05:00
Sebastian Raschka
396e96ab07 Fix Llama language typo in bonus materials (#597) 2025-04-02 21:41:36 -05:00
casinca
d7c316533a removing unused RoPE parameters (#590)
* removing unused RoPE parameters

* remove redundant context_length in GQA

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2025-03-31 17:10:39 -05:00
Sebastian Raschka
4e3b752e5e Memory optimized Llama (#588)
* Memory optimized Llama

* re-ad login
2025-03-30 15:18:12 -05:00