Sebastian Raschka
|
97a199e40b
|
Disable mask saving as weight in Llama 3 model (#604)
* Disable mask saving as weight
* update pixi
* update pixi
|
2025-04-06 09:33:36 -05:00 |
|
Sebastian Raschka
|
396e96ab07
|
Fix Llama language typo in bonus materials (#597)
|
2025-04-02 21:41:36 -05:00 |
|
casinca
|
d7c316533a
|
removing unused RoPE parameters (#590)
* removing unused RoPE parameters
* remove redundant context_length in GQA
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2025-03-31 17:10:39 -05:00 |
|
Sebastian Raschka
|
4e3b752e5e
|
Memory optimized Llama (#588)
* Memory optimized Llama
* re-add login
|
2025-03-30 15:18:12 -05:00 |
|