Sebastian Raschka
|
67e0680210
|
Disable mask saving as weight in Llama 3 model (#604)
* Disable mask saving as weight
* update pixi
* update pixi
|
2025-04-06 09:33:36 -05:00 |
|
Sebastian Raschka
|
d4c8d8f2c9
|
Fix Llama language typo in bonus materials (#597)
|
2025-04-02 21:41:36 -05:00 |
|
casinca
|
152a087a37
|
removing unused RoPE parameters (#590)
* removing unused RoPE parameters
* remove redundant context_length in GQA
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2025-03-31 17:10:39 -05:00 |
|
Sebastian Raschka
|
0f6894f41e
|
Memory optimized Llama (#588)
* Memory optimized Llama
* re-add login
|
2025-03-30 15:18:12 -05:00 |
|