Sebastian Raschka c4cde1c21b
Reduce Llama 3 RoPE memory requirements (#658)
* Llama3 from scratch improvements

* Fix Llama 3 expensive RoPE memory issue

* updates

* update package

* benchmark

* remove unused rescale_theta
2025-06-12 11:08:02 -05:00
..
2025-03-23 19:28:49 -05:00
2025-03-23 19:28:49 -05:00
2025-03-23 19:28:49 -05:00
2025-03-23 19:28:49 -05:00
2025-03-27 14:00:25 -05:00
2025-03-27 14:00:25 -05:00
2025-03-27 14:00:25 -05:00
2025-03-23 19:28:49 -05:00
2025-03-23 19:28:49 -05:00