7 Commits

Author SHA1 Message Date
Sebastian Raschka
0b15a00574 Qwen3 KV cache (#688) 2025-06-21 17:34:39 -05:00
Sebastian Raschka
9d62ca0598 Llama 3 KV Cache (#685)
* Llama 3 KV Cache

* skip expensive tests on Gh actions

* Update __init__.py
2025-06-21 10:55:20 -05:00
Sebastian Raschka
a3c4c33347 Reduce Llama 3 RoPE memory requirements (#658)
* Llama3 from scratch improvements

* Fix Llama 3 expensive RoPE memory issue

* updates

* update package

* benchmark

* remove unused rescale_theta
2025-06-12 11:08:02 -05:00
Sebastian Raschka
3eca919a52 Llama3 from scratch improvements (#621)
* Llama3 from scratch improvements

* restore
2025-04-16 18:08:26 -05:00
Sebastian Raschka
4128a91c1d Add Llama 3.2 to pkg (#591)
* Add Llama 3.2 to pkg

* remove redundant attributes

* update tests

* updates

* updates

* updates

* fix link

* fix link
2025-03-31 18:59:47 -05:00
Sebastian Raschka
6f86c78763 Implement Llama 3.2 (#383) 2024-10-05 07:30:47 -05:00
Sebastian Raschka
c38b003aa9 GPT to Llama (#368)
* GPT to Llama

* fix urls
2024-09-23 07:34:06 -05:00