6 Commits

Author SHA1 Message Date
Sebastian Raschka
f596aab0cb
Simplify KV cache usage (#728)
* Simplify KV cache usage

* Swap mark text with ghostwriter
2025-07-10 12:52:29 -05:00
Sebastian Raschka
c4ec55edac
Support different Qwen3 sizes in pkg (#714) 2025-06-28 08:00:23 -05:00
Sebastian Raschka
81eda38d3b
Improve KV cache code for torch.compile (#705)
* Improve KV cache code for torch.compile

* cleanup

* cleanup
2025-06-23 18:08:49 -05:00
Sebastian Raschka
0a2e8c39c4
Qwen3 KV cache (#688) 2025-06-21 17:34:39 -05:00
Sebastian Raschka
fdc3e1b701
Add GPT-2 KV cache to pkg (#687) 2025-06-21 12:29:04 -05:00
Sebastian Raschka
3be0f3202a
Llama 3 KV Cache (#685)
* Llama 3 KV Cache

* skip expensive tests on Gh actions

* Update __init__.py
2025-06-21 10:55:20 -05:00