3 Commits

Author SHA1 Message Date
Sebastian Raschka
dc2f8e95d4 Support different Qwen3 sizes in pkg (#714) 2025-06-28 08:00:23 -05:00
Sebastian Raschka
58b30e2f7b Improve KV cache code for torch.compile (#705)
* Improve KV cache code for torch.compile

* cleanup

* cleanup
2025-06-23 18:08:49 -05:00
Sebastian Raschka
0b15a00574 Qwen3 KV cache (#688) 2025-06-21 17:34:39 -05:00