4 Commits

Author SHA1 Message Date
Martin Ma
6522be94be
Fix bug in masking when kv cache is used. (#697)
* Fix bug in masking when kv cache is used.

* add tests

* add tests

* upd

* add kv cache test to gh workflow

* explicit mask slicing

* upd

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2025-06-23 13:12:56 -05:00
Sebastian Raschka
fdc3e1b701
Add GPT-2 KV cache to pkg (#687)
2025-06-21 12:29:04 -05:00
Sebastian Raschka
ece59ba587
Optimize KV cache (#673)
* Optimize KV cache

* style

* interpretable generate

* interpretable generate

* update readme
2025-06-16 16:00:50 -05:00
Sebastian Raschka
ba0370abd1
Optimized KV cache (#672)
* Optimized KV cache

* typo fix
2025-06-15 14:26:16 -05:00