Martin Ma
|
6522be94be
|
Fix bug in masking when kv cache is used. (#697)
* Fix bug in masking when kv cache is used.
* add tests
* dd tests
* upd
* add kv cache test to gh workflow
* explicit mask slicing
* upd
---------
Co-authored-by: rasbt <mail@sebastianraschka.com>
|
2025-06-23 13:12:56 -05:00 |
|
Sebastian Raschka
|
fdc3e1b701
|
Add GPT-2 KV cache to pkg (#687)
|
2025-06-21 12:29:04 -05:00 |
|
Sebastian Raschka
|
ece59ba587
|
Optimize KV cache (#673)
* Optimize KV cache
* style
* interpretable generate
* interpretable generate
* update readme
|
2025-06-16 16:00:50 -05:00 |
|
Sebastian Raschka
|
ba0370abd1
|
Optimized KV cache (#672)
* Optimized KV cache
* typo fix
|
2025-06-15 14:26:16 -05:00 |
|