Martin Ma
|
ad16b1fbee
|
Fix bug in masking when kv cache is used. (#697)
* Fix bug in masking when kv cache is used.
* add tests
* add tests
* upd
* add kv cache test to gh workflow
* explicit mask slicing
* upd
---------
Co-authored-by: rasbt <mail@sebastianraschka.com>
|
2025-06-23 13:12:56 -05:00 |
|
Sebastian Raschka
|
bb57756444
|
Add GPT-2 KV cache to pkg (#687)
|
2025-06-21 12:29:04 -05:00 |
|
Sebastian Raschka
|
c488578cae
|
Optimize KV cache (#673)
* Optimize KV cache
* style
* interpretable generate
* interpretable generate
* update readme
|
2025-06-16 16:00:50 -05:00 |
|
Sebastian Raschka
|
e704b5fa50
|
Optimized KV cache (#672)
* Optimized KV cache
* typo fix
|
2025-06-15 14:26:16 -05:00 |
|