3 Commits

Author SHA1 Message Date
Sebastian Raschka
bcc73f731d
n_heads × d_head -> d_head × d_head in DeltaNet (#903)
Clarified the explanation of the memory size calculation for `KV_cache_DeltaNet` and updated the quadratic term from `n_heads × d_head` to `d_head × d_head`.
2025-11-05 18:28:37 -06:00
Sebastian Raschka
488bef7e3f
Image resizing
2025-11-02 21:05:38 -06:00
Sebastian Raschka
c6b8332a59
Gated DeltaNet write-up (#901)
* Gated DeltaNet write-up

* Add copyright and source information to script

Added copyright notice and source information.

* Remove unused import of Path in plot_memory_estimates

* Fix url
2025-11-02 21:03:42 -06:00