Commit 9bb203b1b7 by Sebastian Raschka: Einsum multi-head attention (#345), 2024-09-05


More Efficient Multi-Head Attention Implementations

Summary

The figures below summarize the performance benchmarks (lower is better).

[Figure: Forward pass only]

[Figure: Forward and backward pass]

[Figure: Forward and backward pass after compilation]
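The benchmarked implementations themselves are not shown in this summary. As a point of reference, here is a minimal sketch of the general einsum technique named in the commit: causal multi-head attention where the per-head score and context computations are expressed with `torch.einsum`. The class and parameter names are illustrative, not necessarily those used in the repository.

```python
import torch
import torch.nn as nn


class EinsumMHA(nn.Module):
    """Illustrative causal multi-head attention using torch.einsum."""

    def __init__(self, d_in, d_out, num_heads):
        super().__init__()
        assert d_out % num_heads == 0
        self.num_heads = num_heads
        self.head_dim = d_out // num_heads
        self.W_q = nn.Parameter(torch.randn(d_in, d_out) * 0.02)
        self.W_k = nn.Parameter(torch.randn(d_in, d_out) * 0.02)
        self.W_v = nn.Parameter(torch.randn(d_in, d_out) * 0.02)

    def forward(self, x):
        b, t, _ = x.shape
        # Project inputs, then split the output dim into (heads, head_dim)
        q = torch.einsum("btd,dk->btk", x, self.W_q).view(b, t, self.num_heads, self.head_dim)
        k = torch.einsum("btd,dk->btk", x, self.W_k).view(b, t, self.num_heads, self.head_dim)
        v = torch.einsum("btd,dk->btk", x, self.W_v).view(b, t, self.num_heads, self.head_dim)
        # Scaled dot-product scores per head: (b, heads, query_pos, key_pos)
        scores = torch.einsum("bqhd,bkhd->bhqk", q, k) / self.head_dim ** 0.5
        # Causal mask: each position may only attend to itself and earlier positions
        mask = torch.triu(torch.ones(t, t, dtype=torch.bool, device=x.device), diagonal=1)
        scores = scores.masked_fill(mask, float("-inf"))
        weights = torch.softmax(scores, dim=-1)
        # Weighted sum over values, then merge heads back into one dimension
        out = torch.einsum("bhqk,bkhd->bqhd", weights, v)
        return out.reshape(b, t, self.num_heads * self.head_dim)


mha = EinsumMHA(d_in=16, d_out=32, num_heads=4)
x = torch.randn(2, 5, 16)
out = mha(x)
print(out.shape)  # torch.Size([2, 5, 32])
```

The "after compilation" figure presumably refers to wrapping such a module with `torch.compile(mha)`, which fuses the einsum and softmax kernels and typically narrows the gap between implementation variants.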