
More Efficient Multi-Head Attention Implementations

Summary

The figures below summarize the performance benchmarks (lower is better).

[Figure: Forward pass only]

[Figure: Forward and backward pass]

[Figure: Forward and backward pass after compilation]
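The benchmarks above time each attention variant in two modes: running only the forward pass, and running the forward pass followed by a backward pass (gradient computation). A minimal sketch of how such a comparison can be set up is shown below; the `scaled_dot_product_attention` and `bench` helpers are illustrative names, not the chapter's actual classes, and the built-in `torch.nn.functional.scaled_dot_product_attention` (available since PyTorch 2.0) stands in for the optimized variants compared in the figures.

```python
import time
import torch
import torch.nn.functional as F

def scaled_dot_product_attention(q, k, v):
    # Naive attention: softmax(Q K^T / sqrt(d_k)) V
    d_k = q.shape[-1]
    scores = q @ k.transpose(-2, -1) / d_k**0.5
    return torch.softmax(scores, dim=-1) @ v

def bench(fn, *args, backward=False, reps=10):
    # Wall-clock timing; a rough CPU proxy for the GPU benchmarks
    # summarized in the figures (proper GPU timing would also need
    # torch.cuda.synchronize() and warmup runs).
    start = time.perf_counter()
    for _ in range(reps):
        out = fn(*args)
        if backward:
            out.sum().backward()
    return (time.perf_counter() - start) / reps

batch, heads, seq, d = 2, 4, 64, 32
q = torch.randn(batch, heads, seq, d, requires_grad=True)
k = torch.randn(batch, heads, seq, d, requires_grad=True)
v = torch.randn(batch, heads, seq, d, requires_grad=True)

# Both implementations should agree numerically (no mask, no dropout)
assert torch.allclose(
    scaled_dot_product_attention(q, k, v),
    F.scaled_dot_product_attention(q, k, v),
    atol=1e-5,
)

for name, fn in [("naive", scaled_dot_product_attention),
                 ("F.sdpa", F.scaled_dot_product_attention)]:
    fwd = bench(fn, q, k, v)
    fwd_bwd = bench(fn, q, k, v, backward=True)
    print(f"{name}: forward {fwd*1e3:.2f} ms | fwd+bwd {fwd_bwd*1e3:.2f} ms")
```

The third benchmark mode additionally wraps the model in `torch.compile` before timing, which fuses kernels and typically shifts the relative rankings again.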