# More Efficient Multi-Head Attention Implementations
- [mha-implementations.ipynb](mha-implementations.ipynb) contains several implementations of multi-head attention and compares them
<a href="mha-implementations.ipynb"><img src="https://sebastianraschka.com/images/LLMs-from-scratch-images/bonus/mha-benchmark/mha-comparison.webp" width="500px"></a>
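
To illustrate what "different implementations" of the same layer can mean, here is a minimal, dependency-free sketch. It is not taken from the notebook, and all function names are hypothetical. It computes multi-head attention two ways: once with separate weight matrices per head, and once with a single wide projection that is split into heads afterwards. It then checks that both give the same result. The causal mask is an assumption (common in decoder-style LLMs), and the output projection and dropout are omitted for brevity.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over a list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def causal_attention(q, k, v):
    """Scaled dot-product attention for one head; token i sees positions 0..i."""
    d = len(q[0])
    out = []
    for i, q_i in enumerate(q):
        scores = [sum(a * b for a, b in zip(q_i, k[j])) / math.sqrt(d)
                  for j in range(i + 1)]
        weights = softmax(scores)
        out.append([sum(w * v[j][c] for j, w in enumerate(weights))
                    for c in range(len(v[0]))])
    return out


def concat_heads(head_outputs, num_tokens):
    """Concatenate per-head outputs along the feature dimension."""
    return [[val for h in head_outputs for val in h[t]] for t in range(num_tokens)]


def mha_per_head(x, heads):
    """Variant 1: every head applies its own (W_q, W_k, W_v) to the input."""
    outs = [causal_attention(matmul(x, wq), matmul(x, wk), matmul(x, wv))
            for wq, wk, wv in heads]
    return concat_heads(outs, len(x))


def mha_combined(x, heads):
    """Variant 2: one wide projection each for Q, K, V, then split into heads."""
    head_dim = len(heads[0][0][0])

    def hstack(mats):
        # Place the per-head weight matrices side by side, column-wise.
        return [[val for m in mats for val in m[r]] for r in range(len(mats[0]))]

    q = matmul(x, hstack([h[0] for h in heads]))
    k = matmul(x, hstack([h[1] for h in heads]))
    v = matmul(x, hstack([h[2] for h in heads]))

    outs = []
    for h in range(len(heads)):
        lo, hi = h * head_dim, (h + 1) * head_dim
        outs.append(causal_attention([r[lo:hi] for r in q],
                                     [r[lo:hi] for r in k],
                                     [r[lo:hi] for r in v]))
    return concat_heads(outs, len(x))


def rand_matrix(rows, cols):
    return [[random.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


random.seed(0)
num_tokens, d_in, num_heads, head_dim = 4, 6, 2, 3  # toy sizes, chosen arbitrarily
x = rand_matrix(num_tokens, d_in)
heads = [tuple(rand_matrix(d_in, head_dim) for _ in range(3)) for _ in range(num_heads)]

out_a = mha_per_head(x, heads)
out_b = mha_combined(x, heads)
max_diff = max(abs(a - b) for ra, rb in zip(out_a, out_b) for a, b in zip(ra, rb))
print("output shape:", (len(out_a), len(out_a[0])))
print("variants agree:", max_diff < 1e-9)
```

Both variants perform the same arithmetic. In a tensor library, the combined form replaces many small matrix multiplications with a few large ones, which is one common reason such variants differ in speed.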