3 lines
183 B
Markdown
Raw Normal View History

2024-03-06 08:30:32 -06:00
# More Efficient Multi-Head Attention Implementations
- [mha-implementations.ipynb](mha-implementations.ipynb) contains and compares different implementations of multi-head attention