LLMs-from-scratch/ch03/02_bonus_efficient-multihead-attention

More Efficient Multi-Head Attention Implementations
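One common way to make multi-head attention more efficient is to fuse the separate query, key, and value projections into a single linear layer and to delegate the attention computation to PyTorch's fused `torch.nn.functional.scaled_dot_product_attention` kernel. The sketch below illustrates this pattern under stated assumptions; the class and parameter names are illustrative, not necessarily the exact code in this directory.

```python
# Hedged sketch of an efficient multi-head attention module (assumed names).
# Efficiency ideas shown: one fused Q/K/V projection matmul and PyTorch's
# fused scaled_dot_product_attention, plus a final output projection layer.
import torch
import torch.nn as nn

class EfficientMHA(nn.Module):
    def __init__(self, d_in, d_out, num_heads, dropout=0.0, qkv_bias=False):
        super().__init__()
        assert d_out % num_heads == 0, "d_out must be divisible by num_heads"
        self.num_heads = num_heads
        self.head_dim = d_out // num_heads
        # Single linear layer producing Q, K, and V in one matmul
        self.qkv = nn.Linear(d_in, 3 * d_out, bias=qkv_bias)
        # Output projection layer applied after the heads are recombined
        self.proj = nn.Linear(d_out, d_out)
        self.dropout = dropout

    def forward(self, x):
        b, n, _ = x.shape
        # (b, n, 3*d_out) -> (b, n, 3, heads, head_dim) -> (3, b, heads, n, head_dim)
        qkv = (self.qkv(x)
               .view(b, n, 3, self.num_heads, self.head_dim)
               .permute(2, 0, 3, 1, 4))
        q, k, v = qkv
        # Fused, memory-efficient attention with a causal mask
        ctx = nn.functional.scaled_dot_product_attention(
            q, k, v, is_causal=True,
            dropout_p=self.dropout if self.training else 0.0,
        )
        # (b, heads, n, head_dim) -> (b, n, d_out), then project
        ctx = ctx.transpose(1, 2).reshape(b, n, self.num_heads * self.head_dim)
        return self.proj(ctx)
```

A quick shape check: for a batch of 2 sequences of length 8 with `d_in = d_out = 64` and 4 heads, the module maps a `(2, 8, 64)` input to a `(2, 8, 64)` output.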