LLMs-from-scratch/ch03/02_bonus_efficient-multihead-attention
(from Sebastian Raschka's LLMs-from-scratch repository)

More Efficient Multi-Head Attention Implementations

Summary

The figures below summarize the performance benchmarks for the different implementations (lower is better).

- Figure: Forward pass only
- Figure: Forward and backward pass
- Figure: Forward and backward pass after compilation
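The benchmarked implementations differ mainly in how the per-head attention computation is organized. As a framework-free sketch of the core idea (assuming NumPy; the function names `mha_loop` and `mha_batched` are illustrative, not from the repository's notebooks), the snippet below contrasts a per-head Python loop with a single batched matmul over all heads, which computes the same result but avoids Python-level loop overhead:

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def mha_loop(q, k, v):
    # q, k, v: (batch, heads, seq, head_dim); one attention call per head
    b, h, t, d = q.shape
    out = np.empty_like(q)
    for i in range(h):
        scores = q[:, i] @ k[:, i].transpose(0, 2, 1) / np.sqrt(d)
        out[:, i] = softmax(scores) @ v[:, i]
    return out

def mha_batched(q, k, v):
    # same computation as mha_loop, but as one batched matmul over all heads
    d = q.shape[-1]
    scores = q @ k.transpose(0, 1, 3, 2) / np.sqrt(d)
    return softmax(scores) @ v

rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((2, 4, 8, 16)) for _ in range(3))
print(np.allclose(mha_loop(q, k, v), mha_batched(q, k, v)))  # True
```

In PyTorch, the batched variant additionally benefits from fused kernels (e.g. `torch.nn.functional.scaled_dot_product_attention`) and from `torch.compile`, which is what the "after compilation" benchmark measures.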