
# Chapter 3: Coding Attention Mechanisms
- 01_main-chapter-code contains the main chapter code.
- 02_bonus_efficient-multihead-attention implements and compares several variants of multi-head attention.
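For orientation, the core operation the chapter builds up to is scaled dot-product attention. The following is a minimal NumPy sketch of a single attention head (not the book's code, which uses PyTorch; the function name and the self-attention usage with `q = k = v = x` are illustrative assumptions):

```python
import numpy as np

def scaled_dot_product_attention(q, k, v):
    # q, k, v: arrays of shape (seq_len, d)
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d)                  # (seq_len, seq_len) similarity scores
    scores -= scores.max(axis=-1, keepdims=True)   # subtract row max for numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True) # softmax: each row sums to 1
    return weights @ v                             # weighted sum of value vectors, (seq_len, d)

# Self-attention: queries, keys, and values all come from the same input
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))                    # 4 tokens, embedding dim 8
out = scaled_dot_product_attention(x, x, x)
print(out.shape)                                   # (4, 8)
```

Multi-head attention runs several such heads in parallel on lower-dimensional projections and concatenates their outputs; the bonus folder compares different ways of implementing that efficiently.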