mirror of https://github.com/rasbt/LLMs-from-scratch.git synced 2025-08-18 05:31:40 +00:00

History

Jeroen Van Goey 76e6910a1a

* typo fix

* Update ch03/02_bonus_efficient-multihead-attention/mha-implementations.ipynb

* Update ch03/02_bonus_efficient-multihead-attention/mha-implementations.ipynb

* Update ch03/02_bonus_efficient-multihead-attention/mha-implementations.ipynb

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>

2024-08-12 07:54:12 -05:00

01_main-chapter-code

Test code in pytorch 2.4 (#285 )

2024-07-24 21:53:41 -05:00

02_bonus_efficient-multihead-attention

Small typo fix (#313 )

2024-08-12 07:54:12 -05:00

03_understanding-buffers

Update README.md

2024-07-30 06:55:41 -05:00

README.md

Understanding PyTorch Buffers (#288 )

2024-07-26 08:45:36 -05:00

README.md

Chapter 3: Coding Attention Mechanisms

Main Chapter Code

01_main-chapter-code contains the main chapter code.

Bonus Materials

02_bonus_efficient-multihead-attention implements and compares different implementation variants of multihead-attention
03_understanding-buffers explains the idea behind PyTorch buffers, which are used to implement the causal attention mechanism in chapter 3