# Chapter 3: Coding Attention Mechanisms

## Main Chapter Code
- 01_main-chapter-code contains the main chapter code.

## Bonus Materials
- 02_bonus_efficient-multihead-attention implements and compares different variants of multi-head attention (a minimal sketch of one such variant follows this list)
- 03_understanding-buffers explains the idea behind PyTorch buffers, which are used to implement the causal attention mechanism in Chapter 3 (a short buffer sketch appears after the attention sketch below)
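
For orientation, here is a minimal sketch of one efficient multi-head attention variant, built on PyTorch's fused `scaled_dot_product_attention` kernel (available since PyTorch 2.0). The class name, weight layout, and hyperparameters are illustrative assumptions, not the exact code from the bonus notebook:

```python
import torch
import torch.nn as nn

class MultiHeadAttention(nn.Module):
    # Illustrative sketch; d_in/d_out/num_heads are assumed hyperparameters
    def __init__(self, d_in, d_out, num_heads, dropout=0.0):
        super().__init__()
        assert d_out % num_heads == 0, "d_out must be divisible by num_heads"
        self.num_heads = num_heads
        self.head_dim = d_out // num_heads
        self.W_query = nn.Linear(d_in, d_out, bias=False)
        self.W_key = nn.Linear(d_in, d_out, bias=False)
        self.W_value = nn.Linear(d_in, d_out, bias=False)
        self.out_proj = nn.Linear(d_out, d_out)
        self.dropout = dropout

    def forward(self, x):
        b, num_tokens, _ = x.shape
        # Project inputs and split into heads: (b, num_heads, num_tokens, head_dim)
        q = self.W_query(x).view(b, num_tokens, self.num_heads, self.head_dim).transpose(1, 2)
        k = self.W_key(x).view(b, num_tokens, self.num_heads, self.head_dim).transpose(1, 2)
        v = self.W_value(x).view(b, num_tokens, self.num_heads, self.head_dim).transpose(1, 2)
        # Fused attention kernel; is_causal=True applies the causal mask internally
        ctx = nn.functional.scaled_dot_product_attention(
            q, k, v,
            dropout_p=self.dropout if self.training else 0.0,
            is_causal=True,
        )
        # Merge heads back: (b, num_tokens, d_out)
        ctx = ctx.transpose(1, 2).reshape(b, num_tokens, -1)
        return self.out_proj(ctx)
```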
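
And here is a minimal sketch of the buffer idea: `register_buffer` stores a non-trainable tensor (here, a causal mask) that moves with the module via `.to(device)` and is saved in the `state_dict`, but receives no gradients. The `CausalSelfAttention` name and shapes are illustrative assumptions:

```python
import torch
import torch.nn as nn

class CausalSelfAttention(nn.Module):
    # Illustrative sketch of a single-head causal attention layer
    def __init__(self, d_in, d_out, context_length):
        super().__init__()
        self.W_query = nn.Linear(d_in, d_out, bias=False)
        self.W_key = nn.Linear(d_in, d_out, bias=False)
        self.W_value = nn.Linear(d_in, d_out, bias=False)
        # Upper-triangular boolean mask registered as a buffer:
        # not a parameter, but device placement and saving are handled for us
        self.register_buffer(
            "mask",
            torch.triu(torch.ones(context_length, context_length), diagonal=1).bool(),
        )

    def forward(self, x):
        b, num_tokens, _ = x.shape
        q, k, v = self.W_query(x), self.W_key(x), self.W_value(x)
        # Scaled dot-product attention scores: (b, num_tokens, num_tokens)
        scores = q @ k.transpose(1, 2) / k.shape[-1] ** 0.5
        # Mask out attention to future tokens before the softmax
        scores = scores.masked_fill(self.mask[:num_tokens, :num_tokens], float("-inf"))
        weights = torch.softmax(scores, dim=-1)
        return weights @ v
```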