LLMs-from-scratch/ch03/README.md

# Chapter 3: Coding Attention Mechanisms

## Main Chapter Code

- [01_main-chapter-code](01_main-chapter-code) contains the main chapter code.

## Bonus Materials

- [02_bonus_efficient-multihead-attention](02_bonus_efficient-multihead-attention) implements and compares different implementation variants of multihead-attention
- [03_understanding-buffers](03_understanding-buffers) explains the idea behind PyTorch buffers, which are used to implement the causal attention mechanism in chapter 3
add and update readme files 2024-02-05 06:51:58 -06:00			`# Chapter 3: Coding Attention Mechanisms`
add ch03 and TOC 2023-12-09 17:13:56 -06:00
distinguish better between main chapter code and bonus materials 2024-06-11 21:07:42 -05:00			`## Main Chapter Code`

mha variants 2024-03-06 08:30:32 -06:00			`- [01_main-chapter-code](01_main-chapter-code) contains the main chapter code.`
distinguish better between main chapter code and bonus materials 2024-06-11 21:07:42 -05:00
			`## Bonus Materials`

Understanding PyTorch Buffers (#288) 2024-07-26 08:45:36 -05:00			`- [02_bonus_efficient-multihead-attention](02_bonus_efficient-multihead-attention) implements and compares different implementation variants of multihead-attention`
			`- [03_understanding-buffers](03_understanding-buffers) explains the idea behind PyTorch buffers, which are used to implement the causal attention mechanism in chapter 3`