Sebastian Raschka | bf039ff3dc | Add alternative attention structure (#880) | 2025-10-13 14:31:13 -05:00 |
Sebastian Raschka | 6eb6adfa33 | sliding window attention (#879) | 2025-10-12 22:13:20 -05:00 |
Sebastian Raschka | 9b9586688d | Multi-Head Latent Attention (#876) * Multi-Head Latent Attention * update | 2025-10-11 20:08:30 -05:00 |
Sebastian Raschka | 2af686d70b | Add KV cache (#671) | 2025-06-15 09:58:08 -05:00 |
Sebastian Raschka | 73f4342664 | add ch04 code along video (#573) | 2025-03-17 11:20:55 -05:00 |
Sebastian Raschka | b6c4b2f9f1 | Update bonus section formatting (#400) | 2024-10-12 10:26:08 -05:00 |
Sebastian Raschka | 5944ab0678 | Update README.md | 2024-06-22 12:09:02 -05:00 |
rasbt | 283397aaf2 | add main and optional sections | 2024-06-19 17:48:25 -05:00 |
rasbt | e24fd98cdf | distinguish better between main chapter code and bonus materials | 2024-06-11 21:07:42 -05:00 |
rasbt | e5e6aaf9f1 | flops analysis | 2024-05-23 20:35:41 -05:00 |
rasbt | f24da86abe | title case | 2024-03-27 07:30:09 -05:00 |
rasbt | 3a5fc79b38 | add and update readme files | 2024-02-05 06:51:58 -06:00 |