19 Commits

Author SHA1 Message Date
Sebastian Raschka
4adb96d7ee
Make code more consistent and add projection layer (#131)
* Make code more consistent and add projection

* remove redundant buffer
2024-04-26 17:13:08 -05:00
Sebastian Raschka
2de60d1bfb
Rename variable to context_length to make it easier on readers (#106)
* rename to context length

* fix spacing
2024-04-04 07:27:41 -05:00
Sebastian Raschka
cf39abac04 Add and link bonus material (#84) 2024-03-23 07:27:43 -05:00
Sebastian Raschka
a2cd8436cb Ch05 supplementary code (#81) 2024-03-19 09:26:26 -05:00
rasbt
0d517e98b9 update 2024-03-13 08:37:54 -05:00
rasbt
569f6bc7f0 benchmark numbers 2024-03-13 07:12:10 -05:00
taihaozesong
f1fa9df15c Fix mha wrapper implementations in ch03 bonus 2024-03-13 18:02:26 +08:00
rasbt
321f3d33f9 add cuda warmup 2024-03-10 10:31:55 -05:00
rasbt
da33ce8054 remove redundant unsqueeze in mask 2024-03-09 17:42:31 -06:00
rasbt
6ba97adaee add PyTorch version 2024-03-09 17:42:30 -06:00
rasbt
5ca60321c4 add a100 numbers 2024-03-09 10:20:08 -06:00
rasbt
29ca41799a use need_weights=False 2024-03-09 10:09:17 -06:00
rasbt
5643c88db9 add pytorch mha 2024-03-08 09:30:55 -06:00
rasbt
404f48aa74 automatically run on gpu or cpu 2024-03-07 20:14:03 -06:00
rasbt
99a5e28def rename q,k,v for consistency with chapter 3 2024-03-07 06:30:40 -06:00
Rayed Bin Wahed
496079c61e Update mha-implementations.ipynb
Fix variable spelling in comments to keep consistent with code
2024-03-06 23:03:57 +08:00
rasbt
b6fe1a37b3 also add simple wrapper 2024-03-06 08:38:53 -06:00
rasbt
571377a2d6 update title 2024-03-06 08:34:04 -06:00
rasbt
87fcfd9245 mha variants 2024-03-06 08:30:32 -06:00