Daniel Kleine
|
c65928f7dc
|
added std error bars (#320)
* added std error bars
* fixed changes
* Update on A100
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-08-13 20:57:41 -05:00 |
|
Jeroen Van Goey
|
76e6910a1a
|
Small typo fix (#313)
* typo fix
* Update ch03/02_bonus_efficient-multihead-attention/mha-implementations.ipynb
* Update ch03/02_bonus_efficient-multihead-attention/mha-implementations.ipynb
* Update ch03/02_bonus_efficient-multihead-attention/mha-implementations.ipynb
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-08-12 07:54:12 -05:00 |
|
Sebastian Raschka
|
3f6652d87e
|
update attention benchmarks (#307)
|
2024-08-10 09:44:11 -05:00 |
|
Jeroen Van Goey
|
48bd72c890
|
fix typos, add codespell pre-commit hook (#264)
* fix typos, add codespell pre-commit hook
* Update .pre-commit-config.yaml
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-07-16 07:07:04 -05:00 |
|
rasbt
|
71831890a0
|
update mha dim
|
2024-05-27 07:46:29 -05:00 |
|
rasbt
|
050c8b7b73
|
update pr
|
2024-05-26 15:38:35 -05:00 |
|
rasbt
|
98d453b666
|
update formatting
|
2024-05-24 07:20:37 -05:00 |
|
Sebastian Raschka
|
4adb96d7ee
|
Make code more consistent and add projection layer (#131)
* Make code more consistent and add projection
* remove redundant buffer
|
2024-04-26 17:13:08 -05:00 |
|
Sebastian Raschka
|
2de60d1bfb
|
Rename variable to context_length to make it easier on readers (#106)
* rename to context length
* fix spacing
|
2024-04-04 07:27:41 -05:00 |
|
Sebastian Raschka
|
cf39abac04
|
Add and link bonus material (#84)
|
2024-03-23 07:27:43 -05:00 |
|
Sebastian Raschka
|
a2cd8436cb
|
Ch05 supplementary code (#81)
|
2024-03-19 09:26:26 -05:00 |
|
rasbt
|
0d517e98b9
|
update
|
2024-03-13 08:37:54 -05:00 |
|
rasbt
|
569f6bc7f0
|
benchmark numbers
|
2024-03-13 07:12:10 -05:00 |
|
taihaozesong
|
f1fa9df15c
|
Fix mha wrapper implementations in ch03 bonus
|
2024-03-13 18:02:26 +08:00 |
|
rasbt
|
321f3d33f9
|
add cuda warmup
|
2024-03-10 10:31:55 -05:00 |
|
rasbt
|
da33ce8054
|
remove redundant unsqueeze in mask
|
2024-03-09 17:42:31 -06:00 |
|
rasbt
|
6ba97adaee
|
add PyTorch version
|
2024-03-09 17:42:30 -06:00 |
|
rasbt
|
5ca60321c4
|
add a100 numbers
|
2024-03-09 10:20:08 -06:00 |
|
rasbt
|
29ca41799a
|
use need_weights=False
|
2024-03-09 10:09:17 -06:00 |
|
rasbt
|
5643c88db9
|
add pytorch mha
|
2024-03-08 09:30:55 -06:00 |
|
rasbt
|
404f48aa74
|
automatically run on gpu or cpu
|
2024-03-07 20:14:03 -06:00 |
|
rasbt
|
99a5e28def
|
rename q,k,v for consistency with chapter 3
|
2024-03-07 06:30:40 -06:00 |
|
Rayed Bin Wahed
|
496079c61e
|
Update mha-implementations.ipynb
Fix variable spelling in comments to keep consistent with code
|
2024-03-06 23:03:57 +08:00 |
|
rasbt
|
b6fe1a37b3
|
also add simple wrapper
|
2024-03-06 08:38:53 -06:00 |
|
rasbt
|
571377a2d6
|
update title
|
2024-03-06 08:34:04 -06:00 |
|
rasbt
|
87fcfd9245
|
mha variants
|
2024-03-06 08:30:32 -06:00 |
|