rasbt
244137e8a1
amend
2024-03-10 08:05:22 -05:00
rasbt
76205521d7
different dropout behavior on macos and linux
2024-03-10 07:58:10 -05:00
rasbt
73822b8bfa
move ex 3.3 solution outside main chapter
2024-03-10 07:18:24 -05:00
rasbt
da33ce8054
remove redundant unsqueeze in mask
2024-03-09 17:42:31 -06:00
rasbt
3beaea46ce
add lowres figs for better navigation
2024-03-08 07:18:06 -06:00
rasbt
b6fe1a37b3
also add simple wrapper
2024-03-06 08:38:53 -06:00
rasbt
87fcfd9245
mha variants
2024-03-06 08:30:32 -06:00
rasbt
d4754f1bdd
change dim=1 to dim=-1
2024-03-04 18:54:43 -06:00
rasbt
b827bf4eea
remove redundant double-unsequeeze
2024-02-29 08:31:07 -06:00
rasbt
250e6306e2
use attn_scores from sec 3.4 instead of 3.3
2024-02-14 20:23:59 -06:00
Intelligence-Manifesto
6a09e7b03a
**step 2**
...
step 2: According to the context, the formatting here should be **step 2**.
Additionally, it seems that there is a lack of text description for step 1 in this section, as other sections are all labeled with steps 1, 2, 3 in order, clearly indicating the steps.
2024-02-12 18:32:28 +08:00
Intelligence-Manifesto
1278615c25
12 -> 21
...
12 -> 21
2024-02-11 14:17:55 +08:00
rasbt
92896d817c
add toggle for qkv_bias
2024-01-17 07:50:57 -06:00
rasbt
9e85f13ba9
readability improvements
2024-01-15 07:36:19 -06:00
rasbt
a7b4880179
small readability updates
2024-01-14 11:58:42 -06:00
rasbt
c79499572f
update chapter title
2024-01-13 14:51:39 -06:00
rasbt
c400f77f26
update exercise solutions
2024-01-13 14:49:02 -06:00
rasbt
f279134492
small cosmetic fixes and improvements
2024-01-10 08:01:19 -06:00
rasbt
690a1a62b0
add comments to ToC and fix link
2024-01-07 15:13:53 -06:00
rasbt
e113075a16
show normalization explicitely
2024-01-06 19:24:01 -05:00
rasbt
ea4b6c4e5f
add package versions to the top of the notebook
2024-01-01 19:41:18 +01:00
rasbt
4f161bd549
use block size variable in positional embedding layer
2023-12-28 19:05:06 +01:00
Xiaotian Ma
b8901da362
Update ch03.ipynb
2023-12-26 22:41:54 -06:00
Xiaotian Ma
c518adb0b7
Update ch03.ipynb
2023-12-26 22:05:21 -06:00
rasbt
31980a6ef1
add ch03 and TOC
2023-12-09 17:13:56 -06:00