23 Commits

Author SHA1 Message Date
rasbt
267e33cfaf remove redundant import 2024-02-29 19:59:05 -06:00
Rayed Bin Wahed
2fb035435e Update ch04.ipynb
Add missing import
2024-02-27 23:05:36 +08:00
rasbt
f6266c3756 improve code comments 2024-02-27 06:40:35 -06:00
rasbt
3f186ab072 use .shape instead of .size() for consistency 2024-02-25 08:47:25 -06:00
rasbt
f057156181 use smaller number of tokens to emphasize next token prediction goal 2024-02-15 20:09:20 -06:00
rasbt
557ddfc684 make a new example for shortcut connections 2024-02-15 19:34:12 -06:00
rasbt
250e6306e2 use attn_scores from sec 3.4 instead of 3.3 2024-02-14 20:23:59 -06:00
rasbt
231a854ae7 use less ambiguous var name 2024-02-13 07:05:37 -06:00
rasbt
fe332006de ch4 exercise solutions 2024-02-11 11:51:39 -06:00
rasbt
352b83d225 make softmax explicit 2024-02-11 08:42:21 -06:00
rasbt
7d86023fc4 make softmax explicit 2024-02-11 08:41:45 -06:00
rasbt
5840b4b5f8 update name of last section 2024-02-11 07:35:07 -06:00
rasbt
baa8617921 variable name fix 2024-02-10 17:53:54 -06:00
rasbt
496b52f842 format the other GPT architecture sizes 2024-02-10 17:47:56 -06:00
rasbt
10aa2d099d add print statements for illustration purposes 2024-02-10 10:10:14 -06:00
rasbt
5d1d8ce511 add shape information for clarity 2024-02-08 20:16:54 -06:00
rasbt
3a5fc79b38 add and update readme files 2024-02-05 06:51:58 -06:00
rasbt
2b38b63a7a move overview up 2024-02-04 15:57:03 -06:00
rasbt
bb50de7210 adjust figure width 2024-02-04 10:12:11 -06:00
rasbt
1653f6953a adjust figure width 2024-02-04 10:09:36 -06:00
rasbt
ec312e581b add chapter 4 code 2024-02-04 10:02:05 -06:00
rasbt
d261abce4c add forward pass 2024-01-31 08:00:19 -06:00
rasbt
fcb13fd636 add code backbone ch04 2024-01-29 08:14:23 -06:00