46 Commits

Author SHA1 Message Date
Sebastian Raschka
3654571184
align formulas in notes with code (#605) 2025-04-06 16:46:53 -05:00
Greg Gandenberger
b92c0dff89
Add note about context_length (#549)
* Add note about context_length

* update note

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2025-02-27 08:36:41 -06:00
rasbt
1183fd7837
add dropout scaling note 2024-11-06 05:52:47 -06:00
Sebastian Raschka
08040f024c
Test code in pytorch 2.4 (#285)
* test code in pytorch 2.4

* update
2024-07-24 21:53:41 -05:00
rasbt
31806828d0
add links to summary sections 2024-06-29 07:33:26 -05:00
rasbt
c7f892550e
add clarification about :num_tokens 2024-06-29 07:16:42 -05:00
rasbt
5d1fbbbfd2
update dotted line 2024-06-17 20:17:56 -05:00
rasbt
aaa54b10b3
dim-consistency 2024-06-12 19:43:25 -05:00
Sebastian Raschka
c303a7f36d
Explain value truncation in some figures (#199)
* clarify truncation

* typo fix
2024-06-08 13:24:37 -05:00
rasbt
1e12da90e6
clarify truncation 2024-06-08 13:13:43 -05:00
rasbt
98d453b666
update formatting 2024-05-24 07:20:37 -05:00
rasbt
3b57b6d8c4
make consistent with the latest production version 2024-05-18 12:08:39 -05:00
Sebastian Raschka
7740d556a0
Use dim=-1 for consistency (#122) 2024-04-18 05:56:23 -05:00
Sebastian Raschka
2de60d1bfb
Rename variable to context_length to make it easier on readers (#106)
* rename to context length

* fix spacing
2024-04-04 07:27:41 -05:00
rasbt
7d1eadd0be
update notes 2024-04-02 18:27:13 -05:00
Intelligence-Manifesto
96b1fde3f1
Typographical error (#104) 2024-04-02 18:07:21 -05:00
Sebastian Raschka
a2cd8436cb Ch05 supplementary code (#81) 2024-03-19 09:26:26 -05:00
Sebastian Raschka
ca96abac8a Set up basic test gh worklows (#79)
* Set up basic test gh worklows

* update file paths

* env check

* add env check

* Update requirements.txt

* simplify

* upd
2024-03-18 11:58:37 -05:00
rasbt
4fc6de7afa add notes 2024-03-17 09:29:06 -05:00
rasbt
d60da19fd0 add more notes and embed figures externally to save space 2024-03-17 09:08:38 -05:00
Intelligence-Manifesto
d4b4e3d0f0 the above -> the following 2024-03-15 05:00:28 +08:00
rasbt
244137e8a1 amend 2024-03-10 08:05:22 -05:00
rasbt
76205521d7 different dropout behavior on macos and linux 2024-03-10 07:58:10 -05:00
rasbt
73822b8bfa move ex 3.3 solution outside main chapter 2024-03-10 07:18:24 -05:00
rasbt
da33ce8054 remove redundant unsqueeze in mask 2024-03-09 17:42:31 -06:00
rasbt
3beaea46ce add lowres figs for better navigation 2024-03-08 07:18:06 -06:00
rasbt
b6fe1a37b3 also add simple wrapper 2024-03-06 08:38:53 -06:00
rasbt
87fcfd9245 mha variants 2024-03-06 08:30:32 -06:00
rasbt
d4754f1bdd change dim=1 to dim=-1 2024-03-04 18:54:43 -06:00
rasbt
b827bf4eea remove redundant double-unsequeeze 2024-02-29 08:31:07 -06:00
rasbt
250e6306e2 use attn_scores from sec 3.4 instead of 3.3 2024-02-14 20:23:59 -06:00
Intelligence-Manifesto
6a09e7b03a **step 2**
step 2: According to the context, the formatting here should be **step 2**. 
Additionally, it seems that there is a lack of text description for step 1 in this section, as other sections are all labeled with steps 1, 2, 3 in order, clearly indicating the steps.
2024-02-12 18:32:28 +08:00
Intelligence-Manifesto
1278615c25 12 -> 21
12 -> 21
2024-02-11 14:17:55 +08:00
rasbt
92896d817c add toggle for qkv_bias 2024-01-17 07:50:57 -06:00
rasbt
9e85f13ba9 readability improvements 2024-01-15 07:36:19 -06:00
rasbt
a7b4880179 small readability updates 2024-01-14 11:58:42 -06:00
rasbt
c79499572f update chapter title 2024-01-13 14:51:39 -06:00
rasbt
c400f77f26 update exercise solutions 2024-01-13 14:49:02 -06:00
rasbt
f279134492 small cosmetic fixes and improvements 2024-01-10 08:01:19 -06:00
rasbt
690a1a62b0 add comments to ToC and fix link 2024-01-07 15:13:53 -06:00
rasbt
e113075a16 show normalization explicitely 2024-01-06 19:24:01 -05:00
rasbt
ea4b6c4e5f add package versions to the top of the notebook 2024-01-01 19:41:18 +01:00
rasbt
4f161bd549 use block size variable in positional embedding layer 2023-12-28 19:05:06 +01:00
Xiaotian Ma
b8901da362 Update ch03.ipynb 2023-12-26 22:41:54 -06:00
Xiaotian Ma
c518adb0b7 Update ch03.ipynb 2023-12-26 22:05:21 -06:00
rasbt
31980a6ef1 add ch03 and TOC 2023-12-09 17:13:56 -06:00