490 Commits

Author SHA1 Message Date
Sebastian Raschka
a0b5603423 Make header more clear 2024-05-25 10:44:12 -05:00
rasbt
fe8bb9291e update formatting 2024-05-24 07:20:37 -05:00
rasbt
a9703f4603 update header 2024-05-24 07:15:41 -05:00
Sebastian Raschka
ee6afe260a clarify 2024-05-24 06:58:12 -05:00
Sebastian Raschka
ecb1788a9a Update cover 2024-05-24 06:56:20 -05:00
Sebastian Raschka
7589df7b25 Update cover and add amazon link 2024-05-24 06:55:19 -05:00
Sebastian Raschka
313979a4ce Merge pull request #177 from rasbt/flops-analysis
Flops analysis
2024-05-23 21:45:16 -04:00
rasbt
d93fbbd4b9 flops analysis 2024-05-23 20:35:41 -05:00
rasbt
aa084656e0 update how to retrieve learning rate 2024-05-23 17:19:01 -05:00
rasbt
30ba6a3f4b trainable token -> trainable token position 2024-05-23 11:43:20 -05:00
Sebastian Raschka
209a103d66 Merge pull request #176 from rasbt/dataset-length-warning
Add assertion about data set length
2024-05-23 07:58:47 -04:00
rasbt
c35cf65dbf add assertion about data set length 2024-05-23 06:50:43 -05:00
Sebastian Raschka
66f02723ca Merge pull request #174 from rasbt/update-regex
update regex
2024-05-22 21:37:50 -04:00
rasbt
1407085f07 reset cell count for better nbdiff 2024-05-22 20:27:09 -05:00
rasbt
85c3210105 update regex 2024-05-22 20:15:31 -05:00
Sebastian Raschka
1bf24669f8 Merge pull request #173 from rasbt/device-setting
Fix device setting
2024-05-22 18:59:58 -04:00
rasbt
c4cd48475c Fix device setting 2024-05-22 17:51:51 -05:00
Sebastian Raschka
6aa4572ff7 Merge pull request #171 from d-kleine/main
fixed last_two_blocks
2024-05-22 18:42:29 -04:00
Daniel Kleine
4b0fdab1de removed empty line 2024-05-22 16:15:13 +00:00
Daniel Kleine
a81ba9bd8b fixed last_two_blocks 2024-05-22 02:02:43 +00:00
Sebastian Raschka
ef17f9bda0 Merge pull request #170 from rasbt/last-two-blocks
Experiment with last two blocks
2024-05-21 20:57:55 -04:00
rasbt
80d857c605 fix table alignment 2024-05-21 19:51:22 -05:00
rasbt
8a27baf7c2 experiment with last two blocks 2024-05-21 19:49:34 -05:00
Sebastian Raschka
d2f9f908dc Merge pull request #169 from d-kleine/main
minor: 2nd gitignore / add. exp. table
2024-05-21 20:27:33 -04:00
Daniel Kleine
130a69ce27 improved readability of Additional Experiments table 2024-05-21 19:26:25 +00:00
Daniel Kleine
69da9ed447 removed unnecessary .gitignore 2024-05-21 19:25:16 +00:00
Sebastian Raschka
3d0b94b504 Merge pull request #166 from rasbt/update-lora-init
Update lora init
2024-05-19 21:33:28 -04:00
rasbt
5e1ba8343d 100x -> 50x 2024-05-19 20:26:53 -05:00
rasbt
9512957b07 use macbook version 2024-05-19 20:19:02 -05:00
rasbt
7b9b53c9f2 update lora init 2024-05-19 20:11:56 -05:00
rasbt
3b72e55c26 remove duplicated text 2024-05-19 11:34:47 -05:00
Sebastian Raschka
15e103bf27 Merge pull request #165 from d-kleine/main
updated .gitignore
2024-05-19 12:34:31 -04:00
Daniel Kleine
7b397fcd46 updated .gitignore 2024-05-19 16:07:20 +00:00
rasbt
bc5cbbf1bd change defaults to 0 temp 2024-05-19 09:04:49 -05:00
rasbt
59f5ed8d68 use default value for temperature 2024-05-19 08:48:10 -05:00
rasbt
faffebae4b add ignore index experiment 2024-05-19 07:24:49 -05:00
rasbt
5541f7c8fe add test mode for dataset download 2024-05-18 17:38:19 -05:00
rasbt
bdea15f6c6 new experiment w/o causal mask 2024-05-18 17:03:36 -05:00
Sebastian Raschka
00a466f0b9 fix row number typo 2024-05-18 15:54:13 -05:00
Sebastian Raschka
c05e573121 Merge pull request #164 from rasbt/eos_id-token
Add eos_id option for ch07
2024-05-18 16:10:25 -04:00
rasbt
9d84935b69 add eos_id option for ch07 2024-05-18 12:35:40 -05:00
rasbt
b8ba2c257a make consistent with the latest production version 2024-05-18 12:08:39 -05:00
rasbt
678fad50bc formatting for consistency with production chapter 2024-05-18 11:03:42 -05:00
Sebastian Raschka
cdd51f216e Merge pull request #163 from rasbt/add-gradient-accumulation
Add experiment with gradient accumulation
2024-05-17 22:45:57 -04:00
rasbt
10ebc47720 Add experiment with gradient accumulation 2024-05-17 21:31:22 -05:00
rasbt
623bc19665 fix no padding option 2024-05-17 21:06:51 -05:00
Sebastian Raschka
42d003c4ee Merge pull request #162 from d-kleine/main
minor: fixed variable name in text
2024-05-17 16:45:43 -04:00
Daniel Kleine
37a760bcf3 Merge branch 'rasbt:main' into main 2024-05-17 15:59:44 +02:00
Daniel Kleine
b9c3e10dd3 fixed var name 2024-05-17 13:58:07 +00:00
Sebastian Raschka
8480638964 Merge pull request #161 from rasbt/no-padding
Add new experiment without padding
2024-05-17 09:35:02 -04:00