| Author | Commit | Message | Date |
|---|---|---|---|
| Sebastian Raschka | a0b5603423 | Make header more clear | 2024-05-25 10:44:12 -05:00 |
| rasbt | fe8bb9291e | update formatting | 2024-05-24 07:20:37 -05:00 |
| rasbt | a9703f4603 | update header | 2024-05-24 07:15:41 -05:00 |
| Sebastian Raschka | ee6afe260a | clarify | 2024-05-24 06:58:12 -05:00 |
| Sebastian Raschka | ecb1788a9a | Update cover | 2024-05-24 06:56:20 -05:00 |
| Sebastian Raschka | 7589df7b25 | Update cover and add amazon link | 2024-05-24 06:55:19 -05:00 |
| Sebastian Raschka | 313979a4ce | Merge pull request #177 from rasbt/flops-analysis (Flops analysis) | 2024-05-23 21:45:16 -04:00 |
| rasbt | d93fbbd4b9 | flops analysis | 2024-05-23 20:35:41 -05:00 |
| rasbt | aa084656e0 | update how to retrieve learning rate | 2024-05-23 17:19:01 -05:00 |
| rasbt | 30ba6a3f4b | trainable token -> trainable token position | 2024-05-23 11:43:20 -05:00 |
| Sebastian Raschka | 209a103d66 | Merge pull request #176 from rasbt/dataset-length-warning (Add assertion about data set length) | 2024-05-23 07:58:47 -04:00 |
| rasbt | c35cf65dbf | add assertion about data set length | 2024-05-23 06:50:43 -05:00 |
| Sebastian Raschka | 66f02723ca | Merge pull request #174 from rasbt/update-regex (update regex) | 2024-05-22 21:37:50 -04:00 |
| rasbt | 1407085f07 | reset cell count for better nbdiff | 2024-05-22 20:27:09 -05:00 |
| rasbt | 85c3210105 | update regex | 2024-05-22 20:15:31 -05:00 |
| Sebastian Raschka | 1bf24669f8 | Merge pull request #173 from rasbt/device-setting (Fix device setting) | 2024-05-22 18:59:58 -04:00 |
| rasbt | c4cd48475c | Fix device setting | 2024-05-22 17:51:51 -05:00 |
| Sebastian Raschka | 6aa4572ff7 | Merge pull request #171 from d-kleine/main (fixed last_two_blocks) | 2024-05-22 18:42:29 -04:00 |
| Daniel Kleine | 4b0fdab1de | removed empty line | 2024-05-22 16:15:13 +00:00 |
| Daniel Kleine | a81ba9bd8b | fixed last_two_blocks | 2024-05-22 02:02:43 +00:00 |
| Sebastian Raschka | ef17f9bda0 | Merge pull request #170 from rasbt/last-two-blocks (Experiment with last two blocks) | 2024-05-21 20:57:55 -04:00 |
| rasbt | 80d857c605 | fix table alignment | 2024-05-21 19:51:22 -05:00 |
| rasbt | 8a27baf7c2 | experiment with last two blocks | 2024-05-21 19:49:34 -05:00 |
| Sebastian Raschka | d2f9f908dc | Merge pull request #169 from d-kleine/main (minor: 2nd gitignore / add. exp. table) | 2024-05-21 20:27:33 -04:00 |
| Daniel Kleine | 130a69ce27 | improved readability of Additional Experiments table | 2024-05-21 19:26:25 +00:00 |
| Daniel Kleine | 69da9ed447 | removed unnecessary .gitignore | 2024-05-21 19:25:16 +00:00 |
| Sebastian Raschka | 3d0b94b504 | Merge pull request #166 from rasbt/update-lora-init (Update lora init) | 2024-05-19 21:33:28 -04:00 |
| rasbt | 5e1ba8343d | 100x -> 50x | 2024-05-19 20:26:53 -05:00 |
| rasbt | 9512957b07 | use macbook version | 2024-05-19 20:19:02 -05:00 |
| rasbt | 7b9b53c9f2 | update lora init | 2024-05-19 20:11:56 -05:00 |
| rasbt | 3b72e55c26 | remove duplicated text | 2024-05-19 11:34:47 -05:00 |
| Sebastian Raschka | 15e103bf27 | Merge pull request #165 from d-kleine/main (updated .gitignore) | 2024-05-19 12:34:31 -04:00 |
| Daniel Kleine | 7b397fcd46 | updated .gitignore | 2024-05-19 16:07:20 +00:00 |
| rasbt | bc5cbbf1bd | change defaults to 0 temp | 2024-05-19 09:04:49 -05:00 |
| rasbt | 59f5ed8d68 | use default value for temperature | 2024-05-19 08:48:10 -05:00 |
| rasbt | faffebae4b | add ignore index experiment | 2024-05-19 07:24:49 -05:00 |
| rasbt | 5541f7c8fe | add test mode for dataset download | 2024-05-18 17:38:19 -05:00 |
| rasbt | bdea15f6c6 | new experiment w/o causal mask | 2024-05-18 17:03:36 -05:00 |
| Sebastian Raschka | 00a466f0b9 | fix row number typo | 2024-05-18 15:54:13 -05:00 |
| Sebastian Raschka | c05e573121 | Merge pull request #164 from rasbt/eos_id-token (Add eos_id option for ch07) | 2024-05-18 16:10:25 -04:00 |
| rasbt | 9d84935b69 | add eos_id option for ch07 | 2024-05-18 12:35:40 -05:00 |
| rasbt | b8ba2c257a | make consistent with the latest production version | 2024-05-18 12:08:39 -05:00 |
| rasbt | 678fad50bc | formatting for consistency with production chapter | 2024-05-18 11:03:42 -05:00 |
| Sebastian Raschka | cdd51f216e | Merge pull request #163 from rasbt/add-gradient-accumulation (Add experiment with gradient accumulation) | 2024-05-17 22:45:57 -04:00 |
| rasbt | 10ebc47720 | Add experiment with gradient accumulation | 2024-05-17 21:31:22 -05:00 |
| rasbt | 623bc19665 | fix no padding option | 2024-05-17 21:06:51 -05:00 |
| Sebastian Raschka | 42d003c4ee | Merge pull request #162 from d-kleine/main (minor: fixed variable name in text) | 2024-05-17 16:45:43 -04:00 |
| Daniel Kleine | 37a760bcf3 | Merge branch 'rasbt:main' into main | 2024-05-17 15:59:44 +02:00 |
| Daniel Kleine | b9c3e10dd3 | fixed var name | 2024-05-17 13:58:07 +00:00 |
| Sebastian Raschka | 8480638964 | Merge pull request #161 from rasbt/no-padding (Add new experiment without padding) | 2024-05-17 09:35:02 -04:00 |