Sebastian Raschka
|
ef17f9bda0
|
Merge pull request #170 from rasbt/last-two-blocks
Experiment with last two blocks
|
2024-05-21 20:57:55 -04:00 |
|
rasbt
|
80d857c605
|
fix table alignment
|
2024-05-21 19:51:22 -05:00 |
|
rasbt
|
8a27baf7c2
|
experiment with last two blocks
|
2024-05-21 19:49:34 -05:00 |
|
Sebastian Raschka
|
d2f9f908dc
|
Merge pull request #169 from d-kleine/main
minor: 2nd gitignore / add. exp. table
|
2024-05-21 20:27:33 -04:00 |
|
Daniel Kleine
|
130a69ce27
|
improved readability of Additional Experiments table
|
2024-05-21 19:26:25 +00:00 |
|
Daniel Kleine
|
69da9ed447
|
removed unnecessary .gitignore
|
2024-05-21 19:25:16 +00:00 |
|
Sebastian Raschka
|
3d0b94b504
|
Merge pull request #166 from rasbt/update-lora-init
Update lora init
|
2024-05-19 21:33:28 -04:00 |
|
rasbt
|
5e1ba8343d
|
100x -> 50x
|
2024-05-19 20:26:53 -05:00 |
|
rasbt
|
9512957b07
|
use macbook version
|
2024-05-19 20:19:02 -05:00 |
|
rasbt
|
7b9b53c9f2
|
update lora init
|
2024-05-19 20:11:56 -05:00 |
|
rasbt
|
3b72e55c26
|
remove duplicated text
|
2024-05-19 11:34:47 -05:00 |
|
Sebastian Raschka
|
15e103bf27
|
Merge pull request #165 from d-kleine/main
updated .gitignore
|
2024-05-19 12:34:31 -04:00 |
|
Daniel Kleine
|
7b397fcd46
|
updated .gitignore
|
2024-05-19 16:07:20 +00:00 |
|
rasbt
|
bc5cbbf1bd
|
change defaults to 0 temp
|
2024-05-19 09:04:49 -05:00 |
|
rasbt
|
59f5ed8d68
|
use default value for temperature
|
2024-05-19 08:48:10 -05:00 |
|
rasbt
|
faffebae4b
|
add ignore index experiment
|
2024-05-19 07:24:49 -05:00 |
|
rasbt
|
5541f7c8fe
|
add test mode for dataset download
|
2024-05-18 17:38:19 -05:00 |
|
rasbt
|
bdea15f6c6
|
new experiment w/o causal mask
|
2024-05-18 17:03:36 -05:00 |
|
Sebastian Raschka
|
00a466f0b9
|
fix row number typo
|
2024-05-18 15:54:13 -05:00 |
|
Sebastian Raschka
|
c05e573121
|
Merge pull request #164 from rasbt/eos_id-token
Add eos_id option for ch07
|
2024-05-18 16:10:25 -04:00 |
|
rasbt
|
9d84935b69
|
add eos_id option for ch07
|
2024-05-18 12:35:40 -05:00 |
|
rasbt
|
b8ba2c257a
|
make consistent with the latest production version
|
2024-05-18 12:08:39 -05:00 |
|
rasbt
|
678fad50bc
|
formatting for consistency with production chapter
|
2024-05-18 11:03:42 -05:00 |
|
Sebastian Raschka
|
cdd51f216e
|
Merge pull request #163 from rasbt/add-gradient-accumulation
Add experiment with gradient accumulation
|
2024-05-17 22:45:57 -04:00 |
|
rasbt
|
10ebc47720
|
Add experiment with gradient accumulation
|
2024-05-17 21:31:22 -05:00 |
|
rasbt
|
623bc19665
|
fix no padding option
|
2024-05-17 21:06:51 -05:00 |
|
Sebastian Raschka
|
42d003c4ee
|
Merge pull request #162 from d-kleine/main
minor: fixed variable name in text
|
2024-05-17 16:45:43 -04:00 |
|
Daniel Kleine
|
37a760bcf3
|
Merge branch 'rasbt:main' into main
|
2024-05-17 15:59:44 +02:00 |
|
Daniel Kleine
|
b9c3e10dd3
|
fixed var name
|
2024-05-17 13:58:07 +00:00 |
|
Sebastian Raschka
|
8480638964
|
Merge pull request #161 from rasbt/no-padding
Add new experiment without padding
|
2024-05-17 09:35:02 -04:00 |
|
rasbt
|
05738f8be6
|
fix link
|
2024-05-17 08:20:35 -05:00 |
|
rasbt
|
f1db50fe9a
|
fix indent
|
2024-05-17 07:58:01 -05:00 |
|
rasbt
|
2653c36957
|
Add new experiment without padding
|
2024-05-17 07:55:51 -05:00 |
|
Sebastian Raschka
|
e230576413
|
Merge pull request #160 from d-kleine/main
small changes Docker / OpenAI
|
2024-05-17 07:51:26 -04:00 |
|
Daniel Kleine
|
e6012b944e
|
fixed empty space
|
2024-05-17 10:44:18 +02:00 |
|
Daniel Kleine
|
018a22338f
|
added missing step 2 and prettyfied readme
|
2024-05-17 10:43:35 +02:00 |
|
rasbt
|
6c6321f671
|
simplify code
|
2024-05-16 20:16:25 -05:00 |
|
Sebastian Raschka
|
d0777e3aa0
|
Merge pull request #157 from DrCesar/main
fix move model to device before calculating loss
|
2024-05-15 20:58:43 -04:00 |
|
Sebastian Raschka
|
df5f680383
|
Merge pull request #159 from ehberg/main
Update README.md, minor path change
|
2024-05-15 20:46:31 -04:00 |
|
Eric Berg
|
3614f6efa9
|
Update README.md, minor path change
|
2024-05-15 13:51:51 -04:00 |
|
Sebastian Raschka
|
a26fd9d02a
|
Merge pull request #158 from d-kleine/main
(minor) Updated vscode extensions and gitignore, added missing word
|
2024-05-15 06:17:57 -05:00 |
|
Daniel Kleine
|
5411f31186
|
added missing word
|
2024-05-15 06:33:25 +00:00 |
|
Daniel Kleine
|
c78ceafe51
|
updated .gitignore with appendix artifacts
|
2024-05-15 06:30:24 +00:00 |
|
DrCesar
|
d2410cb0c6
|
fix move model to device before calculating loss
|
2024-05-14 22:28:00 -07:00 |
|
Daniel Kleine
|
bbb4703cf3
|
added csv extension
|
2024-05-15 05:26:57 +00:00 |
|
Sebastian Raschka
|
e4227c899b
|
Merge pull request #156 from rasbt/improve-bonus-code-ch06
Improve bonus code in chapter 06
|
2024-05-14 19:44:20 -05:00 |
|
Sebastian Raschka
|
47b3ff15ec
|
improve bonus code in chapter 06
|
2024-05-14 20:35:50 -04:00 |
|
Sebastian Raschka
|
dad6eb4456
|
Merge pull request #154 from d-kleine/main
updated .gitignore and added missing python run statement
|
2024-05-14 08:30:26 -05:00 |
|
Sebastian Raschka
|
30010c7a91
|
Merge branch 'main' into main
|
2024-05-14 08:28:02 -05:00 |
|
rasbt
|
6aff47ba60
|
fix file path name
|
2024-05-14 08:27:46 -05:00 |
|