128 Commits

Author SHA1 Message Date
Daniel Kleine
f39087e573 improved readability of Additional Experiments table 2024-05-21 19:26:25 +00:00
rasbt
c2028871e4
update lora init 2024-05-19 20:11:56 -05:00
rasbt
a8a28017c0
remove duplicated text 2024-05-19 11:34:47 -05:00
rasbt
a5593f9860
change defaults to 0 temp 2024-05-19 09:04:49 -05:00
rasbt
1463b2ae47
use default value for temperature 2024-05-19 08:48:10 -05:00
rasbt
1b340c9eb6 add ignore index experiment 2024-05-19 07:24:49 -05:00
rasbt
02e6f06a11
add test mode for dataset download 2024-05-18 17:38:19 -05:00
rasbt
5ef4edf2b5
new experiment w/o causal mask 2024-05-18 17:03:36 -05:00
Sebastian Raschka
57634f2045
fix row number typo 2024-05-18 15:54:13 -05:00
rasbt
4851d5a0fa
add eos_id option for ch07 2024-05-18 12:35:40 -05:00
rasbt
42cb0cbd59
Add experiment with gradient accumulation 2024-05-17 21:31:22 -05:00
rasbt
fc88fefd9c
fix no padding option 2024-05-17 21:06:51 -05:00
rasbt
cbe9664ef4
fix link 2024-05-17 08:20:35 -05:00
rasbt
5cfc64d038
fix indent 2024-05-17 07:58:01 -05:00
rasbt
04b9540938
Add new experiment without padding 2024-05-17 07:55:51 -05:00
Sebastian Raschka
e631823762 improve bonus code in chapter 06 2024-05-14 20:35:50 -04:00
Sebastian Raschka
717b294680
Merge branch 'main' into main 2024-05-14 08:28:02 -05:00
rasbt
52f15dff30
fix file path name 2024-05-14 08:27:46 -05:00
Sebastian Raschka
fa52c3bc78
Merge branch 'main' into main 2024-05-14 08:12:19 -05:00
rasbt
6cfec73490
add previous chapters file 2024-05-14 08:11:58 -05:00
Sebastian Raschka
abd29ce7c2
Merge branch 'main' into main 2024-05-14 08:07:58 -05:00
rasbt
25fb63e14a
add missing gpt-download.py 2024-05-14 08:05:56 -05:00
Daniel Kleine
4bf268f398 added missing python run statement 2024-05-14 12:17:09 +00:00
rasbt
c7c83904a0
tokens seen -> examples seen 2024-05-13 20:08:48 -05:00
rasbt
16d19751b0
spelling 2024-05-13 20:06:38 -05:00
rasbt
cd7ea15e8d
add readme 2024-05-13 08:50:55 -05:00
Sebastian Raschka
968af7e0ba
Merge pull request #153 from rasbt/ch06-exercises
Chapter 6 wrap-up
2024-05-13 08:14:08 -05:00
rasbt
b28cc0cb8c
pep8 fixes 2024-05-13 07:50:51 -05:00
rasbt
a740a62239
tests and exercises 2024-05-13 07:45:59 -05:00
Sebastian Raschka
5094eb7567
val before test acc 2024-05-13 07:36:18 -05:00
rasbt
8bc15ab316
fix tests 2024-05-12 19:03:14 -05:00
rasbt
21172a6a7e
add chapter 6 unit test 2024-05-12 18:51:28 -05:00
rasbt
281400feca
add missing figure 2024-05-12 18:37:02 -05:00
rasbt
88176a82eb
chapter 06 summary file 2024-05-12 18:27:50 -05:00
rasbt
ad41c6e3cc
use validation path 2024-05-12 09:41:46 -05:00
rasbt
33dda489a1
use path 2024-05-12 09:36:35 -05:00
rasbt
188d3cd262
basepath 2024-05-12 09:27:38 -05:00
rasbt
a733a7eb42
basepath 2024-05-12 09:25:56 -05:00
rasbt
2e47a6e61c
update dataset naming 2024-05-12 09:22:42 -05:00
rasbt
55c3a91838
rename download_and_unzip to make it more specific 2024-05-12 08:36:24 -05:00
Sebastian Raschka
58c591c0e0
add header 2024-05-11 14:37:21 -05:00
rasbt
4b4e1e1ad5
use spam / not spam labels 2024-05-11 13:42:18 -05:00
rasbt
02ad1bef3a reorder section 6.6 2024-05-11 08:27:07 -05:00
rasbt
694a57a472 explain how class labels are obtained 2024-05-11 07:42:13 -05:00
Sebastian Raschka
6fe8d1a10e
Update README.md 2024-05-11 06:42:05 -05:00
Sebastian Raschka
a3e1fa35f5
Move GPU column so that test accuracy is always visible 2024-05-11 06:41:25 -05:00
Sebastian Raschka
41288a3d3a
Add LoRA experiments (#151)
* Add LoRA experiments

* Update ch06/02_bonus_additional-experiments/additional-experiments.py
2024-05-10 07:26:41 -05:00
rasbt
51ac283257 6 -> 4 2024-05-10 07:02:14 -05:00
rasbt
0b6665939b
clarify overfitting 2024-05-09 09:09:26 -05:00
Sebastian Raschka
86cf5878fd
experiments with largest model (#149) 2024-05-09 07:48:56 -05:00