87 Commits

Author SHA1 Message Date
Sebastian Raschka
49b8e2e767
Update README.md 2024-05-25 10:45:29 -05:00
rasbt
98d453b666
update formatting 2024-05-24 07:20:37 -05:00
rasbt
b43a7a8820
trainable token -> trainable token position 2024-05-23 11:43:20 -05:00
rasbt
18e729643d
add assertion about data set length 2024-05-23 06:50:43 -05:00
rasbt
86f6c2df43
Fix device setting 2024-05-22 17:51:51 -05:00
Daniel Kleine
982cbe5e40 removed empty line 2024-05-22 16:15:13 +00:00
Daniel Kleine
195849cf8d fixed last_two_blocks 2024-05-22 02:02:43 +00:00
rasbt
caf3725001
fix table alignment 2024-05-21 19:51:22 -05:00
rasbt
725bed56f7
experiment with last two blocks 2024-05-21 19:49:34 -05:00
Daniel Kleine
f39087e573 improved readability of Additional Experiments table 2024-05-21 19:26:25 +00:00
rasbt
c2028871e4
update lora init 2024-05-19 20:11:56 -05:00
rasbt
a8a28017c0
remove duplicated text 2024-05-19 11:34:47 -05:00
rasbt
a5593f9860
change defaults to 0 temp 2024-05-19 09:04:49 -05:00
rasbt
1463b2ae47
use default value for temperature 2024-05-19 08:48:10 -05:00
rasbt
1b340c9eb6 add ignore index experiment 2024-05-19 07:24:49 -05:00
rasbt
02e6f06a11
add test mode for dataset download 2024-05-18 17:38:19 -05:00
rasbt
5ef4edf2b5
new experiment w/o causal mask 2024-05-18 17:03:36 -05:00
Sebastian Raschka
57634f2045
fix row number typo 2024-05-18 15:54:13 -05:00
rasbt
4851d5a0fa
add eos_id option for ch07 2024-05-18 12:35:40 -05:00
rasbt
42cb0cbd59
Add experiment with gradient accumulation 2024-05-17 21:31:22 -05:00
rasbt
fc88fefd9c
fix no padding option 2024-05-17 21:06:51 -05:00
rasbt
cbe9664ef4
fix link 2024-05-17 08:20:35 -05:00
rasbt
5cfc64d038
fix indent 2024-05-17 07:58:01 -05:00
rasbt
04b9540938
Add new experiment without padding 2024-05-17 07:55:51 -05:00
Sebastian Raschka
e631823762 improve bonus code in chapter 06 2024-05-14 20:35:50 -04:00
Sebastian Raschka
717b294680
Merge branch 'main' into main 2024-05-14 08:28:02 -05:00
rasbt
52f15dff30
fix file path name 2024-05-14 08:27:46 -05:00
Sebastian Raschka
fa52c3bc78
Merge branch 'main' into main 2024-05-14 08:12:19 -05:00
rasbt
6cfec73490
add previous chapters file 2024-05-14 08:11:58 -05:00
Sebastian Raschka
abd29ce7c2
Merge branch 'main' into main 2024-05-14 08:07:58 -05:00
rasbt
25fb63e14a
add missing gpt-download.py 2024-05-14 08:05:56 -05:00
Daniel Kleine
4bf268f398 added missing python run statement 2024-05-14 12:17:09 +00:00
rasbt
c7c83904a0
tokens seen -> examples seen 2024-05-13 20:08:48 -05:00
rasbt
16d19751b0
spelling 2024-05-13 20:06:38 -05:00
rasbt
cd7ea15e8d
add readme 2024-05-13 08:50:55 -05:00
Sebastian Raschka
968af7e0ba
Merge pull request #153 from rasbt/ch06-exercises
Chapter 6 wrap-up
2024-05-13 08:14:08 -05:00
rasbt
b28cc0cb8c
pep8 fixes 2024-05-13 07:50:51 -05:00
rasbt
a740a62239
tests and exercises 2024-05-13 07:45:59 -05:00
Sebastian Raschka
5094eb7567
val before test acc 2024-05-13 07:36:18 -05:00
rasbt
8bc15ab316
fix tests 2024-05-12 19:03:14 -05:00
rasbt
21172a6a7e
add chapter 6 unit test 2024-05-12 18:51:28 -05:00
rasbt
281400feca
add missing figure 2024-05-12 18:37:02 -05:00
rasbt
88176a82eb
chapter 06 summary file 2024-05-12 18:27:50 -05:00
rasbt
ad41c6e3cc
use validation path 2024-05-12 09:41:46 -05:00
rasbt
33dda489a1
use path 2024-05-12 09:36:35 -05:00
rasbt
188d3cd262
basepath 2024-05-12 09:27:38 -05:00
rasbt
a733a7eb42
basepath 2024-05-12 09:25:56 -05:00
rasbt
2e47a6e61c
update dataset naming 2024-05-12 09:22:42 -05:00
rasbt
55c3a91838
rename download_and_unzip to make it more specific 2024-05-12 08:36:24 -05:00
Sebastian Raschka
58c591c0e0
add header 2024-05-11 14:37:21 -05:00