704 Commits

Author SHA1 Message Date
rasbt
281400feca
add missing figure 2024-05-12 18:37:02 -05:00
rasbt
88176a82eb
chapter 06 summary file 2024-05-12 18:27:50 -05:00
rasbt
209f63943b
10 -> 5 2024-05-12 10:42:25 -05:00
rasbt
ad41c6e3cc
use validation path 2024-05-12 09:41:46 -05:00
rasbt
33dda489a1
use path 2024-05-12 09:36:35 -05:00
rasbt
188d3cd262
basepath 2024-05-12 09:27:38 -05:00
rasbt
a733a7eb42
basepath 2024-05-12 09:25:56 -05:00
rasbt
2e47a6e61c
update dataset naming 2024-05-12 09:22:42 -05:00
rasbt
55c3a91838
rename download_and_unzip to make it more specific 2024-05-12 08:36:24 -05:00
Sebastian Raschka
58c591c0e0
add header 2024-05-11 14:37:21 -05:00
rasbt
4b4e1e1ad5
use spam / not spam labels 2024-05-11 13:42:18 -05:00
speed
45f6e72f40
fix 1024 characters to 1024 tokens (#152) 2024-05-11 13:17:07 -05:00
rasbt
02ad1bef3a
reorder section 6.6 2024-05-11 08:27:07 -05:00
rasbt
694a57a472
explain how class labels are obtained 2024-05-11 07:42:13 -05:00
Sebastian Raschka
6fe8d1a10e
Update README.md 2024-05-11 06:42:05 -05:00
Sebastian Raschka
a3e1fa35f5
Move GPU column so that test accuracy is always visible 2024-05-11 06:41:25 -05:00
Sebastian Raschka
41288a3d3a
Add LoRA experiments (#151)
* Add LoRA experiments

* Update ch06/02_bonus_additional-experiments/additional-experiments.py
2024-05-10 07:26:41 -05:00
rasbt
51ac283257
6 -> 4 2024-05-10 07:02:14 -05:00
rasbt
ca84d797a7
Merge branch 'main' of https://github.com/rasbt/LLMs-from-scratch 2024-05-10 06:45:14 -05:00
Sebastian Raschka
b67ef1a2b2
fix punctuation and improve explanation 2024-05-09 21:15:09 -05:00
rasbt
0b6665939b
clarify overfitting 2024-05-09 09:09:26 -05:00
rasbt
037aab0959
Merge branch 'main' of https://github.com/rasbt/LLMs-from-scratch 2024-05-09 08:17:43 -05:00
Sebastian Raschka
86cf5878fd
experiments with largest model (#149) 2024-05-09 07:48:56 -05:00
rasbt
756ff780de
experiments with largest model 2024-05-09 07:40:09 -05:00
rasbt
2df81f59d3
spelling improvements 2024-05-09 07:25:52 -05:00
rasbt
80cd98884e
add note about worker number 2024-05-08 21:20:43 -05:00
rasbt
10d3370f74
update figure 6.6 2024-05-08 20:46:54 -05:00
rasbt
a09a70c345
text -> dataset 2024-05-08 08:14:03 -05:00
rasbt
24e9110fa8
make spam spelling consistent 2024-05-08 06:48:28 -05:00
Sebastian Raschka
9682b0e22d
Explain hardware requirements 2024-05-07 20:47:06 -05:00
rasbt
68c33a64e6
fix sentence 2024-05-07 07:05:47 -05:00
rasbt
d18f92fa34
add additional lora figure 2024-05-07 07:04:35 -05:00
rasbt
c93c90eb1e
spelling fix 2024-05-07 06:46:33 -05:00
rasbt
e99c511721
spelling and consistency improvements 2024-05-06 21:02:13 -05:00
rasbt
0dd80359df
formatting improvements 2024-05-06 20:35:51 -05:00
rasbt
16e276f8df
show downloads 2024-05-06 07:40:09 -05:00
rasbt
cddcbc8b49
tokenizing example 2024-05-06 07:16:40 -05:00
rasbt
258dcad5ee
ch06 csv 2024-05-06 07:16:30 -05:00
rasbt
83d5cea795
ch06 dataset 2024-05-06 06:55:56 -05:00
rasbt
8d78098dfa
classfication -> classification 2024-05-06 06:50:38 -05:00
rasbt
6f486460bc
ouput -> output 2024-05-05 12:21:10 -05:00
Ikko Eltociear Ashimine
b3215e3351
Update ch06.ipynb (#143)
ouput -> output
2024-05-05 12:18:20 -05:00
Sebastian Raschka
978ef48ccc
Appendix E: Parameter-efficient Finetuning with LoRA (#142) 2024-05-05 12:05:17 -05:00
rasbt
3a632323df
make code more general for larger models 2024-05-05 10:18:46 -05:00
Sebastian Raschka
2e9d5acb5e
cosmetics 2024-05-05 08:15:46 -05:00
rasbt
e9bdbf0725
add text-to-token-id fn 2024-05-05 08:05:20 -05:00
Sebastian Raschka
d3201f5aad
Add figures for ch06 (#141) 2024-05-05 07:10:04 -05:00
rasbt
b8324061d0
update link 2024-05-04 08:08:58 -05:00
rasbt
dddf87296e
table-update 2024-05-04 07:58:18 -05:00
rasbt
d60dcc6724
add description 2024-05-04 07:34:29 -05:00