732 Commits

Author SHA1 Message Date
Sebastian Raschka
86cf5878fd
experiments with largest model (#149) 2024-05-09 07:48:56 -05:00
rasbt
756ff780de experiments with largest model 2024-05-09 07:40:09 -05:00
rasbt
2df81f59d3 spelling improvements 2024-05-09 07:25:52 -05:00
rasbt
80cd98884e add note about worker number 2024-05-08 21:20:43 -05:00
rasbt
10d3370f74 update figure 6.6 2024-05-08 20:46:54 -05:00
rasbt
a09a70c345
text -> dataset 2024-05-08 08:14:03 -05:00
rasbt
24e9110fa8
make spam spelling consistent 2024-05-08 06:48:28 -05:00
Sebastian Raschka
9682b0e22d
Explain hardware requirements 2024-05-07 20:47:06 -05:00
rasbt
68c33a64e6 fix sentence 2024-05-07 07:05:47 -05:00
rasbt
d18f92fa34 add additional lora figure 2024-05-07 07:04:35 -05:00
rasbt
c93c90eb1e spelling fix 2024-05-07 06:46:33 -05:00
rasbt
e99c511721 spelling and consistency improvements 2024-05-06 21:02:13 -05:00
rasbt
0dd80359df formatting improvements 2024-05-06 20:35:51 -05:00
rasbt
16e276f8df show downloads 2024-05-06 07:40:09 -05:00
rasbt
cddcbc8b49 tokenizing example 2024-05-06 07:16:40 -05:00
rasbt
258dcad5ee ch06 csv 2024-05-06 07:16:30 -05:00
rasbt
83d5cea795 ch06 dataset 2024-05-06 06:55:56 -05:00
rasbt
8d78098dfa classfication -> classification 2024-05-06 06:50:38 -05:00
rasbt
6f486460bc
ouput -> output 2024-05-05 12:21:10 -05:00
Ikko Eltociear Ashimine
b3215e3351
Update ch06.ipynb (#143)
ouput -> output
2024-05-05 12:18:20 -05:00
Sebastian Raschka
978ef48ccc
Appendix E: Parameter-efficient Finetuning with LoRA (#142) 2024-05-05 12:05:17 -05:00
rasbt
3a632323df
make code more general for larger models 2024-05-05 10:18:46 -05:00
Sebastian Raschka
2e9d5acb5e
cosmetics 2024-05-05 08:15:46 -05:00
rasbt
e9bdbf0725
add text-to-token-id fn 2024-05-05 08:05:20 -05:00
Sebastian Raschka
d3201f5aad
Add figures for ch06 (#141) 2024-05-05 07:10:04 -05:00
rasbt
b8324061d0
update link 2024-05-04 08:08:58 -05:00
rasbt
dddf87296e
table-update 2024-05-04 07:58:18 -05:00
rasbt
d60dcc6724
add description 2024-05-04 07:34:29 -05:00
Sebastian Raschka
da61d5b76a
Ch06 draft (#138)
* Ch06 first draft

* add utility files
2024-05-03 08:37:58 -05:00
rasbt
c735c21e87
fix swiglu acronym 2024-05-01 20:26:17 -05:00
rasbt
aec169dc12 link formatting 2024-04-30 06:26:23 -05:00
rasbt
d249960bdc Merge branch 'main' of https://github.com/rasbt/LLMs-from-scratch 2024-04-30 06:25:37 -05:00
Sebastian Raschka
82d6bd47a4
use training set len (#137) 2024-04-29 21:56:05 -05:00
rasbt
0ac19a1e50 use training set len 2024-04-29 21:50:07 -05:00
Sebastian Raschka
97ed38116a
Rename drop_resid to drop_shortcut (#136) 2024-04-28 14:31:27 -05:00
Sebastian Raschka
70cd174091
add roberta option (#135) 2024-04-28 13:57:36 -05:00
Sebastian Raschka
ca47c5e4b2
Formatting improvements (#134)
* formatting improvements

* .yml triggers
2024-04-28 12:05:32 -05:00
Sebastian Raschka
9a5d4d8ac9
Try windows runners (#133)
* try windows runners

* update triggers

* trigger with code file update

* add new status badges
2024-04-28 07:39:23 -05:00
Sebastian Raschka
e1d094b655
Update README.md 2024-04-27 07:59:42 -05:00
Sebastian Raschka
fc3d70f72f
Data loader intuition with numbers (#132)
* data loader intuition with numbers

* fix link

* fix tests
2024-04-27 07:56:41 -05:00
Sebastian Raschka
4adb96d7ee
Make code more consistent and add projection layer (#131)
* Make code more consistent and add projection

* remove redundant buffer
2024-04-26 17:13:08 -05:00
Sebastian Raschka
59b4fd3e25
IMDB experiments (#128)
* IMDB experiments

* style fixes

* Update README.md
2024-04-25 07:20:53 -05:00
rasbt
258aff3e9a style checks 2024-04-24 07:48:51 -05:00
rasbt
46d09b30d9 add usage 2024-04-24 07:27:04 -05:00
rasbt
5ef438aa3b add more experiments 2024-04-24 07:23:11 -05:00
rasbt
642f819910 update requirements 2024-04-24 06:38:02 -05:00
rasbt
3b4484029d
rename folder 2024-04-23 21:02:57 -05:00
rasbt
c7cdedf981
update figures in bonus notebook 2024-04-23 21:01:27 -05:00
Sebastian Raschka
16964a6486
Chapter 6 ablation studies (#127)
* Chapter 6 ablation studies

* add table

* formatting

* formatting

* formatting
2024-04-23 09:51:52 -05:00
Sebastian Raschka
0bd2608a6c update stride wording 2024-04-22 20:40:48 -05:00