Daniel Kleine
|
79210eb393
|
fixes for code (#206)
* updated .gitignore
* removed unused GELU import
* fixed model_configs, fixed all tensors on same device
* removed unused tiktoken
* update
* update hparam search
* remove redundant tokenizer argument
---------
Co-authored-by: rasbt <mail@sebastianraschka.com>
|
2024-06-11 20:59:48 -05:00 |
|
rasbt
|
f0e4c99bc3
|
fix typo in comment
|
2024-06-09 06:14:02 -05:00 |
|
Sebastian Raschka
|
40ba3a4068
|
Remove leftover instances of self.tokenizer (#201)
* Remove leftover instances of self.tokenizer
* add endoftext token
|
2024-06-08 14:57:34 -05:00 |
|
rasbt
|
fe8bb9291e
|
update formatting
|
2024-05-24 07:20:37 -05:00 |
|
Daniel Kleine
|
4b0fdab1de
|
removed empty line
|
2024-05-22 16:15:13 +00:00 |
|
rasbt
|
05738f8be6
|
fix link
|
2024-05-17 08:20:35 -05:00 |
|
Sebastian Raschka
|
47b3ff15ec
|
improve bonus code in chapter 06
|
2024-05-14 20:35:50 -04:00 |
|
Sebastian Raschka
|
30010c7a91
|
Merge branch 'main' into main
|
2024-05-14 08:28:02 -05:00 |
|
rasbt
|
6aff47ba60
|
fix file path name
|
2024-05-14 08:27:46 -05:00 |
|
Sebastian Raschka
|
2f1e1a3d4b
|
Merge branch 'main' into main
|
2024-05-14 08:12:19 -05:00 |
|
rasbt
|
0b176bb1fc
|
add previous chapters file
|
2024-05-14 08:11:58 -05:00 |
|
Sebastian Raschka
|
d499c90903
|
Merge branch 'main' into main
|
2024-05-14 08:07:58 -05:00 |
|
rasbt
|
df4c59cf6e
|
add missing gpt-download.py
|
2024-05-14 08:05:56 -05:00 |
|
Daniel Kleine
|
c754b14a79
|
added missing python run statement
|
2024-05-14 12:17:09 +00:00 |
|
rasbt
|
73e1c68f45
|
use validation path
|
2024-05-12 09:41:46 -05:00 |
|
rasbt
|
1c13810d30
|
use path
|
2024-05-12 09:36:35 -05:00 |
|
rasbt
|
a0adf0d5d3
|
basepath
|
2024-05-12 09:27:38 -05:00 |
|
rasbt
|
913662ebeb
|
basepath
|
2024-05-12 09:25:56 -05:00 |
|
rasbt
|
98c0723b3d
|
update dataset naming
|
2024-05-12 09:22:42 -05:00 |
|
rasbt
|
beeaf323f1
|
rename download_and_unzip to make it more specific
|
2024-05-12 08:36:24 -05:00 |
|
Sebastian Raschka
|
49306b271f
|
add header
|
2024-05-11 14:37:21 -05:00 |
|
rasbt
|
75545e4c1b
|
experiments with largest model
|
2024-05-09 07:40:09 -05:00 |
|
rasbt
|
9457676640
|
ouput -> output
|
2024-05-05 12:21:10 -05:00 |
|
rasbt
|
354bb35726
|
use training set len
|
2024-04-29 21:50:07 -05:00 |
|
Sebastian Raschka
|
d1edfcb63f
|
add roberta option (#135)
|
2024-04-28 13:57:36 -05:00 |
|
Sebastian Raschka
|
4bbd476e7a
|
IMDB experiments (#128)
* IMDB experiments
* style fixes
* Update README.md
|
2024-04-25 07:20:53 -05:00 |
|