Sebastian Raschka
7659af7cdd
Add backup URL for gpt2 weights (#469)
* Add backup URL for gpt2 weights
* newline
2025-01-05 11:28:09 -06:00
Sebastian Raschka
3c3dae0967
Add mean pooling experiment to classifier bonus experiments (#406)
* Add mean pooling experiment to classifier bonus experiments
* formatting
* add average embeddings option
* pep8
2024-10-20 11:04:18 -05:00
Daniel Kleine
95926535f8
ch06/03 fixes (#336)
* fixed bash commands
* fixed help docstrings
* added missing logreg bash cmd
* Update train_bert_hf.py
* Update train_bert_hf_spam.py
* Update README.md
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-08-27 08:23:25 +02:00
rasbt
8eb6fc0ad0
sklearn baseline and roberta-large update
2024-08-26 10:31:54 +02:00
TITC
5acab58d41
add RoBERTa and params frozen (#335)
* add roberta experiment result
* add roberta & params frozen
* Update README.md
* modify lr
* modify lr
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-08-26 10:27:09 +02:00
Sebastian Raschka
296a91afb8
add BERT experiment results (#333)
* add BERT experiment results
* cleanup
* formatting
2024-08-23 08:40:40 -05:00
Sebastian Raschka
d0f3b034d8
Add download help message (#274)
2024-07-19 08:29:29 -05:00
Daniel Kleine
87f47a281a
fixed spelling typos (#258)
2024-07-03 07:47:33 -05:00
Daniel Kleine
73be1c592f
fixed num_workers (#229)
* fixed num_workers
* ch06 & ch07: added num_workers to create_dataloader_v1
2024-06-19 17:36:46 -05:00
Daniel Kleine
79210eb393
fixes for code (#206)
* updated .gitignore
* removed unused GELU import
* fixed model_configs, fixed all tensors on same device
* removed unused tiktoken
* update
* update hparam search
* remove redundant tokenizer argument
---------
Co-authored-by: rasbt <mail@sebastianraschka.com>
2024-06-11 20:59:48 -05:00
rasbt
f0e4c99bc3
fix typo in comment
2024-06-09 06:14:02 -05:00
Sebastian Raschka
40ba3a4068
Remove leftover instances of self.tokenizer (#201)
* Remove leftover instances of self.tokenizer
* add endoftext token
2024-06-08 14:57:34 -05:00
rasbt
fe8bb9291e
update formatting
2024-05-24 07:20:37 -05:00
Daniel Kleine
4b0fdab1de
removed empty line
2024-05-22 16:15:13 +00:00
rasbt
05738f8be6
fix link
2024-05-17 08:20:35 -05:00
Sebastian Raschka
47b3ff15ec
improve bonus code in chapter 06
2024-05-14 20:35:50 -04:00
Sebastian Raschka
30010c7a91
Merge branch 'main' into main
2024-05-14 08:28:02 -05:00
rasbt
6aff47ba60
fix file path name
2024-05-14 08:27:46 -05:00
Sebastian Raschka
2f1e1a3d4b
Merge branch 'main' into main
2024-05-14 08:12:19 -05:00
rasbt
0b176bb1fc
add previous chapters file
2024-05-14 08:11:58 -05:00
Sebastian Raschka
d499c90903
Merge branch 'main' into main
2024-05-14 08:07:58 -05:00
rasbt
df4c59cf6e
add missing gpt-download.py
2024-05-14 08:05:56 -05:00
Daniel Kleine
c754b14a79
added missing python run statement
2024-05-14 12:17:09 +00:00
rasbt
73e1c68f45
use validation path
2024-05-12 09:41:46 -05:00
rasbt
1c13810d30
use path
2024-05-12 09:36:35 -05:00
rasbt
a0adf0d5d3
basepath
2024-05-12 09:27:38 -05:00
rasbt
913662ebeb
basepath
2024-05-12 09:25:56 -05:00
rasbt
98c0723b3d
update dataset naming
2024-05-12 09:22:42 -05:00
rasbt
beeaf323f1
rename download_and_unzip to make it more specific
2024-05-12 08:36:24 -05:00
Sebastian Raschka
49306b271f
add header
2024-05-11 14:37:21 -05:00
rasbt
75545e4c1b
experiments with largest model
2024-05-09 07:40:09 -05:00
rasbt
9457676640
ouput -> output
2024-05-05 12:21:10 -05:00
rasbt
354bb35726
use training set len
2024-04-29 21:50:07 -05:00
Sebastian Raschka
d1edfcb63f
add roberta option (#135)
2024-04-28 13:57:36 -05:00
Sebastian Raschka
4bbd476e7a
IMDB experiments (#128)
* IMDB experiments
* style fixes
* Update README.md
2024-04-25 07:20:53 -05:00