35 Commits

Author SHA1 Message Date
Sebastian Raschka
7659af7cdd Add backup URL for gpt2 weights (#469)
* Add backup URL for gpt2 weights

* newline
2025-01-05 11:28:09 -06:00
Sebastian Raschka
3c3dae0967 Add mean pooling experiment to classifier bonus experiments (#406)
* Add mean pooling experiment to classifier bonus  experiments

* formatting

* add average embeddings option

* pep8
2024-10-20 11:04:18 -05:00
Daniel Kleine
95926535f8 ch06/03 fixes (#336)
* fixed bash commands

* fixed help docstrings

* added missing logreg bash cmd

* Update train_bert_hf.py

* Update train_bert_hf_spam.py

* Update README.md

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-08-27 08:23:25 +02:00
rasbt
8eb6fc0ad0 sklearn baseline and roberta-large update 2024-08-26 10:31:54 +02:00
TITC
5acab58d41 add RoBERTa and params frozen (#335)
* add roberta experiment result

* add roberta & params frozen

* Update README.md

* modify lr

* modify lr

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-08-26 10:27:09 +02:00
Sebastian Raschka
296a91afb8 add BERT experiment results (#333)
* add BERT experiment results

* cleanup

* formatting
2024-08-23 08:40:40 -05:00
Sebastian Raschka
d0f3b034d8 Add download help message (#274) 2024-07-19 08:29:29 -05:00
Daniel Kleine
87f47a281a fixed spelling typos (#258) 2024-07-03 07:47:33 -05:00
Daniel Kleine
73be1c592f fixed num_workers (#229)
* fixed num_workers

* ch06 & ch07: added num_workers to create_dataloader_v1
2024-06-19 17:36:46 -05:00
Daniel Kleine
79210eb393 fixes for code (#206)
* updated .gitignore

* removed unused GELU import

* fixed model_configs, fixed all tensors on same device

* removed unused tiktoken

* update

* update hparam search

* remove redundant tokenizer argument

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2024-06-11 20:59:48 -05:00
rasbt
f0e4c99bc3 fix typo in comment 2024-06-09 06:14:02 -05:00
Sebastian Raschka
40ba3a4068 Remove leftover instances of self.tokenizer (#201)
* Remove leftover instances of self.tokenizer

* add endoftext token
2024-06-08 14:57:34 -05:00
rasbt
fe8bb9291e update formatting 2024-05-24 07:20:37 -05:00
Daniel Kleine
4b0fdab1de removed empty line 2024-05-22 16:15:13 +00:00
rasbt
05738f8be6 fix link 2024-05-17 08:20:35 -05:00
Sebastian Raschka
47b3ff15ec improve bonus code in chapter 06 2024-05-14 20:35:50 -04:00
Sebastian Raschka
30010c7a91 Merge branch 'main' into main 2024-05-14 08:28:02 -05:00
rasbt
6aff47ba60 fix file path name 2024-05-14 08:27:46 -05:00
Sebastian Raschka
2f1e1a3d4b Merge branch 'main' into main 2024-05-14 08:12:19 -05:00
rasbt
0b176bb1fc add previous chapters file 2024-05-14 08:11:58 -05:00
Sebastian Raschka
d499c90903 Merge branch 'main' into main 2024-05-14 08:07:58 -05:00
rasbt
df4c59cf6e add missing gpt-download.py 2024-05-14 08:05:56 -05:00
Daniel Kleine
c754b14a79 added missing python run statement 2024-05-14 12:17:09 +00:00
rasbt
73e1c68f45 use validation path 2024-05-12 09:41:46 -05:00
rasbt
1c13810d30 use path 2024-05-12 09:36:35 -05:00
rasbt
a0adf0d5d3 basepath 2024-05-12 09:27:38 -05:00
rasbt
913662ebeb basepath 2024-05-12 09:25:56 -05:00
rasbt
98c0723b3d update dataset naming 2024-05-12 09:22:42 -05:00
rasbt
beeaf323f1 rename download_and_unzip to make it more specific 2024-05-12 08:36:24 -05:00
Sebastian Raschka
49306b271f add header 2024-05-11 14:37:21 -05:00
rasbt
75545e4c1b experiments with largest model 2024-05-09 07:40:09 -05:00
rasbt
9457676640 ouput -> output 2024-05-05 12:21:10 -05:00
rasbt
354bb35726 use training set len 2024-04-29 21:50:07 -05:00
Sebastian Raschka
d1edfcb63f add roberta option (#135) 2024-04-28 13:57:36 -05:00
Sebastian Raschka
4bbd476e7a IMDB experiments (#128)
* IMDB experiments

* style fixes

* Update README.md
2024-04-25 07:20:53 -05:00