8 Commits

Author SHA1 Message Date
Daniel Kleine
dcbdc1d2e5
fixes for code (#206)
* updated .gitignore

* removed unused GELU import

* fixed model_configs, fixed all tensors on same device

* removed unused tiktoken

* update

* update hparam search

* remove redundant tokenizer argument

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2024-06-11 20:59:48 -05:00
rasbt
1b1fd21d64
fix typo in comment 2024-06-09 06:14:02 -05:00
rasbt
a733a7eb42
basepath 2024-05-12 09:25:56 -05:00
rasbt
2e47a6e61c
update dataset naming 2024-05-12 09:22:42 -05:00
rasbt
756ff780de experiments with largest model 2024-05-09 07:40:09 -05:00
rasbt
6f486460bc
ouput -> output 2024-05-05 12:21:10 -05:00
rasbt
0ac19a1e50 use training set len 2024-04-29 21:50:07 -05:00
Sebastian Raschka
59b4fd3e25
IMDB experiments (#128)
* IMDB experiments

* style fixes

* Update README.md
2024-04-25 07:20:53 -05:00