8 Commits

Author SHA1 Message Date
Daniel Kleine
79210eb393 fixes for code (#206)
* updated .gitignore

* removed unused GELU import

* fixed model_configs, fixed all tensors on same device

* removed unused tiktoken

* update

* update hparam search

* remove redundant tokenizer argument

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2024-06-11 20:59:48 -05:00
rasbt
f0e4c99bc3 fix typo in comment 2024-06-09 06:14:02 -05:00
rasbt
913662ebeb basepath 2024-05-12 09:25:56 -05:00
rasbt
98c0723b3d update dataset naming 2024-05-12 09:22:42 -05:00
rasbt
75545e4c1b experiments with largest model 2024-05-09 07:40:09 -05:00
rasbt
9457676640 ouput -> output 2024-05-05 12:21:10 -05:00
rasbt
354bb35726 use training set len 2024-04-29 21:50:07 -05:00
Sebastian Raschka
4bbd476e7a IMDB experiments (#128)
* IMDB experiments

* style fixes

* Update README.md
2024-04-25 07:20:53 -05:00