Daniel Kleine
|
79210eb393
|
fixes for code (#206)
* updated .gitignore
* removed unused GELU import
* fixed model_configs, fixed all tensors on same device
* removed unused tiktoken
* update
* update hparam search
* remove redundant tokenizer argument
---------
Co-authored-by: rasbt <mail@sebastianraschka.com>
|
2024-06-11 20:59:48 -05:00 |
|
rasbt
|
f0e4c99bc3
|
fix typo in comment
|
2024-06-09 06:14:02 -05:00 |
|
rasbt
|
913662ebeb
|
basepath
|
2024-05-12 09:25:56 -05:00 |
|
rasbt
|
98c0723b3d
|
update dataset naming
|
2024-05-12 09:22:42 -05:00 |
|
rasbt
|
75545e4c1b
|
experiments with largest model
|
2024-05-09 07:40:09 -05:00 |
|
rasbt
|
9457676640
|
ouput -> output
|
2024-05-05 12:21:10 -05:00 |
|
rasbt
|
354bb35726
|
use training set len
|
2024-04-29 21:50:07 -05:00 |
|
Sebastian Raschka
|
4bbd476e7a
|
IMDB experiments (#128)
* IMDB experiments
* style fixes
* Update README.md
|
2024-04-25 07:20:53 -05:00 |
|