Daniel Kleine | dcbdc1d2e5 | fixes for code (#206): updated .gitignore; removed unused GELU import; fixed model_configs, fixed all tensors on same device; removed unused tiktoken; update; update hparam search; remove redundant tokenizer argument (Co-authored-by: rasbt <mail@sebastianraschka.com>) | 2024-06-11 20:59:48 -05:00 |
rasbt | 1b1fd21d64 | fix typo in comment | 2024-06-09 06:14:02 -05:00 |
rasbt | a733a7eb42 | basepath | 2024-05-12 09:25:56 -05:00 |
rasbt | 2e47a6e61c | update dataset naming | 2024-05-12 09:22:42 -05:00 |
rasbt | 756ff780de | experiments with largest model | 2024-05-09 07:40:09 -05:00 |
rasbt | 6f486460bc | ouput -> output | 2024-05-05 12:21:10 -05:00 |
rasbt | 0ac19a1e50 | use training set len | 2024-04-29 21:50:07 -05:00 |
Sebastian Raschka | 59b4fd3e25 | IMDB experiments (#128): IMDB experiments; style fixes; Update README.md | 2024-04-25 07:20:53 -05:00 |