Daniel Kleine
|
dcbdc1d2e5
|
fixes for code (#206)
* updated .gitignore
* removed unused GELU import
* fixed model_configs, fixed all tensors on same device
* removed unused tiktoken
* update
* update hparam search
* remove redundant tokenizer argument
---------
Co-authored-by: rasbt <mail@sebastianraschka.com>
|
2024-06-11 20:59:48 -05:00 |
|
rasbt
|
1b1fd21d64
|
fix typo in comment
|
2024-06-09 06:14:02 -05:00 |
|
rasbt
|
ad41c6e3cc
|
use validation path
|
2024-05-12 09:41:46 -05:00 |
|
rasbt
|
33dda489a1
|
use path
|
2024-05-12 09:36:35 -05:00 |
|
rasbt
|
188d3cd262
|
basepath
|
2024-05-12 09:27:38 -05:00 |
|
rasbt
|
2e47a6e61c
|
update dataset naming
|
2024-05-12 09:22:42 -05:00 |
|
rasbt
|
6f486460bc
|
ouput -> output
|
2024-05-05 12:21:10 -05:00 |
|
Sebastian Raschka
|
70cd174091
|
add roberta option (#135)
|
2024-04-28 13:57:36 -05:00 |
|
Sebastian Raschka
|
59b4fd3e25
|
IMDB experiments (#128)
* IMDB experiments
* style fixes
* Update README.md
|
2024-04-25 07:20:53 -05:00 |
|