Daniel Kleine
|
ef40f2f9ad
|
minor bug fixes (#207)
* fixed path arg for create_dataset_csvs()
* updated assign_check() to remove user warning
|
2024-06-12 06:27:56 -05:00 |
|
Daniel Kleine
|
dcbdc1d2e5
|
fixes for code (#206)
* updated .gitignore
* removed unused GELU import
* fixed model_configs, fixed all tensors on same device
* removed unused tiktoken
* update
* update hparam search
* remove redundant tokenizer argument
---------
Co-authored-by: rasbt <mail@sebastianraschka.com>
|
2024-06-11 20:59:48 -05:00 |
|
rasbt
|
98d453b666
|
update formatting
|
2024-05-24 07:20:37 -05:00 |
|
Sebastian Raschka
|
2de60d1bfb
|
Rename variable to context_length to make it easier on readers (#106)
* rename to context length
* fix spacing
|
2024-04-04 07:27:41 -05:00 |
|
rasbt
|
83adc4a2ac
|
add weight sizes
|
2024-03-31 08:48:19 -05:00 |
|
Sebastian Raschka
|
4582995ced
|
Add alternative weight loading strategy as backup (#82)
|
2024-03-20 08:43:18 -05:00 |
|