rasbt
|
9d84935b69
|
add eos_id option for ch07
|
2024-05-18 12:35:40 -05:00 |
|
Daniel Kleine
|
e6012b944e
|
fixed empty space
|
2024-05-17 10:44:18 +02:00 |
|
Sebastian Raschka
|
a5b353667d
|
Rename drop_resid to drop_shortcut (#136)
|
2024-04-28 14:31:27 -05:00 |
|
rasbt
|
df4fc602d8
|
update numbering
|
2024-04-22 07:00:20 -05:00 |
|
rasbt
|
2dd7bf9cda
|
file header
|
2024-04-22 06:53:38 -05:00 |
|
Sebastian Raschka
|
bae4b0fb08
|
Make datasets and loaders compatible with multiprocessing (#118)
|
2024-04-13 13:57:56 -05:00 |
|
James Holcombe
|
0b866c133f
|
Use instance tokenizer (#116)
* Use instance tokenizer
* consistency updates
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-04-10 21:16:19 -04:00 |
|
Sebastian Raschka
|
ccd7cebbb3
|
Rename variable to context_length to make it easier on readers (#106)
* rename to context length
* fix spacing
|
2024-04-04 07:27:41 -05:00 |
|
Sebastian Raschka
|
5beff4e25a
|
Remove redundant dropout in MLP module (#105)
|
2024-04-03 20:19:08 -05:00 |
|
rasbt
|
83adc4a2ac
|
add weight sizes
|
2024-03-31 08:48:19 -05:00 |
|
Sebastian Raschka
|
4582995ced
|
Add alternative weight loading strategy as backup (#82)
|
2024-03-20 08:43:18 -05:00 |
|