19 Commits

Author SHA1 Message Date
Daniel Kleine
bbb2a0c3d5
fixed num_workers (#229)
* fixed num_workers

* ch06 & ch07: added num_workers to create_dataloader_v1
2024-06-19 17:36:46 -05:00
Daniel Kleine
ef40f2f9ad
minor bug fixes (#207)
* fixed path arg for create_dataset_csvs()

* updated assign_check() to remove user warning
2024-06-12 06:27:56 -05:00
Daniel Kleine
dcbdc1d2e5
fixes for code (#206)
* updated .gitignore

* removed unused GELU import

* fixed model_configs, fixed all tensors on same device

* removed unused tiktoken

* update

* update hparam search

* remove redundant tokenizer argument

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2024-06-11 20:59:48 -05:00
Sebastian Raschka
72a073bbbf
Remove leftover instances of self.tokenizer (#201)
* Remove leftover instances of self.tokenizer

* add endoftext token
2024-06-08 14:57:34 -05:00
Sebastian Raschka
39a831a4d8
Make header more clear 2024-05-25 10:44:12 -05:00
rasbt
98d453b666
update formatting 2024-05-24 07:20:37 -05:00
rasbt
a5593f9860
change defaults to 0 temp 2024-05-19 09:04:49 -05:00
rasbt
1463b2ae47
use default value for temperature 2024-05-19 08:48:10 -05:00
rasbt
4851d5a0fa
add eos_id option for ch07 2024-05-18 12:35:40 -05:00
Daniel Kleine
cf8b6c1094 fixed empty space 2024-05-17 10:44:18 +02:00
Sebastian Raschka
97ed38116a
Rename drop_resid to drop_shortcut (#136) 2024-04-28 14:31:27 -05:00
rasbt
72be9f4e8e update numbering 2024-04-22 07:00:20 -05:00
rasbt
868955f6a5 file header 2024-04-22 06:53:38 -05:00
Sebastian Raschka
dd51d4ad83
Make datesets and loaders compatible with multiprocessing (#118) 2024-04-13 13:57:56 -05:00
James Holcombe
05718c6b94
Use instance tokenizer (#116)
* Use instance tokenizer

* consistency updates

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-04-10 21:16:19 -04:00
Sebastian Raschka
2de60d1bfb
Rename variable to context_length to make it easier on readers (#106)
* rename to context length

* fix spacing
2024-04-04 07:27:41 -05:00
Sebastian Raschka
3829ccdb34
Remove reundant dropout in MLP module (#105) 2024-04-03 20:19:08 -05:00
rasbt
83adc4a2ac add weight sizes 2024-03-31 08:48:19 -05:00
Sebastian Raschka
4582995ced Add alternative weight loading strategy as backup (#82) 2024-03-20 08:43:18 -05:00