9 Commits

Author SHA1 Message Date
rasbt
93d9dae95f
update card
2024-10-11 12:15:01 -05:00
rasbt
1f4fca9f8e
update reference numbers
2024-10-11 12:13:10 -05:00
Sebastian Raschka
6d0f59a49c
Add MFU formula as reference material (#395)
* add MFU formula as reference material

* Update previous_chapters.py
2024-10-10 19:42:53 -05:00
Daniel Kleine
81c843bdc0
minor fixes (#246)
* removed duplicated white spaces

* Update ch07/01_main-chapter-code/ch07.ipynb

* Update ch07/05_dataset-generation/llama3-ollama.ipynb

* removed duplicated white spaces

* fixed title again

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-06-25 17:30:30 -05:00
Daniel Kleine
bbb2a0c3d5
fixed num_workers (#229)
* fixed num_workers

* ch06 & ch07: added num_workers to create_dataloader_v1
2024-06-19 17:36:46 -05:00
Daniel Kleine
dcbdc1d2e5
fixes for code (#206)
* updated .gitignore

* removed unused GELU import

* fixed model_configs, fixed all tensors on same device

* removed unused tiktoken

* update

* update hparam search

* remove redundant tokenizer argument

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2024-06-11 20:59:48 -05:00
Sebastian Raschka
72a073bbbf
Remove leftover instances of self.tokenizer (#201)
* Remove leftover instances of self.tokenizer

* add endoftext token
2024-06-08 14:57:34 -05:00
rasbt
98d453b666
update formatting
2024-05-24 07:20:37 -05:00
rasbt
e5e6aaf9f1
flops analysis
2024-05-23 20:35:41 -05:00