19 Commits

Author SHA1 Message Date
Tao Qian
cec445f146
Minor readability improvement in dataloader.ipynb (#461)
* Minor readability improvement in dataloader.ipynb

- The tokenizer and encoded_text variables at the root level are unused.
- The default params for create_dataloader_v1 are confusing, especially for the default batch_size 4, which happens to be the same as the max_length.

* readability improvements

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2025-01-04 11:26:10 -06:00
Sebastian Raschka
263eee8921
Test with PyTorch 2.0 and 2.4 (#290)
* Test with PyTorch 2.0 and 2.4

* Update basic-tests-old-pytorch.yml

* skip version cell
2024-07-27 15:09:02 -05:00
Sebastian Raschka
08040f024c
Test code in pytorch 2.4 (#285)
* test code in pytorch 2.4

* update
2024-07-24 21:53:41 -05:00
Daniel Kleine
bbb2a0c3d5
fixed num_workers (#229)
* fixed num_workers

* ch06 & ch07: added num_workers to create_dataloader_v1
2024-06-19 17:36:46 -05:00
rasbt
98d453b666
update formatting 2024-05-24 07:20:37 -05:00
Sebastian Raschka
dd51d4ad83
Make datesets and loaders compatible with multiprocessing (#118) 2024-04-13 13:57:56 -05:00
James Holcombe
05718c6b94
Use instance tokenizer (#116)
* Use instance tokenizer

* consistency updates

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-04-10 21:16:19 -04:00
Sebastian Raschka
2de60d1bfb
Rename variable to context_length to make it easier on readers (#106)
* rename to context length

* fix spacing
2024-04-04 07:27:41 -05:00
Sebastian Raschka
a2cd8436cb Ch05 supplementary code (#81) 2024-03-19 09:26:26 -05:00
Sebastian Raschka
ca96abac8a Set up basic test gh worklows (#79)
* Set up basic test gh worklows

* update file paths

* env check

* add env check

* Update requirements.txt

* simplify

* upd
2024-03-18 11:58:37 -05:00
rasbt
cdcd73ba7f drop_last=True 2024-02-25 07:23:38 -06:00
rasbt
16b30ccd5c set stride equal to max len 2024-02-03 08:50:56 -06:00
rasbt
f27c9e6135 rename create_dataloader to create_dataloader_v1 2024-01-24 07:02:05 -06:00
rasbt
8860e16e05 <|endoftext|> token in dataset v1 2024-01-21 12:03:04 -06:00
rasbt
92896d817c add toggle for qkv_bias 2024-01-17 07:50:57 -06:00
rasbt
ea4b6c4e5f add package versions to the top of the notebook 2024-01-01 19:41:18 +01:00
rasbt
4f161bd549 use block size variable in positional embedding layer 2023-12-28 19:05:06 +01:00
rasbt
c8825b7c22 add exercise solutions 2023-10-27 06:19:40 -05:00
Sebastian R
ab1261d9b1 cleanup and minimal notebook 2023-10-15 17:15:20 -05:00