Tao Qian
cec445f146
Minor readability improvement in dataloader.ipynb ( #461 )
* Minor readability improvement in dataloader.ipynb
- The tokenizer and encoded_text variables at the root level are unused.
- The default params for create_dataloader_v1 are confusing, especially the default batch_size of 4, which happens to equal the max_length.
* readability improvements
---------
Co-authored-by: rasbt <mail@sebastianraschka.com>
2025-01-04 11:26:10 -06:00
Sebastian Raschka
263eee8921
Test with PyTorch 2.0 and 2.4 ( #290 )
* Test with PyTorch 2.0 and 2.4
* Update basic-tests-old-pytorch.yml
* skip version cell
2024-07-27 15:09:02 -05:00
Sebastian Raschka
08040f024c
Test code in pytorch 2.4 ( #285 )
* test code in pytorch 2.4
* update
2024-07-24 21:53:41 -05:00
Daniel Kleine
bbb2a0c3d5
fixed num_workers ( #229 )
* fixed num_workers
* ch06 & ch07: added num_workers to create_dataloader_v1
2024-06-19 17:36:46 -05:00
rasbt
98d453b666
update formatting
2024-05-24 07:20:37 -05:00
Sebastian Raschka
dd51d4ad83
Make datasets and loaders compatible with multiprocessing ( #118 )
2024-04-13 13:57:56 -05:00
James Holcombe
05718c6b94
Use instance tokenizer ( #116 )
* Use instance tokenizer
* consistency updates
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-04-10 21:16:19 -04:00
Sebastian Raschka
2de60d1bfb
Rename variable to context_length to make it easier on readers ( #106 )
* rename to context length
* fix spacing
2024-04-04 07:27:41 -05:00
Sebastian Raschka
a2cd8436cb
Ch05 supplementary code ( #81 )
2024-03-19 09:26:26 -05:00
Sebastian Raschka
ca96abac8a
Set up basic test gh workflows ( #79 )
* Set up basic test gh workflows
* update file paths
* env check
* add env check
* Update requirements.txt
* simplify
* upd
2024-03-18 11:58:37 -05:00
rasbt
cdcd73ba7f
drop_last=True
2024-02-25 07:23:38 -06:00
rasbt
16b30ccd5c
set stride equal to max len
2024-02-03 08:50:56 -06:00
rasbt
f27c9e6135
rename create_dataloader to create_dataloader_v1
2024-01-24 07:02:05 -06:00
rasbt
8860e16e05
<|endoftext|> token in dataset v1
2024-01-21 12:03:04 -06:00
rasbt
92896d817c
add toggle for qkv_bias
2024-01-17 07:50:57 -06:00
rasbt
ea4b6c4e5f
add package versions to the top of the notebook
2024-01-01 19:41:18 +01:00
rasbt
4f161bd549
use block size variable in positional embedding layer
2023-12-28 19:05:06 +01:00
rasbt
c8825b7c22
add exercise solutions
2023-10-27 06:19:40 -05:00
Sebastian R
ab1261d9b1
cleanup and minimal notebook
2023-10-15 17:15:20 -05:00