Daniel Kleine
|
bbb2a0c3d5
|
fixed num_workers (#229)
* fixed num_workers
* ch06 & ch07: added num_workers to create_dataloader_v1
|
2024-06-19 17:36:46 -05:00 |
|
rasbt
|
39c4a887eb
|
add allowed_special={"<|endoftext|>"}
|
2024-06-09 06:04:02 -05:00 |
|
Sebastian Raschka
|
97ed38116a
|
Rename drop_resid to drop_shortcut (#136)
|
2024-04-28 14:31:27 -05:00 |
|
Sebastian Raschka
|
dd51d4ad83
|
Make datesets and loaders compatible with multiprocessing (#118)
|
2024-04-13 13:57:56 -05:00 |
|
James Holcombe
|
05718c6b94
|
Use instance tokenizer (#116)
* Use instance tokenizer
* consistency updates
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-04-10 21:16:19 -04:00 |
|
Sebastian Raschka
|
2de60d1bfb
|
Rename variable to context_length to make it easier on readers (#106)
* rename to context length
* fix spacing
|
2024-04-04 07:27:41 -05:00 |
|
Sebastian Raschka
|
3829ccdb34
|
Remove reundant dropout in MLP module (#105)
|
2024-04-03 20:19:08 -05:00 |
|
Sebastian Raschka
|
ca96abac8a
|
Set up basic test gh worklows (#79)
* Set up basic test gh worklows
* update file paths
* env check
* add env check
* Update requirements.txt
* simplify
* upd
|
2024-03-18 11:58:37 -05:00 |
|
Sebastian Raschka
|
9d6da22ebb
|
Update pep8 (#78)
* simplify requirements file
* style
* apply linter
|
2024-03-18 08:16:17 -05:00 |
|
rasbt
|
da33ce8054
|
remove redundant unsqueeze in mask
|
2024-03-09 17:42:31 -06:00 |
|
rasbt
|
87fcfd9245
|
mha variants
|
2024-03-06 08:30:32 -06:00 |
|
rasbt
|
e0df4df433
|
add dropout for embedding layers
|
2024-03-04 07:05:06 -06:00 |
|
rasbt
|
b827bf4eea
|
remove redundant double-unsequeeze
|
2024-02-29 08:31:07 -06:00 |
|
rasbt
|
cdcd73ba7f
|
drop_last=True
|
2024-02-25 07:23:38 -06:00 |
|
rasbt
|
4e68649f16
|
comment update
|
2024-02-24 06:52:17 -06:00 |
|
rasbt
|
fe332006de
|
ch4 exercise solutions
|
2024-02-11 11:51:39 -06:00 |
|
rasbt
|
5d1d8ce511
|
add shape information for clarity
|
2024-02-08 20:16:54 -06:00 |
|
rasbt
|
ec312e581b
|
add chapter 4 code
|
2024-02-04 10:02:05 -06:00 |
|