22 Commits

Author SHA1 Message Date
Sebastian Raschka
08040f024c
Test code in pytorch 2.4 (#285)
* test code in pytorch 2.4

* update
2024-07-24 21:53:41 -05:00
Sebastian Raschka
72a073bbbf
Remove leftover instances of self.tokenizer (#201)
* Remove leftover instances of self.tokenizer

* add endoftext token
2024-06-08 14:57:34 -05:00
rasbt
050c8b7b73
update pr 2024-05-26 15:38:35 -05:00
Kostyantyn Borysenko
76cdf5e299 Fix an incorrect input dimension 2024-05-26 13:05:07 -07:00
rasbt
98d453b666
update formatting 2024-05-24 07:20:37 -05:00
James Holcombe
05718c6b94
Use instance tokenizer (#116)
* Use instance tokenizer

* consistency updates

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-04-10 21:16:19 -04:00
Sebastian Raschka
2de60d1bfb
Rename variable to context_length to make it easier on readers (#106)
* rename to context length

* fix spacing
2024-04-04 07:27:41 -05:00
rasbt
3ad442ee90 skip version cell 2024-03-28 08:23:33 -05:00
Sebastian Raschka
a2cd8436cb Ch05 supplementary code (#81) 2024-03-19 09:26:26 -05:00
Sebastian Raschka
ca96abac8a Set up basic test gh worklows (#79)
* Set up basic test gh worklows

* update file paths

* env check

* add env check

* Update requirements.txt

* simplify

* upd
2024-03-18 11:58:37 -05:00
rasbt
1870b4bacd update stride param 2024-03-13 08:39:59 -05:00
rasbt
da33ce8054 remove redundant unsqueeze in mask 2024-03-09 17:42:31 -06:00
rasbt
87fcfd9245 mha variants 2024-03-06 08:30:32 -06:00
rasbt
d4754f1bdd change dim=1 to dim=-1 2024-03-04 18:54:43 -06:00
rasbt
b827bf4eea remove redundant double-unsequeeze 2024-02-29 08:31:07 -06:00
rasbt
8860e16e05 <|endoftext|> token in dataset v1 2024-01-21 12:03:04 -06:00
rasbt
92896d817c add toggle for qkv_bias 2024-01-17 07:50:57 -06:00
rasbt
dfe2c3b46f use blocksize in positional embedding 2024-01-15 08:15:33 -06:00
rasbt
9e85f13ba9 readability improvements 2024-01-15 07:36:19 -06:00
rasbt
a7b4880179 small readability updates 2024-01-14 11:58:42 -06:00
rasbt
4f161bd549 use block size variable in positional embedding layer 2023-12-28 19:05:06 +01:00
rasbt
31980a6ef1 add ch03 and TOC 2023-12-09 17:13:56 -06:00