Sebastian Raschka
|
08040f024c
|
Test code in pytorch 2.4 (#285)
* test code in pytorch 2.4
* update
|
2024-07-24 21:53:41 -05:00 |
|
Sebastian Raschka
|
72a073bbbf
|
Remove leftover instances of self.tokenizer (#201)
* Remove leftover instances of self.tokenizer
* add endoftext token
|
2024-06-08 14:57:34 -05:00 |
|
rasbt
|
050c8b7b73
|
update pr
|
2024-05-26 15:38:35 -05:00 |
|
Kostyantyn Borysenko
|
76cdf5e299
|
Fix an incorrect input dimension
|
2024-05-26 13:05:07 -07:00 |
|
rasbt
|
98d453b666
|
update formatting
|
2024-05-24 07:20:37 -05:00 |
|
James Holcombe
|
05718c6b94
|
Use instance tokenizer (#116)
* Use instance tokenizer
* consistency updates
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-04-10 21:16:19 -04:00 |
|
Sebastian Raschka
|
2de60d1bfb
|
Rename variable to context_length to make it easier on readers (#106)
* rename to context length
* fix spacing
|
2024-04-04 07:27:41 -05:00 |
|
rasbt
|
3ad442ee90
|
skip version cell
|
2024-03-28 08:23:33 -05:00 |
|
Sebastian Raschka
|
a2cd8436cb
|
Ch05 supplementary code (#81)
|
2024-03-19 09:26:26 -05:00 |
|
Sebastian Raschka
|
ca96abac8a
|
Set up basic test gh worklows (#79)
* Set up basic test gh worklows
* update file paths
* env check
* add env check
* Update requirements.txt
* simplify
* upd
|
2024-03-18 11:58:37 -05:00 |
|
rasbt
|
1870b4bacd
|
update stride param
|
2024-03-13 08:39:59 -05:00 |
|
rasbt
|
da33ce8054
|
remove redundant unsqueeze in mask
|
2024-03-09 17:42:31 -06:00 |
|
rasbt
|
87fcfd9245
|
mha variants
|
2024-03-06 08:30:32 -06:00 |
|
rasbt
|
d4754f1bdd
|
change dim=1 to dim=-1
|
2024-03-04 18:54:43 -06:00 |
|
rasbt
|
b827bf4eea
|
remove redundant double-unsequeeze
|
2024-02-29 08:31:07 -06:00 |
|
rasbt
|
8860e16e05
|
<|endoftext|> token in dataset v1
|
2024-01-21 12:03:04 -06:00 |
|
rasbt
|
92896d817c
|
add toggle for qkv_bias
|
2024-01-17 07:50:57 -06:00 |
|
rasbt
|
dfe2c3b46f
|
use blocksize in positional embedding
|
2024-01-15 08:15:33 -06:00 |
|
rasbt
|
9e85f13ba9
|
readability improvements
|
2024-01-15 07:36:19 -06:00 |
|
rasbt
|
a7b4880179
|
small readability updates
|
2024-01-14 11:58:42 -06:00 |
|
rasbt
|
4f161bd549
|
use block size variable in positional embedding layer
|
2023-12-28 19:05:06 +01:00 |
|
rasbt
|
31980a6ef1
|
add ch03 and TOC
|
2023-12-09 17:13:56 -06:00 |
|