Sebastian Raschka
|
f5a003744e
|
Update README.md
|
2024-07-30 06:55:41 -05:00 |
|
rasbt
|
0dad0a3c04
|
add state_dict example
|
2024-07-28 14:15:32 -05:00 |
|
Sebastian Raschka
|
f4fc0ededd
|
buffer tutorial
|
2024-07-27 17:06:16 -05:00 |
|
rasbt
|
7f1e071fff
|
update
|
2024-07-27 07:12:42 -05:00 |
|
Sebastian Raschka
|
deea13e5c2
|
Understanding PyTorch Buffers (#288)
|
2024-07-26 08:45:36 -05:00 |
|
Sebastian Raschka
|
08040f024c
|
Test code in pytorch 2.4 (#285)
* test code in pytorch 2.4
* update
|
2024-07-24 21:53:41 -05:00 |
|
Jeroen Van Goey
|
48bd72c890
|
fix typos, add codespell pre-commit hook (#264)
* fix typos, add codespell pre-commit hook
* Update .pre-commit-config.yaml
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-07-16 07:07:04 -05:00 |
|
rasbt
|
31806828d0
|
add links to summary sections
|
2024-06-29 07:33:26 -05:00 |
|
rasbt
|
c7f892550e
|
add clarification about :num_tokens
|
2024-06-29 07:16:42 -05:00 |
|
rasbt
|
283397aaf2
|
add main and optional sections
|
2024-06-19 17:48:25 -05:00 |
|
rasbt
|
5d1fbbbfd2
|
update dotted line
|
2024-06-17 20:17:56 -05:00 |
|
rasbt
|
aaa54b10b3
|
dim-consistency
|
2024-06-12 19:43:25 -05:00 |
|
rasbt
|
e24fd98cdf
|
distinguish better between main chapter code and bonus materials
|
2024-06-11 21:07:42 -05:00 |
|
Sebastian Raschka
|
72a073bbbf
|
Remove leftover instances of self.tokenizer (#201)
* Remove leftover instances of self.tokenizer
* add endoftext token
|
2024-06-08 14:57:34 -05:00 |
|
Sebastian Raschka
|
c303a7f36d
|
Explain value truncation in some figures (#199)
* clarify truncation
* typo fix
|
2024-06-08 13:24:37 -05:00 |
|
rasbt
|
1e12da90e6
|
clarify truncation
|
2024-06-08 13:13:43 -05:00 |
|
Sebastian Raschka
|
c577f52bfc
|
Merge pull request #184 from rasbt/api-key-approach
Change API key retrieval approach
|
2024-05-27 08:47:04 -04:00 |
|
rasbt
|
71831890a0
|
update mha dim
|
2024-05-27 07:46:29 -05:00 |
|
rasbt
|
42af52fef4
|
revert unnecessary changes
|
2024-05-27 07:37:06 -05:00 |
|
rasbt
|
87c3e78dcb
|
Revert "Revert "newline""
This reverts commit a53ca10508aacc0c2a8da5467626f5fce33ef162.
|
2024-05-27 07:32:45 -05:00 |
|
rasbt
|
a53ca10508
|
Revert "newline"
This reverts commit 23982ed3fabc277ca50a998c325d304940eb78a5.
|
2024-05-27 07:32:22 -05:00 |
|
rasbt
|
23982ed3fa
|
newline
|
2024-05-27 07:30:27 -05:00 |
|
rasbt
|
050c8b7b73
|
update pr
|
2024-05-26 15:38:35 -05:00 |
|
Kostyantyn Borysenko
|
76cdf5e299
|
Fix an incorrect input dimension
|
2024-05-26 13:05:07 -07:00 |
|
rasbt
|
98d453b666
|
update formatting
|
2024-05-24 07:20:37 -05:00 |
|
rasbt
|
3b57b6d8c4
|
make consistent with the latest production version
|
2024-05-18 12:08:39 -05:00 |
|
Sebastian Raschka
|
9a5d4d8ac9
|
Try windows runners (#133)
* try windows runners
* update triggers
* trigger with code file update
* add new status badges
|
2024-04-28 07:39:23 -05:00 |
|
Sebastian Raschka
|
4adb96d7ee
|
Make code more consistent and add projection layer (#131)
* Make code more consistent and add projection
* remove redundant buffer
|
2024-04-26 17:13:08 -05:00 |
|
Sebastian Raschka
|
7740d556a0
|
Use dim=-1 for consistency (#122)
|
2024-04-18 05:56:23 -05:00 |
|
James Holcombe
|
05718c6b94
|
Use instance tokenizer (#116)
* Use instance tokenizer
* consistency updates
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-04-10 21:16:19 -04:00 |
|
rasbt
|
6de0417321
|
cleanup
|
2024-04-04 07:58:41 -05:00 |
|
Sebastian Raschka
|
2de60d1bfb
|
Rename variable to context_length to make it easier on readers (#106)
* rename to context length
* fix spacing
|
2024-04-04 07:27:41 -05:00 |
|
rasbt
|
7d1eadd0be
|
update notes
|
2024-04-02 18:27:13 -05:00 |
|
Intelligence-Manifesto
|
96b1fde3f1
|
Typographical error (#104)
|
2024-04-02 18:07:21 -05:00 |
|
rasbt
|
3ad442ee90
|
skip version cell
|
2024-03-28 08:23:33 -05:00 |
|
Sebastian Raschka
|
cf39abac04
|
Add and link bonus material (#84)
|
2024-03-23 07:27:43 -05:00 |
|
Sebastian Raschka
|
a2cd8436cb
|
Ch05 supplementary code (#81)
|
2024-03-19 09:26:26 -05:00 |
|
Sebastian Raschka
|
ca96abac8a
|
Set up basic test gh workflows (#79)
* Set up basic test gh workflows
* update file paths
* env check
* add env check
* Update requirements.txt
* simplify
* upd
|
2024-03-18 11:58:37 -05:00 |
|
Sebastian Raschka
|
9d6da22ebb
|
Update pep8 (#78)
* simplify requirements file
* style
* apply linter
|
2024-03-18 08:16:17 -05:00 |
|
rasbt
|
4fc6de7afa
|
add notes
|
2024-03-17 09:29:06 -05:00 |
|
rasbt
|
d60da19fd0
|
add more notes and embed figures externally to save space
|
2024-03-17 09:08:38 -05:00 |
|
Intelligence-Manifesto
|
d4b4e3d0f0
|
the above -> the following
|
2024-03-15 05:00:28 +08:00 |
|
rasbt
|
1870b4bacd
|
update stride param
|
2024-03-13 08:39:59 -05:00 |
|
rasbt
|
0d517e98b9
|
update
|
2024-03-13 08:37:54 -05:00 |
|
rasbt
|
f2c8eeb6b8
|
pretraining on project gutenberg
|
2024-03-13 08:34:39 -05:00 |
|
rasbt
|
569f6bc7f0
|
benchmark numbers
|
2024-03-13 07:12:10 -05:00 |
|
taihaozesong
|
f1fa9df15c
|
Fix mha wrapper implementations in ch03 bonus
|
2024-03-13 18:02:26 +08:00 |
|
rasbt
|
321f3d33f9
|
add cuda warmup
|
2024-03-10 10:31:55 -05:00 |
|
rasbt
|
244137e8a1
|
amend
|
2024-03-10 08:05:22 -05:00 |
|
rasbt
|
76205521d7
|
different dropout behavior on macos and linux
|
2024-03-10 07:58:10 -05:00 |
|