60 Commits

Author SHA1 Message Date
rasbt
ea9da3a89c
formatting for consistency with production chapter 2024-05-18 11:03:42 -05:00
rasbt
37a17e2228
simplify code 2024-05-16 20:16:25 -05:00
Sebastian Raschka
fc3d70f72f
Data loader intuition with numbers (#132)
* data loader intuition with numbers

* fix link

* fix tests
2024-04-27 07:56:41 -05:00
rasbt
c7cdedf981
update figures in bonus notebook 2024-04-23 21:01:27 -05:00
Sebastian Raschka
0bd2608a6c update stride wording 2024-04-22 20:40:48 -05:00
Sebastian Raschka
dd51d4ad83
Make datasets and loaders compatible with multiprocessing (#118) 2024-04-13 13:57:56 -05:00
Sebastian Raschka
55ebabf95c
Automated link checking (#117)
* Automated link checking

* Fix links in Jupyter Nbs
2024-04-12 19:08:34 -04:00
James Holcombe
05718c6b94
Use instance tokenizer (#116)
* Use instance tokenizer

* consistency updates

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-04-10 21:16:19 -04:00
Sebastian Raschka
2de60d1bfb
Rename variable to context_length to make it easier on readers (#106)
* rename to context length

* fix spacing
2024-04-04 07:27:41 -05:00
rasbt
dd115c1374 improve importlib experience for windows users 2024-04-03 06:31:15 -05:00
Intelligence-Manifesto
5a3f779405
code -> markdown (#101) 2024-04-02 14:37:45 -05:00
rasbt
1c173e4f44 update figures 2024-03-30 09:43:51 -05:00
rasbt
ca96b7aee5 minor updates 2024-03-29 20:42:32 -05:00
Jeff Hammerbacher
5b222e2d6f Fix small typos in ch02.ipynb (#89) 2024-03-29 08:25:52 -05:00
Sebastian Raschka
cf39abac04 Add and link bonus material (#84) 2024-03-23 07:27:43 -05:00
rasbt
001507481e add colon and semicolon to tokenizer 2024-03-23 06:50:34 -05:00
Sebastian Raschka
a2cd8436cb Ch05 supplementary code (#81) 2024-03-19 09:26:26 -05:00
Sebastian Raschka
ca96abac8a Set up basic test gh workflows (#79)
* Set up basic test gh workflows

* update file paths

* env check

* add env check

* Update requirements.txt

* simplify

* upd
2024-03-18 11:58:37 -05:00
Sebastian Raschka
9d6da22ebb Update pep8 (#78)
* simplify requirements file

* style

* apply linter
2024-03-18 08:16:17 -05:00
Intelligence-Manifesto
c49aa22738 three -> four 2024-03-17 23:40:44 +08:00
rasbt
d60da19fd0 add more notes and embed figures externally to save space 2024-03-17 09:08:38 -05:00
Xiangzhuang Shen
fa2864ddbf fix: inner links 2024-03-11 10:52:56 +08:00
Ikko Eltociear Ashimine
47519f4d14 Update compare-bpe-tiktoken.ipynb
HuggingFace -> Hugging Face
2024-03-10 01:11:35 +09:00
rasbt
3beaea46ce add lowres figs for better navigation 2024-03-08 07:18:06 -06:00
rasbt
c5b17c3d67 simplify 2024-03-07 07:52:24 -06:00
rasbt
3198363c4f add wording from three to four 2024-03-04 06:42:58 -06:00
rasbt
29672da3b0 stride consistency 2024-03-03 19:37:06 -06:00
rasbt
742f0a6d29 add missing output in bonus 2024-03-03 17:29:46 -06:00
rasbt
f526a8d7fb add requirements file for bonus notebook 2024-03-02 16:54:24 -06:00
rasbt
cc2383c4de remove duplicated exercise code 2024-03-02 16:44:36 -06:00
rasbt
cdcd73ba7f drop_last=True 2024-02-25 07:23:38 -06:00
Intelligence-Manifesto
a61583e31e if -> in
2024-02-09 04:18:07 +08:00
rasbt
3a5fc79b38 add and update readme files 2024-02-05 06:51:58 -06:00
rasbt
16b30ccd5c set stride equal to max len 2024-02-03 08:50:56 -06:00
Intelligence-Manifesto
78ab68b4d3 missing single quotes
2024-02-03 03:41:31 +08:00
Ikko Eltociear Ashimine
3a3a4ac1f1 Update bpe_openai_gpt2.py
signficant -> significant
2024-01-28 23:26:23 +09:00
Intelligence-Manifesto
004d6a2af9 a repetition of the word "by"
"The Verdict by by Edith Wharton" contains a repetition of the word "by." The correct form should be "The Verdict by Edith Wharton."
2024-01-26 00:43:57 +08:00
rasbt
f27c9e6135 rename create_dataloader to create_dataloader_v1 2024-01-24 07:02:05 -06:00
rasbt
8860e16e05 <|endoftext|> token in dataset v1 2024-01-21 12:03:04 -06:00
rasbt
92896d817c add toggle for qkv_bias 2024-01-17 07:50:57 -06:00
rasbt
0074c98968 add download utilities for vocab and encoder files 2024-01-15 17:08:04 -06:00
rasbt
c400f77f26 update exercise solutions 2024-01-13 14:49:02 -06:00
rasbt
f279134492 small cosmetic fixes and improvements 2024-01-10 08:01:19 -06:00
rasbt
ea4b6c4e5f add package versions to the top of the notebook 2024-01-01 19:41:18 +01:00
rasbt
4f161bd549 use block size variable in positional embedding layer 2023-12-28 19:05:06 +01:00
Pietro Monticone
40698e63b3 Update embeddings-and-linear-layers.ipynb 2023-12-17 16:01:46 +01:00
Pietro Monticone
1bebe21b91 Update ch02.ipynb 2023-12-17 16:01:43 +01:00
Sebastian Raschka
220df4ffb3 Delete ch02/03_bonus_embedding-vs-matmul/.DS_Store 2023-12-10 08:18:25 -06:00
Sebastian Raschka
a16585049e Delete ch02/.DS_Store 2023-12-10 08:18:11 -06:00
rasbt
d82a5d6c08 remove temp files 2023-12-09 17:20:40 -06:00