Sebastian Raschka
1f61aeb7c4
Note about SSL certificates ( #404 )
2024-10-19 16:27:19 -05:00
Sebastian Raschka
263eee8921
Test with PyTorch 2.0 and 2.4 ( #290 )
...
* Test with PyTorch 2.0 and 2.4
* Update basic-tests-old-pytorch.yml
* skip version cell
2024-07-27 15:09:02 -05:00
Sebastian Raschka
08040f024c
Test code in pytorch 2.4 ( #285 )
...
* test code in pytorch 2.4
* update
2024-07-24 21:53:41 -05:00
Thanh Tran
070a69fc8b
fix typos & inconsistent texts ( #269 )
...
Co-authored-by: TRAN <you@example.com>
2024-07-17 07:34:51 -05:00
rasbt
a33e89c12c
fixes bold font #267
2024-07-16 17:51:15 -05:00
Daniel Kleine
88186bf64a
minor: removed redundant imports ( #260 )
...
* removed duplicated imports
* removed empty cell
2024-07-05 15:33:19 -05:00
rasbt
b92dea8bc6
update decode method
2024-07-05 08:34:27 -05:00
Suman Debnath
2cdcf68598
fixing the regular expression used in the SimpleTokenizer ( #259 )
...
* fixing the regular expression used in the SimpleTokenizer class and a typo in the 2.7 Creating token embedding introduction section
* rerun
---------
Co-authored-by: rasbt <mail@sebastianraschka.com>
2024-07-04 12:27:27 -05:00
rasbt
0988996eb8
update figures
2024-07-02 17:12:42 -05:00
rasbt
31806828d0
add links to summary sections
2024-06-29 07:33:26 -05:00
Sebastian Raschka
7e78b52a30
remove redundant code lines ( #247 )
2024-06-25 21:44:19 -05:00
rasbt
7095e84fab
update with latest versions
2024-06-25 21:09:27 -05:00
Daniel Kleine
81c843bdc0
minor fixes ( #246 )
...
* removed duplicated white spaces
* Update ch07/01_main-chapter-code/ch07.ipynb
* Update ch07/05_dataset-generation/llama3-ollama.ipynb
* removed duplicated white spaces
* fixed title again
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-06-25 17:30:30 -05:00
rasbt
283397aaf2
add main and optional sections
2024-06-19 17:48:25 -05:00
Daniel Kleine
bbb2a0c3d5
fixed num_workers ( #229 )
...
* fixed num_workers
* ch06 & ch07: added num_workers to create_dataloader_v1
2024-06-19 17:36:46 -05:00
rasbt
39c4a887eb
add allowed_special={"<|endoftext|>"}
2024-06-09 06:04:02 -05:00
Sebastian Raschka
72a073bbbf
Remove leftover instances of self.tokenizer ( #201 )
...
* Remove leftover instances of self.tokenizer
* add endoftext token
2024-06-08 14:57:34 -05:00
rasbt
60f64bdc23
update figure 2.13
2024-06-01 09:38:33 -05:00
rasbt
98d453b666
update formatting
2024-05-24 07:20:37 -05:00
rasbt
5b1dcf0b33
reset cell count for better nbdiff
2024-05-22 20:27:09 -05:00
rasbt
7686d4569f
update regex
2024-05-22 20:15:31 -05:00
rasbt
ea9da3a89c
formatting for consistency with production chapter
2024-05-18 11:03:42 -05:00
rasbt
37a17e2228
simplify code
2024-05-16 20:16:25 -05:00
Sebastian Raschka
0bd2608a6c
update stride wording
2024-04-22 20:40:48 -05:00
Sebastian Raschka
dd51d4ad83
Make datesets and loaders compatible with multiprocessing ( #118 )
2024-04-13 13:57:56 -05:00
Sebastian Raschka
55ebabf95c
Automated link checking ( #117 )
...
* Automated link checking
* Fix links in Jupyter Nbs
2024-04-12 19:08:34 -04:00
James Holcombe
05718c6b94
Use instance tokenizer ( #116 )
...
* Use instance tokenizer
* consistency updates
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-04-10 21:16:19 -04:00
Sebastian Raschka
2de60d1bfb
Rename variable to context_length to make it easier on readers ( #106 )
...
* rename to context length
* fix spacing
2024-04-04 07:27:41 -05:00
Intelligence-Manifesto
5a3f779405
code -> markdown ( #101 )
2024-04-02 14:37:45 -05:00
rasbt
1c173e4f44
update figures
2024-03-30 09:43:51 -05:00
rasbt
ca96b7aee5
minor updates
2024-03-29 20:42:32 -05:00
Jeff Hammerbacher
5b222e2d6f
Fix small typos in ch02.ipynb ( #89 )
2024-03-29 08:25:52 -05:00
rasbt
001507481e
add colon and semicolon to tokenizer
2024-03-23 06:50:34 -05:00
Sebastian Raschka
a2cd8436cb
Ch05 supplementary code ( #81 )
2024-03-19 09:26:26 -05:00
Sebastian Raschka
ca96abac8a
Set up basic test gh worklows ( #79 )
...
* Set up basic test gh worklows
* update file paths
* env check
* add env check
* Update requirements.txt
* simplify
* upd
2024-03-18 11:58:37 -05:00
Intelligence-Manifesto
c49aa22738
three -> four
2024-03-17 23:40:44 +08:00
rasbt
d60da19fd0
add more notes and embed figures externally to save space
2024-03-17 09:08:38 -05:00
Xiangzhuang Shen
fa2864ddbf
fix: inner links
2024-03-11 10:52:56 +08:00
rasbt
3beaea46ce
add lowres figs for better navigation
2024-03-08 07:18:06 -06:00
rasbt
c5b17c3d67
simplify
2024-03-07 07:52:24 -06:00
rasbt
3198363c4f
add wording from three to four
2024-03-04 06:42:58 -06:00
rasbt
29672da3b0
stride consistency
2024-03-03 19:37:06 -06:00
rasbt
cc2383c4de
remove duplicated exercise code
2024-03-02 16:44:36 -06:00
rasbt
cdcd73ba7f
drop_last=True
2024-02-25 07:23:38 -06:00
Intelligence-Manifesto
a61583e31e
if -> in
...
if -> in
2024-02-09 04:18:07 +08:00
rasbt
3a5fc79b38
add and update readme files
2024-02-05 06:51:58 -06:00
rasbt
16b30ccd5c
set stride equal to max len
2024-02-03 08:50:56 -06:00
Intelligence-Manifesto
78ab68b4d3
missing single quotes
...
missing single quotes
2024-02-03 03:41:31 +08:00
Intelligence-Manifesto
004d6a2af9
a repetition of the word "by"
...
"The Verdict by by Edith Wharton" contains a repetition of the word "by." The correct form should be "The Verdict by Edith Wharton."
2024-01-26 00:43:57 +08:00
rasbt
f27c9e6135
rename create_dataloader to create_dataloader_v1
2024-01-24 07:02:05 -06:00