81 Commits

Author SHA1 Message Date
Sebastian Raschka
7bf70baf10
Remove duplicated cell (#212)
* add a suggestion since code snippet has been repeated.

* remove duplicated cell

---------

Co-authored-by: Shuyib <benmainye@gmail.com>
2024-06-15 12:48:34 -05:00
rasbt
c6466990bb
explain truncation in ch05 2024-06-12 19:50:11 -05:00
Sebastian Raschka
bcccda728b
check gpt files (#208) 2024-06-12 07:19:10 -05:00
Daniel Kleine
ef40f2f9ad
minor bug fixes (#207)
* fixed path arg for create_dataset_csvs()

* updated assign_check() to remove user warning
2024-06-12 06:27:56 -05:00
rasbt
e24fd98cdf
distinguish better between main chapter code and bonus materials 2024-06-11 21:07:42 -05:00
Daniel Kleine
dcbdc1d2e5
fixes for code (#206)
* updated .gitignore

* removed unused GELU import

* fixed model_configs, fixed all tensors on same device

* removed unused tiktoken

* update

* update hparam search

* remove redundant tokenizer argument

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2024-06-11 20:59:48 -05:00
rasbt
1b1fd21d64
fix typo in comment 2024-06-09 06:14:02 -05:00
Sebastian Raschka
72a073bbbf
Remove leftover instances of self.tokenizer (#201)
* Remove leftover instances of self.tokenizer

* add endoftext token
2024-06-08 14:57:34 -05:00
rasbt
6f0a5c320b
fix learning rate scheduler 2024-06-03 07:06:42 -05:00
rasbt
b352d9ef0a
update loss 2024-05-31 07:30:57 -05:00
Kumar Utsav
bc5d73857c
Update ch05.ipynb
Fixed incorrect token ids
2024-05-29 20:34:23 +05:30
Sebastian Raschka
39a831a4d8
Make header more clear 2024-05-25 10:44:12 -05:00
rasbt
98d453b666
update formatting 2024-05-24 07:20:37 -05:00
rasbt
b40c260859
update how to retrieve learning rate 2024-05-23 17:19:01 -05:00
Daniel Kleine
aa67e6e1ac
removed unnecessary .gitignore 2024-05-21 19:25:16 +00:00
rasbt
a5593f9860
change defaults to 0 temp 2024-05-19 09:04:49 -05:00
rasbt
1463b2ae47
use default value for temperature 2024-05-19 08:48:10 -05:00
rasbt
4851d5a0fa
add eos_id option for ch07 2024-05-18 12:35:40 -05:00
Daniel Kleine
cf8b6c1094
fixed empty space 2024-05-17 10:44:18 +02:00
rasbt
cd7ea15e8d
add readme 2024-05-13 08:50:55 -05:00
speed
45f6e72f40
fix 1024 characters to 1024 tokens (#152) 2024-05-11 13:17:07 -05:00
rasbt
aec169dc12
link formatting 2024-04-30 06:26:23 -05:00
Sebastian Raschka
97ed38116a
Rename drop_resid to drop_shortcut (#136) 2024-04-28 14:31:27 -05:00
rasbt
90d239b4f7
fix merge conflict 2024-04-22 07:05:40 -05:00
rasbt
72be9f4e8e
update numbering 2024-04-22 07:00:20 -05:00
rasbt
868955f6a5
file header 2024-04-22 06:53:38 -05:00
Sebastian Raschka
44b3815960
remove requests dependency (#125) 2024-04-21 14:15:05 -05:00
Sebastian Raschka
c70ddff558
Return nan if val loader is empty (#124) 2024-04-20 08:02:30 -05:00
Sebastian Raschka
155ac03f61
use torch no grad for loss (#119) 2024-04-14 08:13:07 -05:00
Sebastian Raschka
dd51d4ad83
Make datasets and loaders compatible with multiprocessing (#118) 2024-04-13 13:57:56 -05:00
Sebastian Raschka
9f3f231ac7
use correct lr 2024-04-12 19:55:07 -04:00
Sebastian Raschka
55ebabf95c
Automated link checking (#117)
* Automated link checking

* Fix links in Jupyter Nbs
2024-04-12 19:08:34 -04:00
Sebastian Raschka
e757091301
Organized setup instructions (#115)
* Organized setup instructions

* update tests

* link checker action

* raise error upon broken link

* fix links

* fix links

* delete duplicated paragraph
2024-04-10 22:09:46 -04:00
James Holcombe
05718c6b94
Use instance tokenizer (#116)
* Use instance tokenizer

* consistency updates

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-04-10 21:16:19 -04:00
Daniel Kleine
61b6e35ddf
Added PDF display support to Docker image and VS Code and updated first step for gutenberg project (#111)
* added VS Code extensions recommendations

* Added PDF display support to Docker image and VS Code

* fixed steps to download the dataset
2024-04-08 20:37:55 -04:00
rasbt
58d5bd9e39
address suggestions to improve clarity 2024-04-07 08:41:09 -05:00
rasbt
42eda8b70f
renumber exercises 2024-04-07 06:03:41 -05:00
rasbt
c5a17393fc
variable renaming for clarity 2024-04-05 07:26:42 -05:00
rasbt
8c36399e7c
rename hparams to settings 2024-04-05 07:24:46 -05:00
Daniel Kleine
44c0494406
Updated devcontainer, .gitignore and README for gutenberg project (#107)
* added ch05/03_bonus_pretraining_on_gutenberg model checkpoints and preprocessing output folders to .gitignore

* removed prettier extension, added github alerts markdown extension

* specified download instructions and fixed code markdown

* Update ch05/03_bonus_pretraining_on_gutenberg/README.md

* Update ch05/03_bonus_pretraining_on_gutenberg/README.md

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-04-05 06:53:01 -05:00
Sebastian Raschka
adc2964fc5
Fix Loss in Gutenberg bonus section (#109) 2024-04-04 20:54:09 -05:00
Sebastian Raschka
2de60d1bfb
Rename variable to context_length to make it easier on readers (#106)
* rename to context length

* fix spacing
2024-04-04 07:27:41 -05:00
Sebastian Raschka
3829ccdb34
Remove redundant dropout in MLP module (#105) 2024-04-03 20:19:08 -05:00
rasbt
e14585e954
rename batch to text 2024-04-02 20:46:53 -05:00
rasbt
7d1eadd0be
update notes 2024-04-02 18:27:13 -05:00
Sebastian Raschka
2fab89d47e
Use max size properly 2024-04-02 13:29:23 -05:00
Sebastian Raschka
4a617b8343
Gutenberg for Windows users (#99) 2024-04-02 08:54:24 -05:00
rasbt
f30dd2dd2b
improve instructions 2024-04-02 07:12:22 -05:00
rasbt
776a517d18
figure scaling 2024-04-01 08:05:01 -05:00
rasbt
ee096986ea
upload exercise solutions of ch05 2024-03-31 20:28:51 -05:00