40 Commits

Author SHA1 Message Date
rasbt
1b1fd21d64
fix typo in comment 2024-06-09 06:14:02 -05:00
rasbt
b352d9ef0a
update loss 2024-05-31 07:30:57 -05:00
Kumar Utsav
bc5d73857c
Update ch05.ipynb
Fixed incorrect token ids
2024-05-29 20:34:23 +05:30
rasbt
98d453b666
update formatting 2024-05-24 07:20:37 -05:00
rasbt
a5593f9860
change defaults to 0 temp 2024-05-19 09:04:49 -05:00
rasbt
1463b2ae47
use default value for temperature 2024-05-19 08:48:10 -05:00
rasbt
4851d5a0fa
add eos_id option for ch07 2024-05-18 12:35:40 -05:00
speed
45f6e72f40
fix 1024 characters to 1024 tokens (#152) 2024-05-11 13:17:07 -05:00
rasbt
aec169dc12 link formatting 2024-04-30 06:26:23 -05:00
Sebastian Raschka
c70ddff558
Return nan if val loader is empty (#124) 2024-04-20 08:02:30 -05:00
Sebastian Raschka
155ac03f61
use torch no grad for loss (#119) 2024-04-14 08:13:07 -05:00
Sebastian Raschka
dd51d4ad83
Make datesets and loaders compatible with multiprocessing (#118) 2024-04-13 13:57:56 -05:00
Sebastian Raschka
9f3f231ac7 use correct lr 2024-04-12 19:55:07 -04:00
Sebastian Raschka
55ebabf95c
Automated link checking (#117)
* Automated link checking

* Fix links in Jupyter Nbs
2024-04-12 19:08:34 -04:00
Sebastian Raschka
e757091301
Organized setup instructions (#115)
* Organized setup instructions

* update tets

* link checker action

* raise error upon broken link

* fix links

* fix links

* delete duplicated paragraph
2024-04-10 22:09:46 -04:00
rasbt
58d5bd9e39 address suggestions to improve clarity 2024-04-07 08:41:09 -05:00
rasbt
c5a17393fc variable renaming for clarity 2024-04-05 07:26:42 -05:00
rasbt
8c36399e7c rename hparams to settings 2024-04-05 07:24:46 -05:00
Sebastian Raschka
adc2964fc5
Fix Loss in Gutenberg bonus section (#109) 2024-04-04 20:54:09 -05:00
Sebastian Raschka
2de60d1bfb
Rename variable to context_length to make it easier on readers (#106)
* rename to context length

* fix spacing
2024-04-04 07:27:41 -05:00
Sebastian Raschka
3829ccdb34
Remove reundant dropout in MLP module (#105) 2024-04-03 20:19:08 -05:00
rasbt
e14585e954 rename batch to text 2024-04-02 20:46:53 -05:00
rasbt
776a517d18 figure scaling 2024-04-01 08:05:01 -05:00
rasbt
ee096986ea upload exercise solutions of ch05 2024-03-31 20:28:51 -05:00
rasbt
83adc4a2ac add weight sizes 2024-03-31 08:48:19 -05:00
rasbt
1c173e4f44 update figures 2024-03-30 09:43:51 -05:00
rasbt
ab1e56a323 reorg files and make standalone download file 2024-03-29 08:16:22 -05:00
rasbt
3c5b288ca0 minor typo fixes 2024-03-28 08:02:05 -05:00
rasbt
88b2dd780a make batch loss calculatution more efficient 2024-03-27 07:11:56 -05:00
rasbt
3cb5a52a1b simplify calc_loss_loader 2024-03-26 20:34:50 -05:00
rasbt
9cc9c4244e simplify 2024-03-26 07:52:36 -05:00
rasbt
12fff1ddcb add endoftext token 2024-03-26 06:47:05 -05:00
rasbt
de576296de simplify .view code 2024-03-25 08:09:31 -05:00
rasbt
45e7826954 Merge branch 'main' of https://github.com/rasbt/LLMs-from-scratch 2024-03-24 07:09:18 -05:00
rasbt
c1d939c64e update chapter reference 2024-03-24 07:09:08 -05:00
rasbt
0f0fdef576 small typo fixes 2024-03-23 11:28:20 -05:00
Sebastian Raschka
5d02559993 small cosmetic updates (#83) 2024-03-22 09:15:40 -05:00
rasbt
820d5e3ed1 remove duplicate import 2024-03-19 20:41:35 -05:00
Sebastian Raschka
a2cd8436cb Ch05 supplementary code (#81) 2024-03-19 09:26:26 -05:00
rasbt
861a2788f3 add check for small validation sets 2024-03-19 06:34:52 -05:00