704 Commits

Author SHA1 Message Date
rasbt
8c36399e7c rename hparams to settings 2024-04-05 07:24:46 -05:00
Daniel Kleine
44c0494406
Updated devcontainer, .gitignore and README for gutenberg project (#107)
* added ch05/03_bonus_pretraining_on_gutenberg model checkpoints and preprocessing output folders to .gitignore

* removed prettier extension, added github alerts markdown extension

* specified download instructions and fixed code markdown

* Update ch05/03_bonus_pretraining_on_gutenberg/README.md

* Update ch05/03_bonus_pretraining_on_gutenberg/README.md

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-04-05 06:53:01 -05:00
Sebastian Raschka
adc2964fc5
Fix Loss in Gutenberg bonus section (#109) 2024-04-04 20:54:09 -05:00
rasbt
6de0417321
cleanup 2024-04-04 07:58:41 -05:00
Sebastian Raschka
2de60d1bfb
Rename variable to context_length to make it easier on readers (#106)
* rename to context length

* fix spacing
2024-04-04 07:27:41 -05:00
Sebastian Raschka
a940373a14
Add rsync to dockerfile 2024-04-03 20:28:02 -05:00
Sebastian Raschka
3829ccdb34
Remove reundant dropout in MLP module (#105) 2024-04-03 20:19:08 -05:00
rasbt
dd115c1374 improve importlib experience for windows users 2024-04-03 06:31:15 -05:00
rasbt
e14585e954 rename batch to text 2024-04-02 20:46:53 -05:00
rasbt
7d1eadd0be
update notes 2024-04-02 18:27:13 -05:00
Intelligence-Manifesto
96b1fde3f1
"Typographical error (#104) 2024-04-02 18:07:21 -05:00
Suman Debnath
7b7d23a4e1
fixing the README for python setup under appendix-A (#102)
* fixing the README for python setup under appendix-A

* fixing the README for python setup under appendix-A
2024-04-02 15:51:11 -05:00
Intelligence-Manifesto
5a3f779405
code -> markdown (#101) 2024-04-02 14:37:45 -05:00
Sebastian Raschka
2fab89d47e
Use max size properly 2024-04-02 13:29:23 -05:00
Sebastian Raschka
4a617b8343
Gutenberg for Windows users (#99) 2024-04-02 08:54:24 -05:00
rasbt
f30dd2dd2b improve instructions 2024-04-02 07:12:22 -05:00
rasbt
776a517d18 figure scaling 2024-04-01 08:05:01 -05:00
rasbt
005835bfce make figures for appendix d 2024-03-31 21:24:41 -05:00
rasbt
ac2bdb02bd make figures for appendix d 2024-03-31 21:22:49 -05:00
rasbt
ee096986ea upload exercise solutions of ch05 2024-03-31 20:28:51 -05:00
Daniel Kleine
a6bd197897 updated github actions versions (#96)
* added status badges

* updated github actions versions

* Update README.md

* Update README.md

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-03-31 10:49:12 -05:00
rasbt
83adc4a2ac add weight sizes 2024-03-31 08:48:19 -05:00
rasbt
1c173e4f44 update figures 2024-03-30 09:43:51 -05:00
rasbt
ca96b7aee5 minor updates 2024-03-29 20:42:32 -05:00
rasbt
797cfb20de fix test 2024-03-29 09:03:36 -05:00
Jeff Hammerbacher
5b222e2d6f Fix small typos in ch02.ipynb (#89) 2024-03-29 08:25:52 -05:00
rasbt
71b6d1b7d4 Merge branch 'main' of https://github.com/rasbt/LLMs-from-scratch 2024-03-29 08:16:29 -05:00
rasbt
ab1e56a323 reorg files and make standalone download file 2024-03-29 08:16:22 -05:00
Sebastian Raschka
4537dbf001 Update README.md 2024-03-28 09:14:52 -05:00
rasbt
3ad442ee90 skip version cell 2024-03-28 08:23:33 -05:00
rasbt
3c5b288ca0 minor typo fixes 2024-03-28 08:02:05 -05:00
rasbt
c10f5c9bf2 suggest galore 2024-03-27 19:58:32 -05:00
rasbt
f24da86abe title case 2024-03-27 07:30:09 -05:00
rasbt
713b3ee188 add readme 2024-03-27 07:29:16 -05:00
rasbt
88b2dd780a make batch loss calculatution more efficient 2024-03-27 07:11:56 -05:00
rasbt
3cb5a52a1b simplify calc_loss_loader 2024-03-26 20:34:50 -05:00
rasbt
c88e8edf72 use probas in argmax 2024-03-26 08:38:27 -05:00
rasbt
9cc9c4244e simplify 2024-03-26 07:52:36 -05:00
rasbt
12fff1ddcb add endoftext token 2024-03-26 06:47:05 -05:00
rasbt
de576296de simplify .view code 2024-03-25 08:09:31 -05:00
Sebastian Raschka
d4989e01c5 Update README.md 2024-03-25 06:39:43 -05:00
rasbt
45e7826954 Merge branch 'main' of https://github.com/rasbt/LLMs-from-scratch 2024-03-24 07:09:18 -05:00
rasbt
c1d939c64e update chapter reference 2024-03-24 07:09:08 -05:00
rasbt
0f0fdef576 small typo fixes 2024-03-23 11:28:20 -05:00
Sebastian Raschka
cf39abac04 Add and link bonus material (#84) 2024-03-23 07:27:43 -05:00
rasbt
35c6e12730 ignore ch05 tmp files 2024-03-23 06:52:08 -05:00
rasbt
001507481e add colon and semicolon to tokenizer 2024-03-23 06:50:34 -05:00
Sebastian Raschka
5d02559993 small cosmetic updates (#83) 2024-03-22 09:15:40 -05:00
rasbt
075a9580ea reader proj and citation 2024-03-21 17:55:32 -05:00
Sebastian Raschka
4582995ced Add alternative weight loading strategy as backup (#82) 2024-03-20 08:43:18 -05:00