43 Commits

Author SHA1 Message Date
rasbt
c31e99720d rename hparams to settings 2024-04-05 07:24:46 -05:00
Daniel Kleine
7d0b9b78b0 Updated devcontainer, .gitignore and README for gutenberg project (#107)
* added ch05/03_bonus_pretraining_on_gutenberg model checkpoints and preprocessing output folders to .gitignore

* removed prettier extension, added github alerts markdown extension

* specified download instructions and fixed code markdown

* Update ch05/03_bonus_pretraining_on_gutenberg/README.md

* Update ch05/03_bonus_pretraining_on_gutenberg/README.md

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-04-05 06:53:01 -05:00
Sebastian Raschka
25f533efe0 Fix Loss in Gutenberg bonus section (#109) 2024-04-04 20:54:09 -05:00
Sebastian Raschka
ccd7cebbb3 Rename variable to context_length to make it easier on readers (#106)
* rename to context length

* fix spacing
2024-04-04 07:27:41 -05:00
Sebastian Raschka
5beff4e25a Remove redundant dropout in MLP module (#105) 2024-04-03 20:19:08 -05:00
rasbt
cd12b4a937 rename batch to text 2024-04-02 20:46:53 -05:00
rasbt
21140b98d4 update notes 2024-04-02 18:27:13 -05:00
Sebastian Raschka
809c944d30 Use max size properly 2024-04-02 13:29:23 -05:00
Sebastian Raschka
5af3834760 Gutenberg for Windows users (#99) 2024-04-02 08:54:24 -05:00
rasbt
f30dd2dd2b improve instructions 2024-04-02 07:12:22 -05:00
rasbt
776a517d18 figure scaling 2024-04-01 08:05:01 -05:00
rasbt
ee096986ea upload exercise solutions of ch05 2024-03-31 20:28:51 -05:00
rasbt
83adc4a2ac add weight sizes 2024-03-31 08:48:19 -05:00
rasbt
1c173e4f44 update figures 2024-03-30 09:43:51 -05:00
rasbt
797cfb20de fix test 2024-03-29 09:03:36 -05:00
rasbt
ab1e56a323 reorg files and make standalone download file 2024-03-29 08:16:22 -05:00
rasbt
3c5b288ca0 minor typo fixes 2024-03-28 08:02:05 -05:00
rasbt
c10f5c9bf2 suggest galore 2024-03-27 19:58:32 -05:00
rasbt
88b2dd780a make batch loss calculation more efficient 2024-03-27 07:11:56 -05:00
rasbt
3cb5a52a1b simplify calc_loss_loader 2024-03-26 20:34:50 -05:00
rasbt
9cc9c4244e simplify 2024-03-26 07:52:36 -05:00
rasbt
12fff1ddcb add endoftext token 2024-03-26 06:47:05 -05:00
rasbt
de576296de simplify .view code 2024-03-25 08:09:31 -05:00
Sebastian Raschka
d4989e01c5 Update README.md 2024-03-25 06:39:43 -05:00
rasbt
45e7826954 Merge branch 'main' of https://github.com/rasbt/LLMs-from-scratch 2024-03-24 07:09:18 -05:00
rasbt
c1d939c64e update chapter reference 2024-03-24 07:09:08 -05:00
rasbt
0f0fdef576 small typo fixes 2024-03-23 11:28:20 -05:00
Sebastian Raschka
cf39abac04 Add and link bonus material (#84) 2024-03-23 07:27:43 -05:00
Sebastian Raschka
5d02559993 small cosmetic updates (#83) 2024-03-22 09:15:40 -05:00
Sebastian Raschka
4582995ced Add alternative weight loading strategy as backup (#82) 2024-03-20 08:43:18 -05:00
rasbt
820d5e3ed1 remove duplicate import 2024-03-19 20:41:35 -05:00
Sebastian Raschka
a2cd8436cb Ch05 supplementary code (#81) 2024-03-19 09:26:26 -05:00
rasbt
861a2788f3 add check for small validation sets 2024-03-19 06:34:52 -05:00
Sebastian Raschka
9d6da22ebb Update pep8 (#78)
* simplify requirements file

* style

* apply linter
2024-03-18 08:16:17 -05:00
Sebastian Raschka
329d046b5d simplify requirements file (#76) 2024-03-18 08:00:49 -05:00
Sebastian Raschka
48253c4f88 Ch05 (#75)
* add chapter 5 main code
2024-03-17 21:07:19 -05:00
rasbt
ee8efcbcf6 fix plotting 2024-03-14 07:41:45 -05:00
rasbt
f2c8eeb6b8 pretraining on project gutenberg 2024-03-13 08:34:39 -05:00
rasbt
da33ce8054 remove redundant unsqueeze in mask 2024-03-09 17:42:31 -06:00
rasbt
87fcfd9245 mha variants 2024-03-06 08:30:32 -06:00
rasbt
e0df4df433 add dropout for embedding layers 2024-03-04 07:05:06 -06:00
rasbt
d89aaf319d update folder name 2024-02-27 08:53:04 -06:00
rasbt
87a743076d hparam tuning script 2024-02-27 08:51:03 -06:00