419 Commits

Author SHA1 Message Date
rasbt
3cb5a52a1b simplify calc_loss_loader 2024-03-26 20:34:50 -05:00
rasbt
c88e8edf72 use probas in argmax 2024-03-26 08:38:27 -05:00
rasbt
9cc9c4244e simplify 2024-03-26 07:52:36 -05:00
rasbt
12fff1ddcb add endoftext token 2024-03-26 06:47:05 -05:00
rasbt
de576296de simplify .view code 2024-03-25 08:09:31 -05:00
Sebastian Raschka
d4989e01c5 Update README.md 2024-03-25 06:39:43 -05:00
rasbt
45e7826954 Merge branch 'main' of https://github.com/rasbt/LLMs-from-scratch 2024-03-24 07:09:18 -05:00
rasbt
c1d939c64e update chapter reference 2024-03-24 07:09:08 -05:00
rasbt
0f0fdef576 small typo fixes 2024-03-23 11:28:20 -05:00
Sebastian Raschka
cf39abac04 Add and link bonus material (#84) 2024-03-23 07:27:43 -05:00
rasbt
35c6e12730 ignore ch05 tmp files 2024-03-23 06:52:08 -05:00
rasbt
001507481e add colon and semicolon to tokenizer 2024-03-23 06:50:34 -05:00
Sebastian Raschka
5d02559993 small cosmetic updates (#83) 2024-03-22 09:15:40 -05:00
rasbt
075a9580ea reader proj and citation 2024-03-21 17:55:32 -05:00
Sebastian Raschka
4582995ced Add alternative weight loading strategy as backup (#82) 2024-03-20 08:43:18 -05:00
rasbt
820d5e3ed1 remove duplicate import 2024-03-19 20:41:35 -05:00
rasbt
4bab1b6f33 remove redundant dir 2024-03-19 09:27:27 -05:00
Sebastian Raschka
a2cd8436cb Ch05 supplementary code (#81) 2024-03-19 09:26:26 -05:00
rasbt
861a2788f3 add check for small validation sets 2024-03-19 06:34:52 -05:00
Sebastian Raschka
ca96abac8a Set up basic test gh worklows (#79)
* Set up basic test gh worklows
* update file paths
* env check
* add env check
* Update requirements.txt
* simplify
* upd
2024-03-18 11:58:37 -05:00
Sebastian Raschka
9d6da22ebb Update pep8 (#78)
* simplify requirements file
* style
* apply linter
2024-03-18 08:16:17 -05:00
Sebastian Raschka
e316cafd9f Update pep8-linter.yml 2024-03-18 08:16:08 -05:00
Sebastian Raschka
329d046b5d simplify requirements file (#76) 2024-03-18 08:00:49 -05:00
Sebastian Raschka
3e122fa656 Update pep8-linter.yml 2024-03-18 07:57:17 -05:00
Sebastian Raschka
805e352737 Update pep8-linter.yml 2024-03-18 07:47:30 -05:00
Sebastian Raschka
9acb589650 Update pep8-linter.yml 2024-03-18 07:41:23 -05:00
Sebastian Raschka
e213a0cede Create pep8-linter.yml 2024-03-18 07:00:28 -05:00
Sebastian Raschka
48253c4f88 Ch05 (#75)
* add chapter 5 main code
2024-03-17 21:07:19 -05:00
Sebastian Raschka
3e25216240 Merge pull request #74 from Intelligence-Manifesto/patch-7
three -> four
2024-03-17 16:03:36 -05:00
Intelligence-Manifesto
c49aa22738 three -> four 2024-03-17 23:40:44 +08:00
rasbt
4fc6de7afa add notes 2024-03-17 09:29:06 -05:00
Sebastian Raschka
b58f66b684 Merge pull request #73 from rasbt/notes-ext-figures
add more notes and embed figures externally to save space
2024-03-17 09:09:08 -05:00
rasbt
d60da19fd0 add more notes and embed figures externally to save space 2024-03-17 09:08:38 -05:00
rasbt
b655e628a2 revert back to Apache 2.0 2024-03-17 08:07:31 -05:00
rasbt
861c296312 add imports and version on top 2024-03-16 09:50:00 -05:00
rasbt
ff8657ac92 fix ipywidgets formatting issue 2024-03-16 08:35:43 -05:00
rasbt
a155879d71 update formatting 2024-03-16 08:10:58 -05:00
Sebastian Raschka
44b0febe68 Merge pull request #71 from Intelligence-Manifesto/patch-6
the above -> the following
2024-03-15 16:07:22 -05:00
Intelligence-Manifesto
d4b4e3d0f0 the above -> the following 2024-03-15 05:00:28 +08:00
rasbt
ee8efcbcf6 fix plotting 2024-03-14 07:41:45 -05:00
Sebastian Raschka
f25760c394 Merge pull request #70 from d-kleine/main
Updated Docker readme
2024-03-14 06:50:26 -05:00
Daniel Kleine
809ea9d196 Update README.md
updated readme for Docker with CUDA support instructions
2024-03-13 18:51:20 +01:00
rasbt
1870b4bacd update stride param 2024-03-13 08:39:59 -05:00
Sebastian Raschka
0b66c55950 Merge pull request #69 from rasbt/pretraining-on-proj-gutenberg
Pretraining on Project Gutenberg
2024-03-13 08:38:33 -05:00
rasbt
0d517e98b9 update 2024-03-13 08:37:54 -05:00
rasbt
f2c8eeb6b8 pretraining on project gutenberg 2024-03-13 08:34:39 -05:00
rasbt
569f6bc7f0 benchmark numbers 2024-03-13 07:12:10 -05:00
Sebastian Raschka
319e919062 Merge pull request #68 from taihaozesong/fix_ch03_impl_wrapper
Fix mha wrapper implementations in ch03 bonus
2024-03-13 07:02:13 -05:00
taihaozesong
f1fa9df15c Fix mha wrapper implementations in ch03 bonus 2024-03-13 18:02:26 +08:00
Sebastian Raschka
00b121a5af Merge pull request #66 from rasbt/appendix-d
Add appendix D
2024-03-11 07:08:57 -05:00