rasbt
|
8c36399e7c
|
rename hparams to settings
|
2024-04-05 07:24:46 -05:00 |
|
Daniel Kleine
|
44c0494406
|
Updated devcontainer, .gitignore and README for gutenberg project (#107)
* added ch05/03_bonus_pretraining_on_gutenberg model checkpoints and preprocessing output folders to .gitignore
* removed prettier extension, added github alerts markdown extension
* specified download instructions and fixed code markdown
* Update ch05/03_bonus_pretraining_on_gutenberg/README.md
* Update ch05/03_bonus_pretraining_on_gutenberg/README.md
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-04-05 06:53:01 -05:00 |
|
Sebastian Raschka
|
adc2964fc5
|
Fix Loss in Gutenberg bonus section (#109)
|
2024-04-04 20:54:09 -05:00 |
|
rasbt
|
6de0417321
|
cleanup
|
2024-04-04 07:58:41 -05:00 |
|
Sebastian Raschka
|
2de60d1bfb
|
Rename variable to context_length to make it easier on readers (#106)
* rename to context length
* fix spacing
|
2024-04-04 07:27:41 -05:00 |
|
Sebastian Raschka
|
a940373a14
|
Add rsync to dockerfile
|
2024-04-03 20:28:02 -05:00 |
|
Sebastian Raschka
|
3829ccdb34
|
Remove redundant dropout in MLP module (#105)
|
2024-04-03 20:19:08 -05:00 |
|
rasbt
|
dd115c1374
|
improve importlib experience for windows users
|
2024-04-03 06:31:15 -05:00 |
|
rasbt
|
e14585e954
|
rename batch to text
|
2024-04-02 20:46:53 -05:00 |
|
rasbt
|
7d1eadd0be
|
update notes
|
2024-04-02 18:27:13 -05:00 |
|
Intelligence-Manifesto
|
96b1fde3f1
|
Typographical error (#104)
|
2024-04-02 18:07:21 -05:00 |
|
Suman Debnath
|
7b7d23a4e1
|
fixing the README for python setup under appendix-A (#102)
* fixing the README for python setup under appendix-A
* fixing the README for python setup under appendix-A
|
2024-04-02 15:51:11 -05:00 |
|
Intelligence-Manifesto
|
5a3f779405
|
code -> markdown (#101)
|
2024-04-02 14:37:45 -05:00 |
|
Sebastian Raschka
|
2fab89d47e
|
Use max size properly
|
2024-04-02 13:29:23 -05:00 |
|
Sebastian Raschka
|
4a617b8343
|
Gutenberg for Windows users (#99)
|
2024-04-02 08:54:24 -05:00 |
|
rasbt
|
f30dd2dd2b
|
improve instructions
|
2024-04-02 07:12:22 -05:00 |
|
rasbt
|
776a517d18
|
figure scaling
|
2024-04-01 08:05:01 -05:00 |
|
rasbt
|
005835bfce
|
make figures for appendix d
|
2024-03-31 21:24:41 -05:00 |
|
rasbt
|
ac2bdb02bd
|
make figures for appendix d
|
2024-03-31 21:22:49 -05:00 |
|
rasbt
|
ee096986ea
|
upload exercise solutions of ch05
|
2024-03-31 20:28:51 -05:00 |
|
Daniel Kleine
|
a6bd197897
|
updated github actions versions (#96)
* added status badges
* updated github actions versions
* Update README.md
* Update README.md
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-03-31 10:49:12 -05:00 |
|
rasbt
|
83adc4a2ac
|
add weight sizes
|
2024-03-31 08:48:19 -05:00 |
|
rasbt
|
1c173e4f44
|
update figures
|
2024-03-30 09:43:51 -05:00 |
|
rasbt
|
ca96b7aee5
|
minor updates
|
2024-03-29 20:42:32 -05:00 |
|
rasbt
|
797cfb20de
|
fix test
|
2024-03-29 09:03:36 -05:00 |
|
Jeff Hammerbacher
|
5b222e2d6f
|
Fix small typos in ch02.ipynb (#89)
|
2024-03-29 08:25:52 -05:00 |
|
rasbt
|
71b6d1b7d4
|
Merge branch 'main' of https://github.com/rasbt/LLMs-from-scratch
|
2024-03-29 08:16:29 -05:00 |
|
rasbt
|
ab1e56a323
|
reorg files and make standalone download file
|
2024-03-29 08:16:22 -05:00 |
|
Sebastian Raschka
|
4537dbf001
|
Update README.md
|
2024-03-28 09:14:52 -05:00 |
|
rasbt
|
3ad442ee90
|
skip version cell
|
2024-03-28 08:23:33 -05:00 |
|
rasbt
|
3c5b288ca0
|
minor typo fixes
|
2024-03-28 08:02:05 -05:00 |
|
rasbt
|
c10f5c9bf2
|
suggest galore
|
2024-03-27 19:58:32 -05:00 |
|
rasbt
|
f24da86abe
|
title case
|
2024-03-27 07:30:09 -05:00 |
|
rasbt
|
713b3ee188
|
add readme
|
2024-03-27 07:29:16 -05:00 |
|
rasbt
|
88b2dd780a
|
make batch loss calculation more efficient
|
2024-03-27 07:11:56 -05:00 |
|
rasbt
|
3cb5a52a1b
|
simplify calc_loss_loader
|
2024-03-26 20:34:50 -05:00 |
|
rasbt
|
c88e8edf72
|
use probas in argmax
|
2024-03-26 08:38:27 -05:00 |
|
rasbt
|
9cc9c4244e
|
simplify
|
2024-03-26 07:52:36 -05:00 |
|
rasbt
|
12fff1ddcb
|
add endoftext token
|
2024-03-26 06:47:05 -05:00 |
|
rasbt
|
de576296de
|
simplify .view code
|
2024-03-25 08:09:31 -05:00 |
|
Sebastian Raschka
|
d4989e01c5
|
Update README.md
|
2024-03-25 06:39:43 -05:00 |
|
rasbt
|
45e7826954
|
Merge branch 'main' of https://github.com/rasbt/LLMs-from-scratch
|
2024-03-24 07:09:18 -05:00 |
|
rasbt
|
c1d939c64e
|
update chapter reference
|
2024-03-24 07:09:08 -05:00 |
|
rasbt
|
0f0fdef576
|
small typo fixes
|
2024-03-23 11:28:20 -05:00 |
|
Sebastian Raschka
|
cf39abac04
|
Add and link bonus material (#84)
|
2024-03-23 07:27:43 -05:00 |
|
rasbt
|
35c6e12730
|
ignore ch05 tmp files
|
2024-03-23 06:52:08 -05:00 |
|
rasbt
|
001507481e
|
add colon and semicolon to tokenizer
|
2024-03-23 06:50:34 -05:00 |
|
Sebastian Raschka
|
5d02559993
|
small cosmetic updates (#83)
|
2024-03-22 09:15:40 -05:00 |
|
rasbt
|
075a9580ea
|
reader proj and citation
|
2024-03-21 17:55:32 -05:00 |
|
Sebastian Raschka
|
4582995ced
|
Add alternative weight loading strategy as backup (#82)
|
2024-03-20 08:43:18 -05:00 |
|