rasbt
|
cd7ea15e8d
|
add readme
|
2024-05-13 08:50:55 -05:00 |
|
speed
|
45f6e72f40
|
fix 1024 characters to 1024 tokens (#152)
|
2024-05-11 13:17:07 -05:00 |
|
rasbt
|
aec169dc12
|
link formatting
|
2024-04-30 06:26:23 -05:00 |
|
Sebastian Raschka
|
97ed38116a
|
Rename drop_resid to drop_shortcut (#136)
|
2024-04-28 14:31:27 -05:00 |
|
rasbt
|
90d239b4f7
|
fix merge conflict
|
2024-04-22 07:05:40 -05:00 |
|
rasbt
|
72be9f4e8e
|
update numbering
|
2024-04-22 07:00:20 -05:00 |
|
rasbt
|
868955f6a5
|
file header
|
2024-04-22 06:53:38 -05:00 |
|
Sebastian Raschka
|
44b3815960
|
remove requests dependency (#125)
|
2024-04-21 14:15:05 -05:00 |
|
Sebastian Raschka
|
c70ddff558
|
Return nan if val loader is empty (#124)
|
2024-04-20 08:02:30 -05:00 |
|
Sebastian Raschka
|
155ac03f61
|
use torch no grad for loss (#119)
|
2024-04-14 08:13:07 -05:00 |
|
Sebastian Raschka
|
dd51d4ad83
|
Make datesets and loaders compatible with multiprocessing (#118)
|
2024-04-13 13:57:56 -05:00 |
|
Sebastian Raschka
|
9f3f231ac7
|
use correct lr
|
2024-04-12 19:55:07 -04:00 |
|
Sebastian Raschka
|
55ebabf95c
|
Automated link checking (#117)
* Automated link checking
* Fix links in Jupyter Nbs
|
2024-04-12 19:08:34 -04:00 |
|
Sebastian Raschka
|
e757091301
|
Organized setup instructions (#115)
* Organized setup instructions
* update tets
* link checker action
* raise error upon broken link
* fix links
* fix links
* delete duplicated paragraph
|
2024-04-10 22:09:46 -04:00 |
|
James Holcombe
|
05718c6b94
|
Use instance tokenizer (#116)
* Use instance tokenizer
* consistency updates
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-04-10 21:16:19 -04:00 |
|
rasbt
|
58d5bd9e39
|
address suggestions to improve clarity
|
2024-04-07 08:41:09 -05:00 |
|
rasbt
|
42eda8b70f
|
renumber exercises
|
2024-04-07 06:03:41 -05:00 |
|
rasbt
|
c5a17393fc
|
variable renaming for clarity
|
2024-04-05 07:26:42 -05:00 |
|
rasbt
|
8c36399e7c
|
rename hparams to settings
|
2024-04-05 07:24:46 -05:00 |
|
Sebastian Raschka
|
adc2964fc5
|
Fix Loss in Gutenberg bonus section (#109)
|
2024-04-04 20:54:09 -05:00 |
|
Sebastian Raschka
|
2de60d1bfb
|
Rename variable to context_length to make it easier on readers (#106)
* rename to context length
* fix spacing
|
2024-04-04 07:27:41 -05:00 |
|
Sebastian Raschka
|
3829ccdb34
|
Remove reundant dropout in MLP module (#105)
|
2024-04-03 20:19:08 -05:00 |
|
rasbt
|
e14585e954
|
rename batch to text
|
2024-04-02 20:46:53 -05:00 |
|
rasbt
|
776a517d18
|
figure scaling
|
2024-04-01 08:05:01 -05:00 |
|
rasbt
|
ee096986ea
|
upload exercise solutions of ch05
|
2024-03-31 20:28:51 -05:00 |
|
rasbt
|
83adc4a2ac
|
add weight sizes
|
2024-03-31 08:48:19 -05:00 |
|
rasbt
|
1c173e4f44
|
update figures
|
2024-03-30 09:43:51 -05:00 |
|
rasbt
|
797cfb20de
|
fix test
|
2024-03-29 09:03:36 -05:00 |
|
rasbt
|
ab1e56a323
|
reorg files and make standalone download file
|
2024-03-29 08:16:22 -05:00 |
|
rasbt
|
3c5b288ca0
|
minor typo fixes
|
2024-03-28 08:02:05 -05:00 |
|
rasbt
|
88b2dd780a
|
make batch loss calculatution more efficient
|
2024-03-27 07:11:56 -05:00 |
|
rasbt
|
3cb5a52a1b
|
simplify calc_loss_loader
|
2024-03-26 20:34:50 -05:00 |
|
rasbt
|
9cc9c4244e
|
simplify
|
2024-03-26 07:52:36 -05:00 |
|
rasbt
|
12fff1ddcb
|
add endoftext token
|
2024-03-26 06:47:05 -05:00 |
|
rasbt
|
de576296de
|
simplify .view code
|
2024-03-25 08:09:31 -05:00 |
|
rasbt
|
45e7826954
|
Merge branch 'main' of https://github.com/rasbt/LLMs-from-scratch
|
2024-03-24 07:09:18 -05:00 |
|
rasbt
|
c1d939c64e
|
update chapter reference
|
2024-03-24 07:09:08 -05:00 |
|
rasbt
|
0f0fdef576
|
small typo fixes
|
2024-03-23 11:28:20 -05:00 |
|
Sebastian Raschka
|
5d02559993
|
small cosmetic updates (#83)
|
2024-03-22 09:15:40 -05:00 |
|
Sebastian Raschka
|
4582995ced
|
Add alternative weight loading strategy as backup (#82)
|
2024-03-20 08:43:18 -05:00 |
|
rasbt
|
820d5e3ed1
|
remove duplicate import
|
2024-03-19 20:41:35 -05:00 |
|
Sebastian Raschka
|
a2cd8436cb
|
Ch05 supplementary code (#81)
|
2024-03-19 09:26:26 -05:00 |
|
rasbt
|
861a2788f3
|
add check for small validation sets
|
2024-03-19 06:34:52 -05:00 |
|
Sebastian Raschka
|
9d6da22ebb
|
Update pep8 (#78)
* simplify requirements file
* style
* apply linter
|
2024-03-18 08:16:17 -05:00 |
|
Sebastian Raschka
|
329d046b5d
|
simplify requirements file (#76)
|
2024-03-18 08:00:49 -05:00 |
|
Sebastian Raschka
|
48253c4f88
|
Ch05 (#75)
* add chapter 5 main code
|
2024-03-17 21:07:19 -05:00 |
|