rasbt
|
283397aaf2
|
add main and optional sections
|
2024-06-19 17:48:25 -05:00 |
|
Daniel Kleine
|
bbb2a0c3d5
|
fixed num_workers (#229)
* fixed num_workers
* ch06 & ch07: added num_workers to create_dataloader_v1
|
2024-06-19 17:36:46 -05:00 |
|
Sebastian Raschka
|
7bf70baf10
|
Remove duplicated cell (#212)
* add a suggestion since code snippet has been repeated.
* remove duplicated cell
---------
Co-authored-by: Shuyib <benmainye@gmail.com>
|
2024-06-15 12:48:34 -05:00 |
|
rasbt
|
c6466990bb
|
explain truncation in ch05
|
2024-06-12 19:50:11 -05:00 |
|
Sebastian Raschka
|
bcccda728b
|
check gpt files (#208)
|
2024-06-12 07:19:10 -05:00 |
|
rasbt
|
1b1fd21d64
|
fix typo in comment
|
2024-06-09 06:14:02 -05:00 |
|
Sebastian Raschka
|
72a073bbbf
|
Remove leftover instances of self.tokenizer (#201)
* Remove leftover instances of self.tokenizer
* add endoftext token
|
2024-06-08 14:57:34 -05:00 |
|
rasbt
|
b352d9ef0a
|
update loss
|
2024-05-31 07:30:57 -05:00 |
|
Kumar Utsav
|
bc5d73857c
|
Update ch05.ipynb
Fixed incorrect token ids
|
2024-05-29 20:34:23 +05:30 |
|
rasbt
|
98d453b666
|
update formatting
|
2024-05-24 07:20:37 -05:00 |
|
Daniel Kleine
|
aa67e6e1ac
|
removed unnecessary .gitignore
|
2024-05-21 19:25:16 +00:00 |
|
rasbt
|
a5593f9860
|
change defaults to 0 temp
|
2024-05-19 09:04:49 -05:00 |
|
rasbt
|
1463b2ae47
|
use default value for temperature
|
2024-05-19 08:48:10 -05:00 |
|
rasbt
|
4851d5a0fa
|
add eos_id option for ch07
|
2024-05-18 12:35:40 -05:00 |
|
rasbt
|
cd7ea15e8d
|
add readme
|
2024-05-13 08:50:55 -05:00 |
|
speed
|
45f6e72f40
|
fix 1024 characters to 1024 tokens (#152)
|
2024-05-11 13:17:07 -05:00 |
|
rasbt
|
aec169dc12
|
link formatting
|
2024-04-30 06:26:23 -05:00 |
|
Sebastian Raschka
|
97ed38116a
|
Rename drop_resid to drop_shortcut (#136)
|
2024-04-28 14:31:27 -05:00 |
|
rasbt
|
90d239b4f7
|
fix merge conflict
|
2024-04-22 07:05:40 -05:00 |
|
rasbt
|
72be9f4e8e
|
update numbering
|
2024-04-22 07:00:20 -05:00 |
|
rasbt
|
868955f6a5
|
file header
|
2024-04-22 06:53:38 -05:00 |
|
Sebastian Raschka
|
44b3815960
|
remove requests dependency (#125)
|
2024-04-21 14:15:05 -05:00 |
|
Sebastian Raschka
|
c70ddff558
|
Return nan if val loader is empty (#124)
|
2024-04-20 08:02:30 -05:00 |
|
Sebastian Raschka
|
155ac03f61
|
use torch no grad for loss (#119)
|
2024-04-14 08:13:07 -05:00 |
|
Sebastian Raschka
|
dd51d4ad83
|
Make datasets and loaders compatible with multiprocessing (#118)
|
2024-04-13 13:57:56 -05:00 |
|
Sebastian Raschka
|
9f3f231ac7
|
use correct lr
|
2024-04-12 19:55:07 -04:00 |
|
Sebastian Raschka
|
55ebabf95c
|
Automated link checking (#117)
* Automated link checking
* Fix links in Jupyter Nbs
|
2024-04-12 19:08:34 -04:00 |
|
Sebastian Raschka
|
e757091301
|
Organized setup instructions (#115)
* Organized setup instructions
* update tests
* link checker action
* raise error upon broken link
* fix links
* fix links
* delete duplicated paragraph
|
2024-04-10 22:09:46 -04:00 |
|
James Holcombe
|
05718c6b94
|
Use instance tokenizer (#116)
* Use instance tokenizer
* consistency updates
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-04-10 21:16:19 -04:00 |
|
rasbt
|
58d5bd9e39
|
address suggestions to improve clarity
|
2024-04-07 08:41:09 -05:00 |
|
rasbt
|
42eda8b70f
|
renumber exercises
|
2024-04-07 06:03:41 -05:00 |
|
rasbt
|
c5a17393fc
|
variable renaming for clarity
|
2024-04-05 07:26:42 -05:00 |
|
rasbt
|
8c36399e7c
|
rename hparams to settings
|
2024-04-05 07:24:46 -05:00 |
|
Sebastian Raschka
|
adc2964fc5
|
Fix Loss in Gutenberg bonus section (#109)
|
2024-04-04 20:54:09 -05:00 |
|
Sebastian Raschka
|
2de60d1bfb
|
Rename variable to context_length to make it easier on readers (#106)
* rename to context length
* fix spacing
|
2024-04-04 07:27:41 -05:00 |
|
Sebastian Raschka
|
3829ccdb34
|
Remove redundant dropout in MLP module (#105)
|
2024-04-03 20:19:08 -05:00 |
|
rasbt
|
e14585e954
|
rename batch to text
|
2024-04-02 20:46:53 -05:00 |
|
rasbt
|
776a517d18
|
figure scaling
|
2024-04-01 08:05:01 -05:00 |
|
rasbt
|
ee096986ea
|
upload exercise solutions of ch05
|
2024-03-31 20:28:51 -05:00 |
|
rasbt
|
83adc4a2ac
|
add weight sizes
|
2024-03-31 08:48:19 -05:00 |
|
rasbt
|
1c173e4f44
|
update figures
|
2024-03-30 09:43:51 -05:00 |
|
rasbt
|
797cfb20de
|
fix test
|
2024-03-29 09:03:36 -05:00 |
|
rasbt
|
ab1e56a323
|
reorg files and make standalone download file
|
2024-03-29 08:16:22 -05:00 |
|
rasbt
|
3c5b288ca0
|
minor typo fixes
|
2024-03-28 08:02:05 -05:00 |
|
rasbt
|
88b2dd780a
|
make batch loss calculation more efficient
|
2024-03-27 07:11:56 -05:00 |
|
rasbt
|
3cb5a52a1b
|
simplify calc_loss_loader
|
2024-03-26 20:34:50 -05:00 |
|
rasbt
|
9cc9c4244e
|
simplify
|
2024-03-26 07:52:36 -05:00 |
|
rasbt
|
12fff1ddcb
|
add endoftext token
|
2024-03-26 06:47:05 -05:00 |
|
rasbt
|
de576296de
|
simplify .view code
|
2024-03-25 08:09:31 -05:00 |
|
rasbt
|
45e7826954
|
Merge branch 'main' of https://github.com/rasbt/LLMs-from-scratch
|
2024-03-24 07:09:18 -05:00 |
|