Sebastian Raschka
|
7bf70baf10
|
Remove duplicated cell (#212)
* add a suggestion since code snippet has been repeated.
* remove duplicated cell
---------
Co-authored-by: Shuyib <benmainye@gmail.com>
|
2024-06-15 12:48:34 -05:00 |
|
rasbt
|
c6466990bb
|
explain truncation in ch05
|
2024-06-12 19:50:11 -05:00 |
|
Sebastian Raschka
|
bcccda728b
|
check gpt files (#208)
|
2024-06-12 07:19:10 -05:00 |
|
Daniel Kleine
|
ef40f2f9ad
|
minor bug fixes (#207)
* fixed path arg for create_dataset_csvs()
* updated assign_check() to remove user warning
|
2024-06-12 06:27:56 -05:00 |
|
rasbt
|
e24fd98cdf
|
distinguish better between main chapter code and bonus materials
|
2024-06-11 21:07:42 -05:00 |
|
Daniel Kleine
|
dcbdc1d2e5
|
fixes for code (#206)
* updated .gitignore
* removed unused GELU import
* fixed model_configs, fixed all tensors on same device
* removed unused tiktoken
* update
* update hparam search
* remove redundant tokenizer argument
---------
Co-authored-by: rasbt <mail@sebastianraschka.com>
|
2024-06-11 20:59:48 -05:00 |
|
rasbt
|
1b1fd21d64
|
fix typo in comment
|
2024-06-09 06:14:02 -05:00 |
|
Sebastian Raschka
|
72a073bbbf
|
Remove leftover instances of self.tokenizer (#201)
* Remove leftover instances of self.tokenizer
* add endoftext token
|
2024-06-08 14:57:34 -05:00 |
|
rasbt
|
6f0a5c320b
|
fix learning rate scheduler
|
2024-06-03 07:06:42 -05:00 |
|
rasbt
|
b352d9ef0a
|
update loss
|
2024-05-31 07:30:57 -05:00 |
|
Kumar Utsav
|
bc5d73857c
|
Update ch05.ipynb
Fixed incorrect token ids
|
2024-05-29 20:34:23 +05:30 |
|
Sebastian Raschka
|
39a831a4d8
|
Make header more clear
|
2024-05-25 10:44:12 -05:00 |
|
rasbt
|
98d453b666
|
update formatting
|
2024-05-24 07:20:37 -05:00 |
|
rasbt
|
b40c260859
|
update how to retrieve learning rate
|
2024-05-23 17:19:01 -05:00 |
|
Daniel Kleine
|
aa67e6e1ac
|
removed unnecessary .gitignore
|
2024-05-21 19:25:16 +00:00 |
|
rasbt
|
a5593f9860
|
change defaults to 0 temp
|
2024-05-19 09:04:49 -05:00 |
|
rasbt
|
1463b2ae47
|
use default value for temperature
|
2024-05-19 08:48:10 -05:00 |
|
rasbt
|
4851d5a0fa
|
add eos_id option for ch07
|
2024-05-18 12:35:40 -05:00 |
|
Daniel Kleine
|
cf8b6c1094
|
fixed empty space
|
2024-05-17 10:44:18 +02:00 |
|
rasbt
|
cd7ea15e8d
|
add readme
|
2024-05-13 08:50:55 -05:00 |
|
speed
|
45f6e72f40
|
fix 1024 characters to 1024 tokens (#152)
|
2024-05-11 13:17:07 -05:00 |
|
rasbt
|
aec169dc12
|
link formatting
|
2024-04-30 06:26:23 -05:00 |
|
Sebastian Raschka
|
97ed38116a
|
Rename drop_resid to drop_shortcut (#136)
|
2024-04-28 14:31:27 -05:00 |
|
rasbt
|
90d239b4f7
|
fix merge conflict
|
2024-04-22 07:05:40 -05:00 |
|
rasbt
|
72be9f4e8e
|
update numbering
|
2024-04-22 07:00:20 -05:00 |
|
rasbt
|
868955f6a5
|
file header
|
2024-04-22 06:53:38 -05:00 |
|
Sebastian Raschka
|
44b3815960
|
remove requests dependency (#125)
|
2024-04-21 14:15:05 -05:00 |
|
Sebastian Raschka
|
c70ddff558
|
Return nan if val loader is empty (#124)
|
2024-04-20 08:02:30 -05:00 |
|
Sebastian Raschka
|
155ac03f61
|
use torch no grad for loss (#119)
|
2024-04-14 08:13:07 -05:00 |
|
Sebastian Raschka
|
dd51d4ad83
|
Make datesets and loaders compatible with multiprocessing (#118)
|
2024-04-13 13:57:56 -05:00 |
|
Sebastian Raschka
|
9f3f231ac7
|
use correct lr
|
2024-04-12 19:55:07 -04:00 |
|
Sebastian Raschka
|
55ebabf95c
|
Automated link checking (#117)
* Automated link checking
* Fix links in Jupyter Nbs
|
2024-04-12 19:08:34 -04:00 |
|
Sebastian Raschka
|
e757091301
|
Organized setup instructions (#115)
* Organized setup instructions
* update tets
* link checker action
* raise error upon broken link
* fix links
* fix links
* delete duplicated paragraph
|
2024-04-10 22:09:46 -04:00 |
|
James Holcombe
|
05718c6b94
|
Use instance tokenizer (#116)
* Use instance tokenizer
* consistency updates
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-04-10 21:16:19 -04:00 |
|
Daniel Kleine
|
61b6e35ddf
|
Added PDF display support to Docker image and VS Code and updated first step for gutenberg project (#111)
* added VS Code extensions recommendations
* Added PDF display support to Docker image and VS Code
* fixed steps to download the dataset
|
2024-04-08 20:37:55 -04:00 |
|
rasbt
|
58d5bd9e39
|
address suggestions to improve clarity
|
2024-04-07 08:41:09 -05:00 |
|
rasbt
|
42eda8b70f
|
renumber exercises
|
2024-04-07 06:03:41 -05:00 |
|
rasbt
|
c5a17393fc
|
variable renaming for clarity
|
2024-04-05 07:26:42 -05:00 |
|
rasbt
|
8c36399e7c
|
rename hparams to settings
|
2024-04-05 07:24:46 -05:00 |
|
Daniel Kleine
|
44c0494406
|
Updated devcontainer, .gitignore and README for gutenberg project (#107)
* added ch05/03_bonus_pretraining_on_gutenberg model checkpoints and preprocessing output folders to .gitignore
* removed prettier extension, added github alerts markdown extension
* specified download instructions and fixed code markdown
* Update ch05/03_bonus_pretraining_on_gutenberg/README.md
* Update ch05/03_bonus_pretraining_on_gutenberg/README.md
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-04-05 06:53:01 -05:00 |
|
Sebastian Raschka
|
adc2964fc5
|
Fix Loss in Gutenberg bonus section (#109)
|
2024-04-04 20:54:09 -05:00 |
|
Sebastian Raschka
|
2de60d1bfb
|
Rename variable to context_length to make it easier on readers (#106)
* rename to context length
* fix spacing
|
2024-04-04 07:27:41 -05:00 |
|
Sebastian Raschka
|
3829ccdb34
|
Remove reundant dropout in MLP module (#105)
|
2024-04-03 20:19:08 -05:00 |
|
rasbt
|
e14585e954
|
rename batch to text
|
2024-04-02 20:46:53 -05:00 |
|
rasbt
|
7d1eadd0be
|
update notes
|
2024-04-02 18:27:13 -05:00 |
|
Sebastian Raschka
|
2fab89d47e
|
Use max size properly
|
2024-04-02 13:29:23 -05:00 |
|
Sebastian Raschka
|
4a617b8343
|
Gutenberg for Windows users (#99)
|
2024-04-02 08:54:24 -05:00 |
|
rasbt
|
f30dd2dd2b
|
improve instructions
|
2024-04-02 07:12:22 -05:00 |
|
rasbt
|
776a517d18
|
figure scaling
|
2024-04-01 08:05:01 -05:00 |
|
rasbt
|
ee096986ea
|
upload exercise solutions of ch05
|
2024-03-31 20:28:51 -05:00 |
|