Sebastian Raschka
|
da61d5b76a
|
Ch06 draft (#138)
* Ch06 first draft
* add utility files
|
2024-05-03 08:37:58 -05:00 |
|
rasbt
|
c735c21e87
|
fix swiglu acronym
|
2024-05-01 20:26:17 -05:00 |
|
rasbt
|
aec169dc12
|
link formatting
|
2024-04-30 06:26:23 -05:00 |
|
rasbt
|
d249960bdc
|
Merge branch 'main' of https://github.com/rasbt/LLMs-from-scratch
|
2024-04-30 06:25:37 -05:00 |
|
Sebastian Raschka
|
82d6bd47a4
|
use training set len (#137)
|
2024-04-29 21:56:05 -05:00 |
|
rasbt
|
0ac19a1e50
|
use training set len
|
2024-04-29 21:50:07 -05:00 |
|
Sebastian Raschka
|
97ed38116a
|
Rename drop_resid to drop_shortcut (#136)
|
2024-04-28 14:31:27 -05:00 |
|
Sebastian Raschka
|
70cd174091
|
add roberta option (#135)
|
2024-04-28 13:57:36 -05:00 |
|
Sebastian Raschka
|
ca47c5e4b2
|
Formatting improvements (#134)
* formatting improvements
* .yml triggers
|
2024-04-28 12:05:32 -05:00 |
|
Sebastian Raschka
|
9a5d4d8ac9
|
Try windows runners (#133)
* try windows runners
* update triggers
* trigger with code file update
* add new status badges
|
2024-04-28 07:39:23 -05:00 |
|
Sebastian Raschka
|
e1d094b655
|
Update README.md
|
2024-04-27 07:59:42 -05:00 |
|
Sebastian Raschka
|
fc3d70f72f
|
Data loader intuition with numbers (#132)
* data loader intuition with numbers
* fix link
* fix tests
|
2024-04-27 07:56:41 -05:00 |
|
Sebastian Raschka
|
4adb96d7ee
|
Make code more consistent and add projection layer (#131)
* Make code more consistent and add projection
* remove redundant buffer
|
2024-04-26 17:13:08 -05:00 |
|
Sebastian Raschka
|
59b4fd3e25
|
IMDB experiments (#128)
* IMDB experiments
* style fixes
* Update README.md
|
2024-04-25 07:20:53 -05:00 |
|
rasbt
|
258aff3e9a
|
style checks
|
2024-04-24 07:48:51 -05:00 |
|
rasbt
|
46d09b30d9
|
add usage
|
2024-04-24 07:27:04 -05:00 |
|
rasbt
|
5ef438aa3b
|
add more experiments
|
2024-04-24 07:23:11 -05:00 |
|
rasbt
|
642f819910
|
update requirements
|
2024-04-24 06:38:02 -05:00 |
|
rasbt
|
3b4484029d
|
rename folder
|
2024-04-23 21:02:57 -05:00 |
|
rasbt
|
c7cdedf981
|
update figures in bonus notebook
|
2024-04-23 21:01:27 -05:00 |
|
Sebastian Raschka
|
16964a6486
|
Chapter 6 ablation studies (#127)
* Chapter 6 ablation studies
* add table
* formatting
* formatting
* formatting
|
2024-04-23 09:51:52 -05:00 |
|
Sebastian Raschka
|
0bd2608a6c
|
update stride wording
|
2024-04-22 20:40:48 -05:00 |
|
rasbt
|
90d239b4f7
|
fix merge conflict
|
2024-04-22 07:05:40 -05:00 |
|
rasbt
|
72be9f4e8e
|
update numbering
|
2024-04-22 07:00:20 -05:00 |
|
rasbt
|
868955f6a5
|
file header
|
2024-04-22 06:53:38 -05:00 |
|
Sebastian Raschka
|
44b3815960
|
remove requests dependency (#125)
|
2024-04-21 14:15:05 -05:00 |
|
rasbt
|
d202cabdee
|
update figures
|
2024-04-20 11:42:03 -05:00 |
|
Sebastian Raschka
|
c70ddff558
|
Return nan if val loader is empty (#124)
|
2024-04-20 08:02:30 -05:00 |
|
Sebastian Raschka
|
7740d556a0
|
Use dim=-1 for consistency (#122)
|
2024-04-18 05:56:23 -05:00 |
|
Sebastian Raschka
|
e0ce5ca459
|
Calculate warmup steps as a fraction (#121)
|
2024-04-17 20:30:42 -05:00 |
|
Sebastian Raschka
|
8d53e8d8cd
|
extend setup instructions (#120)
|
2024-04-15 21:05:03 -05:00 |
|
Sebastian Raschka
|
b59eacb01f
|
Update README.md
|
2024-04-14 12:42:02 -05:00 |
|
rasbt
|
0729afa835
|
shorten badge names
|
2024-04-14 12:41:23 -05:00 |
|
Sebastian Raschka
|
a9c1b94d09
|
Check hyperlink badge
|
2024-04-14 12:38:55 -05:00 |
|
Sebastian Raschka
|
155ac03f61
|
use torch no grad for loss (#119)
|
2024-04-14 08:13:07 -05:00 |
|
Sebastian Raschka
|
a3a5574758
|
Update README.md
|
2024-04-13 15:04:08 -05:00 |
|
Sebastian Raschka
|
dd51d4ad83
|
Make datasets and loaders compatible with multiprocessing (#118)
|
2024-04-13 13:57:56 -05:00 |
|
Sebastian Raschka
|
9f3f231ac7
|
use correct lr
|
2024-04-12 19:55:07 -04:00 |
|
Sebastian Raschka
|
55ebabf95c
|
Automated link checking (#117)
* Automated link checking
* Fix links in Jupyter Nbs
|
2024-04-12 19:08:34 -04:00 |
|
Sebastian Raschka
|
33b27368a3
|
improve check-links.yml
|
2024-04-11 17:23:15 -04:00 |
|
Sebastian Raschka
|
5ca4384eb7
|
unit test indicator placement
|
2024-04-10 22:15:07 -04:00 |
|
Sebastian Raschka
|
ae3020bc12
|
setup instruction note
|
2024-04-10 22:13:22 -04:00 |
|
Sebastian Raschka
|
e757091301
|
Organized setup instructions (#115)
* Organized setup instructions
* update tests
* link checker action
* raise error upon broken link
* fix links
* fix links
* delete duplicated paragraph
|
2024-04-10 22:09:46 -04:00 |
|
James Holcombe
|
05718c6b94
|
Use instance tokenizer (#116)
* Use instance tokenizer
* consistency updates
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-04-10 21:16:19 -04:00 |
|
Sebastian Raschka
|
028a346498
|
move devcontainer (#113)
|
2024-04-08 20:49:37 -04:00 |
|
Daniel Kleine
|
61b6e35ddf
|
Added PDF display support to Docker image and VS Code and updated first step for gutenberg project (#111)
* added VS Code extensions recommendations
* Added PDF display support to Docker image and VS Code
* fixed steps to download the dataset
|
2024-04-08 20:37:55 -04:00 |
|
rasbt
|
58d5bd9e39
|
address suggestions to improve clarity
|
2024-04-07 08:41:09 -05:00 |
|
rasbt
|
42eda8b70f
|
renumber exercises
|
2024-04-07 06:03:41 -05:00 |
|
Daniel Kleine
|
e43e0760f9
|
added VS Code extensions recommendations (#110)
|
2024-04-05 08:30:56 -05:00 |
|
rasbt
|
c5a17393fc
|
variable renaming for clarity
|
2024-04-05 07:26:42 -05:00 |
|