403 Commits

Author SHA1 Message Date
rasbt
9e149417b2 fix swiglu acronym 2024-05-01 20:26:17 -05:00
rasbt
bb59cbc525 link formatting 2024-04-30 06:26:23 -05:00
rasbt
c5886b7865 Merge branch 'main' of https://github.com/rasbt/LLMs-from-scratch 2024-04-30 06:25:37 -05:00
Sebastian Raschka
8d84800bcf use training set len (#137) 2024-04-29 21:56:05 -05:00
rasbt
354bb35726 use training set len 2024-04-29 21:50:07 -05:00
Sebastian Raschka
a5b353667d Rename drop_resid to drop_shortcut (#136) 2024-04-28 14:31:27 -05:00
Sebastian Raschka
d1edfcb63f add roberta option (#135) 2024-04-28 13:57:36 -05:00
Sebastian Raschka
d088753fca Formatting improvements (#134)
* formatting improvements

* .yml triggers
2024-04-28 12:05:32 -05:00
Sebastian Raschka
5ae5e9df3b Try windows runners (#133)
* try windows runners

* update triggers

* trigger with code file update

* add new status badges
2024-04-28 07:39:23 -05:00
Sebastian Raschka
1887b89af6 Update README.md 2024-04-27 07:59:42 -05:00
Sebastian Raschka
0f03c20483 Data loader intuition with numbers (#132)
* data loader intuition with numbers

* fix link

* fix tests
2024-04-27 07:56:41 -05:00
Sebastian Raschka
0528446584 Make code more consistent and add projection layer (#131)
* Make code more consistent and add projection

* remove redundant buffer
2024-04-26 17:13:08 -05:00
Sebastian Raschka
4bbd476e7a IMDB experiments (#128)
* IMDB experiments

* style fixes

* Update README.md
2024-04-25 07:20:53 -05:00
rasbt
51f4980a42 style checks 2024-04-24 07:48:51 -05:00
rasbt
d311bae25a add usage 2024-04-24 07:27:04 -05:00
rasbt
fb54b064c9 add more experiments 2024-04-24 07:23:11 -05:00
rasbt
b2cf956054 update requirements 2024-04-24 06:38:02 -05:00
rasbt
881075aeb0 rename folder 2024-04-23 21:02:57 -05:00
rasbt
379a8ab39c update figures in bonus notebook 2024-04-23 21:01:27 -05:00
Sebastian Raschka
f656ef996d Chapter 6 ablation studies (#127)
* Chapter 6 ablation studies

* add table

* formatting

* formatting

* formatting
2024-04-23 09:51:52 -05:00
Sebastian Raschka
44a009f7e6 update stride wording 2024-04-22 20:40:48 -05:00
rasbt
4abaa168ac fix merge conflict 2024-04-22 07:05:40 -05:00
rasbt
df4fc602d8 update numbering 2024-04-22 07:00:20 -05:00
rasbt
2dd7bf9cda file header 2024-04-22 06:53:38 -05:00
Sebastian Raschka
79d40c25bf remove requests dependency (#125) 2024-04-21 14:15:05 -05:00
rasbt
90fb214822 update figures 2024-04-20 11:42:03 -05:00
Sebastian Raschka
4557d5830e Return nan if val loader is empty (#124) 2024-04-20 08:02:30 -05:00
Sebastian Raschka
b5878a80ff Use dim=-1 for consistency (#122) 2024-04-18 05:56:23 -05:00
Sebastian Raschka
49f01d06d0 Calculate warmup steps as a fraction (#121) 2024-04-17 20:30:42 -05:00
Sebastian Raschka
fdcb8f99fa extend setup instructions (#120) 2024-04-15 21:05:03 -05:00
Sebastian Raschka
e5c567ad02 Update README.md 2024-04-14 12:42:02 -05:00
rasbt
6aa58ee2a9 shorten badge names 2024-04-14 12:41:23 -05:00
Sebastian Raschka
dc7bfde0e7 Check hyperlink badge 2024-04-14 12:38:55 -05:00
Sebastian Raschka
ef2de4718e use torch no grad for loss (#119) 2024-04-14 08:13:07 -05:00
Sebastian Raschka
98f1e97452 Update README.md 2024-04-13 15:04:08 -05:00
Sebastian Raschka
bae4b0fb08 Make datasets and loaders compatible with multiprocessing (#118) 2024-04-13 13:57:56 -05:00
Sebastian Raschka
8fe63a9a0e use correct lr 2024-04-12 19:55:07 -04:00
Sebastian Raschka
bbce1cb143 Automated link checking (#117)
* Automated link checking

* Fix links in Jupyter Nbs
2024-04-12 19:08:34 -04:00
Sebastian Raschka
d1a4157d71 improve check-links.yml 2024-04-11 17:23:15 -04:00
Sebastian Raschka
02f8b2ee26 unit test indicator placement 2024-04-10 22:15:07 -04:00
Sebastian Raschka
40539a2542 setup instruction note 2024-04-10 22:13:22 -04:00
Sebastian Raschka
790d0808b2 Organized setup instructions (#115)
* Organized setup instructions

* update tests

* link checker action

* raise error upon broken link

* fix links

* fix links

* delete duplicated paragraph
2024-04-10 22:09:46 -04:00
James Holcombe
0b866c133f Use instance tokenizer (#116)
* Use instance tokenizer

* consistency updates

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-04-10 21:16:19 -04:00
Sebastian Raschka
94f6582cff move devcontainer (#113) 2024-04-08 20:49:37 -04:00
Daniel Kleine
b01204ca3a Added PDF display support to Docker image and VS Code and updated first step for gutenberg project (#111)
* added VS Code extensions recommendations

* Added PDF display support to Docker image and VS Code

* fixed steps to download the dataset
2024-04-08 20:37:55 -04:00
rasbt
8462a777bd address suggestions to improve clarity 2024-04-07 08:41:09 -05:00
rasbt
040ce578be renumber exercises 2024-04-07 06:03:41 -05:00
Daniel Kleine
c76941e061 added VS Code extensions recommendations (#110) 2024-04-05 08:30:56 -05:00
rasbt
84b785ddd0 variable renaming for clarity 2024-04-05 07:26:42 -05:00
rasbt
c31e99720d rename hparams to settings 2024-04-05 07:24:46 -05:00