535 Commits

Author SHA1 Message Date
rasbt
742f0a6d29 add missing output in bonus 2024-03-03 17:29:46 -06:00
rasbt
f526a8d7fb add requirements file for bonus notebook 2024-03-02 16:54:24 -06:00
rasbt
cc2383c4de remove duplicated exercise code 2024-03-02 16:44:36 -06:00
Sebastian Raschka
c071ea73f9 Update DDP-script.py
Fix for-loop
2024-03-01 18:31:05 -06:00
Sebastian Raschka
c9dccb0c40 Merge pull request #33 from rayedbw/patch-1
Update ch04.ipynb
2024-02-29 20:00:09 -06:00
rasbt
267e33cfaf remove redundant import 2024-02-29 19:59:05 -06:00
Sebastian Raschka
d419c02792 Merge pull request #39 from rayedbw/patch-3
Update Dockerfile
2024-02-29 12:30:50 -06:00
Rayed Bin Wahed
32087331ae Update Dockerfile
Use significantly smaller docker image
2024-03-01 02:10:01 +08:00
Sebastian Raschka
a94d53a752 Merge pull request #38 from rayedbw/patch-2
Update README.md
2024-02-29 12:06:05 -06:00
Rayed Bin Wahed
c47e434162 Update README.md
Correct spelling mistake
2024-03-01 01:56:58 +08:00
rasbt
7d732a5db0 add readme for devcontainer 2024-02-29 09:00:06 -06:00
rasbt
ee24acd481 Merge branch 'main' of https://github.com/rasbt/LLMs-from-scratch 2024-02-29 08:31:20 -06:00
rasbt
b827bf4eea remove redundant double-unsequeeze 2024-02-29 08:31:07 -06:00
Sebastian Raschka
3278243dd5 Merge pull request #31 from rayedbw/main
Add devcontainer
2024-02-29 08:24:29 -06:00
rasbt
fb770ef97c update docker files and docs 2024-02-29 08:22:53 -06:00
Rayed Bin Wahed
2fb035435e Update ch04.ipynb
Add missing import
2024-02-27 23:05:36 +08:00
rasbt
d89aaf319d update folder name 2024-02-27 08:53:04 -06:00
Sebastian Raschka
a060f923d3 Merge pull request #32 from rasbt/hparam
Add hparam tuning script
2024-02-27 08:52:01 -06:00
rasbt
87a743076d hparam tuning script 2024-02-27 08:51:03 -06:00
rasbt
f6266c3756 improve code comments 2024-02-27 06:40:35 -06:00
Rayed Bin Wahed
45a10dd823 Add devcontainer starter doc 2024-02-27 13:04:06 +08:00
Rayed Bin Wahed
fa7e659eb3 Add devcontainer 2024-02-26 22:29:27 +08:00
Sebastian Raschka
78ed2e35bc Add requirements.txt to main repo 2024-02-25 13:32:30 -06:00
Sebastian Raschka
3debb2f0df Update README.md 2024-02-25 13:31:32 -06:00
rasbt
3f186ab072 use .shape instead of .size() for consistency 2024-02-25 08:47:25 -06:00
rasbt
cdcd73ba7f drop_last=True 2024-02-25 07:23:38 -06:00
rasbt
6243726ab3 rename to dataloader v1 2024-02-24 07:48:18 -06:00
rasbt
4e68649f16 comment update 2024-02-24 06:52:17 -06:00
rasbt
f057156181 use smaller number of tokens to emphasize next token prediction goal 2024-02-15 20:09:20 -06:00
rasbt
557ddfc684 make a new example for shortcut connections 2024-02-15 19:34:12 -06:00
rasbt
250e6306e2 use attn_scores from sec 3.4 instead of 3.3 2024-02-14 20:23:59 -06:00
rasbt
231a854ae7 use less ambiguous var name 2024-02-13 07:05:37 -06:00
Sebastian Raschka
320f63829f Merge pull request #29 from Intelligence-Manifesto/patch-5
**step 2**
2024-02-12 07:34:37 -06:00
Intelligence-Manifesto
6a09e7b03a **step 2**
step 2: According to the context, the formatting here should be **step 2**. 
Additionally, it seems that there is a lack of text description for step 1 in this section, as other sections are all labeled with steps 1, 2, 3 in order, clearly indicating the steps.
2024-02-12 18:32:28 +08:00
rasbt
1d6f2c9084 rearrange exercise order 2024-02-11 14:46:05 -06:00
Sebastian Raschka
79d90d8147 Merge pull request #28 from rasbt/ch4-exercise-solutions
Add chapter 4 exercise solutions
2024-02-11 11:52:18 -06:00
rasbt
fe332006de ch4 exercise solutions 2024-02-11 11:51:39 -06:00
rasbt
103f7826ad use same iter to make figs consistent 2024-02-11 09:12:52 -06:00
rasbt
352b83d225 make softmax explicit 2024-02-11 08:42:21 -06:00
rasbt
7d86023fc4 make softmax explicit 2024-02-11 08:41:45 -06:00
rasbt
5840b4b5f8 update name of last section 2024-02-11 07:35:07 -06:00
Sebastian Raschka
e0b6fdbc53 Merge pull request #27 from Intelligence-Manifesto/patch-4
12 -> 21
2024-02-11 07:31:06 -06:00
Intelligence-Manifesto
1278615c25 12 -> 21
12 -> 21
2024-02-11 14:17:55 +08:00
rasbt
baa8617921 variable name fix 2024-02-10 17:53:54 -06:00
rasbt
496b52f842 format the other GPT architecture sizes 2024-02-10 17:47:56 -06:00
rasbt
40477c55b3 add missing ex sol to table 2024-02-10 10:13:21 -06:00
rasbt
10aa2d099d add print statements for illustration purposes 2024-02-10 10:10:14 -06:00
rasbt
cc459b6b5a Merge branch 'main' of https://github.com/rasbt/LLMs-from-scratch 2024-02-08 20:17:01 -06:00
rasbt
5d1d8ce511 add shape information for clarity 2024-02-08 20:16:54 -06:00
Sebastian Raschka
24d71784e2 Merge pull request #26 from Intelligence-Manifesto/patch-3
if -> in
2024-02-08 17:19:29 -06:00