rasbt
742f0a6d29
add missing output in bonus
2024-03-03 17:29:46 -06:00
rasbt
f526a8d7fb
add requirements file for bonus notebook
2024-03-02 16:54:24 -06:00
rasbt
cc2383c4de
remove duplicated exercise code
2024-03-02 16:44:36 -06:00
Sebastian Raschka
c071ea73f9
Update DDP-script.py
...
Fix for-loop
2024-03-01 18:31:05 -06:00
Sebastian Raschka
c9dccb0c40
Merge pull request #33 from rayedbw/patch-1
...
Update ch04.ipynb
2024-02-29 20:00:09 -06:00
rasbt
267e33cfaf
remove redundant import
2024-02-29 19:59:05 -06:00
Sebastian Raschka
d419c02792
Merge pull request #39 from rayedbw/patch-3
...
Update Dockerfile
2024-02-29 12:30:50 -06:00
Rayed Bin Wahed
32087331ae
Update Dockerfile
...
Use significantly smaller docker image
2024-03-01 02:10:01 +08:00
Sebastian Raschka
a94d53a752
Merge pull request #38 from rayedbw/patch-2
...
Update README.md
2024-02-29 12:06:05 -06:00
Rayed Bin Wahed
c47e434162
Update README.md
...
Correct spelling mistake
2024-03-01 01:56:58 +08:00
rasbt
7d732a5db0
add readme for devcontainer
2024-02-29 09:00:06 -06:00
rasbt
ee24acd481
Merge branch 'main' of https://github.com/rasbt/LLMs-from-scratch
2024-02-29 08:31:20 -06:00
rasbt
b827bf4eea
remove redundant double-unsequeeze
2024-02-29 08:31:07 -06:00
Sebastian Raschka
3278243dd5
Merge pull request #31 from rayedbw/main
...
Add devcontainer
2024-02-29 08:24:29 -06:00
rasbt
fb770ef97c
update docker files and docs
2024-02-29 08:22:53 -06:00
Rayed Bin Wahed
2fb035435e
Update ch04.ipynb
...
Add missing import
2024-02-27 23:05:36 +08:00
rasbt
d89aaf319d
update folder name
2024-02-27 08:53:04 -06:00
Sebastian Raschka
a060f923d3
Merge pull request #32 from rasbt/hparam
...
Add hparam tuning script
2024-02-27 08:52:01 -06:00
rasbt
87a743076d
hparam tuning script
2024-02-27 08:51:03 -06:00
rasbt
f6266c3756
improve code comments
2024-02-27 06:40:35 -06:00
Rayed Bin Wahed
45a10dd823
Add devcontainer starter doc
2024-02-27 13:04:06 +08:00
Rayed Bin Wahed
fa7e659eb3
Add devcontainer
2024-02-26 22:29:27 +08:00
Sebastian Raschka
78ed2e35bc
Add requirements.txt to main repo
2024-02-25 13:32:30 -06:00
Sebastian Raschka
3debb2f0df
Update README.md
2024-02-25 13:31:32 -06:00
rasbt
3f186ab072
use .shape instead of .size() for consistency
2024-02-25 08:47:25 -06:00
rasbt
cdcd73ba7f
drop_last=True
2024-02-25 07:23:38 -06:00
rasbt
6243726ab3
rename to dataloader v1
2024-02-24 07:48:18 -06:00
rasbt
4e68649f16
comment update
2024-02-24 06:52:17 -06:00
rasbt
f057156181
use smaller number of tokens to emphasize next token prediction goal
2024-02-15 20:09:20 -06:00
rasbt
557ddfc684
make a new example for shortcut connections
2024-02-15 19:34:12 -06:00
rasbt
250e6306e2
use attn_scores from sec 3.4 instead of 3.3
2024-02-14 20:23:59 -06:00
rasbt
231a854ae7
use less ambiguous var name
2024-02-13 07:05:37 -06:00
Sebastian Raschka
320f63829f
Merge pull request #29 from Intelligence-Manifesto/patch-5
...
**step 2**
2024-02-12 07:34:37 -06:00
Intelligence-Manifesto
6a09e7b03a
**step 2**
...
step 2: According to the context, the formatting here should be **step 2**.
Additionally, it seems that there is a lack of text description for step 1 in this section, as other sections are all labeled with steps 1, 2, 3 in order, clearly indicating the steps.
2024-02-12 18:32:28 +08:00
rasbt
1d6f2c9084
rearrange exercise order
2024-02-11 14:46:05 -06:00
Sebastian Raschka
79d90d8147
Merge pull request #28 from rasbt/ch4-exercise-solutions
...
Add chapter 4 exercise solutions
2024-02-11 11:52:18 -06:00
rasbt
fe332006de
ch4 exercise solutions
2024-02-11 11:51:39 -06:00
rasbt
103f7826ad
use same iter to make figs consistent
2024-02-11 09:12:52 -06:00
rasbt
352b83d225
make softmax explicit
2024-02-11 08:42:21 -06:00
rasbt
7d86023fc4
make softmax explicit
2024-02-11 08:41:45 -06:00
rasbt
5840b4b5f8
update name of last section
2024-02-11 07:35:07 -06:00
Sebastian Raschka
e0b6fdbc53
Merge pull request #27 from Intelligence-Manifesto/patch-4
...
12 -> 21
2024-02-11 07:31:06 -06:00
Intelligence-Manifesto
1278615c25
12 -> 21
...
12 -> 21
2024-02-11 14:17:55 +08:00
rasbt
baa8617921
variable name fix
2024-02-10 17:53:54 -06:00
rasbt
496b52f842
format the other GPT architecture sizes
2024-02-10 17:47:56 -06:00
rasbt
40477c55b3
add missing ex sol to table
2024-02-10 10:13:21 -06:00
rasbt
10aa2d099d
add print statements for illustration purposes
2024-02-10 10:10:14 -06:00
rasbt
cc459b6b5a
Merge branch 'main' of https://github.com/rasbt/LLMs-from-scratch
2024-02-08 20:17:01 -06:00
rasbt
5d1d8ce511
add shape information for clarity
2024-02-08 20:16:54 -06:00
Sebastian Raschka
24d71784e2
Merge pull request #26 from Intelligence-Manifesto/patch-3
...
if -> in
2024-02-08 17:19:29 -06:00