497 Commits

Author SHA1 Message Date
Sebastian Raschka
4eea9ce12c
ch07 first draft (#203) 2024-06-09 10:35:26 -05:00
rasbt
1b1fd21d64
fix typo in comment 2024-06-09 06:14:02 -05:00
rasbt
39c4a887eb
add allowed_special={"<|endoftext|>"} 2024-06-09 06:04:02 -05:00
Sebastian Raschka
72a073bbbf
Remove leftover instances of self.tokenizer (#201)
* Remove leftover instances of self.tokenizer

* add endoftext token
2024-06-08 14:57:34 -05:00
Sebastian Raschka
c303a7f36d
Explain value truncation in some figures (#199)
* clarify truncation

* typo fix
2024-06-08 13:24:37 -05:00
rasbt
3b696a0b3c
make error more explicit 2024-06-08 13:21:40 -05:00
rasbt
1e12da90e6
clarify truncation 2024-06-08 13:13:43 -05:00
rasbt
4ac480c9ae
add instruction dataset 2024-06-08 10:38:41 -05:00
Sebastian Raschka
233c861930
Add A.1 and A.2 solutions (#198)
* add A.1 and A.2 solutions

* fix links
2024-06-08 09:50:01 -05:00
rasbt
f42290e83b
remove redundant file 2024-06-07 08:37:46 -05:00
Daniel Kleine
b178040b79
fixed code (#197) 2024-06-07 06:52:05 -05:00
rasbt
53060dfdfc
update ollama instructions 2024-06-06 21:03:40 -05:00
Sebastian Raschka
e27d9475cb
correlation analysis (#196) 2024-06-06 09:15:08 -05:00
rasbt
d271f317e0
explain ollama serve command 2024-06-06 06:42:54 -05:00
Daniel Kleine
ba1c1e74aa
updated Dockerfile and Additional Classification Finetuning Experiments (#195)
* accuracy to .2f

* added curl
2024-06-05 20:17:49 -05:00
rasbt
c6efa78325
remove empty cell 2024-06-05 18:18:16 -05:00
Sebastian Raschka
32251f27d5
Merge pull request #193 from rasbt/ollama-eval
Ollama-based model evaluation
2024-06-05 08:26:06 -05:00
Sebastian Raschka
ef580a0d57
Merge branch 'main' into ollama-eval 2024-06-05 08:23:45 -05:00
rasbt
30ebd7427c
Ollama-based model evaluation 2024-06-05 08:21:28 -05:00
Sebastian Raschka
6290dade88
remove redundant dependency 2024-06-04 20:54:19 -05:00
rasbt
f5c4e0778f
remove redundant import 2024-06-04 07:11:12 -05:00
rasbt
054cdfa4b1
restore file 2024-06-03 07:17:56 -05:00
rasbt
7fdbd16551
add number of workers to data loader 2024-06-03 07:12:47 -05:00
rasbt
6f0a5c320b
fix learning rate scheduler 2024-06-03 07:06:42 -05:00
rasbt
f95e0a910d
easier to read tensor formatting 2024-06-02 21:08:35 -05:00
rasbt
60f64bdc23
update figure 2.13 2024-06-01 09:38:33 -05:00
Sebastian Raschka
da26b54b82
Merge pull request #189 from rasbt/kuutsav/main
Fixed possibly wrong token ids in ch05.ipynb plus update the loss
2024-05-31 08:06:57 -05:00
rasbt
b352d9ef0a
update loss 2024-05-31 07:30:57 -05:00
Kumar Utsav
bc5d73857c
Update ch05.ipynb
Fixed incorrect token ids
2024-05-29 20:34:23 +05:30
Sebastian Raschka
c577f52bfc
Merge pull request #184 from rasbt/api-key-approach
Change API key retrieval approach
2024-05-27 08:47:04 -04:00
rasbt
71831890a0
update mha dim 2024-05-27 07:46:29 -05:00
rasbt
554cbd6bff
revert 2024-05-27 07:37:53 -05:00
rasbt
42af52fef4
revert unnecessary changes 2024-05-27 07:37:06 -05:00
rasbt
87c3e78dcb
Revert "Revert "newline""
This reverts commit a53ca10508aacc0c2a8da5467626f5fce33ef162.
2024-05-27 07:32:45 -05:00
rasbt
a53ca10508
Revert "newline"
This reverts commit 23982ed3fabc277ca50a998c325d304940eb78a5.
2024-05-27 07:32:22 -05:00
rasbt
23982ed3fa
newline 2024-05-27 07:30:27 -05:00
rasbt
8ad5facfb8
Update API approach and add progress bar 2024-05-27 07:29:06 -05:00
rasbt
dd7ba32b56
add comment 2024-05-27 07:18:07 -05:00
rasbt
306336c32d
add keys 2024-05-27 07:13:59 -05:00
Sebastian Raschka
f359db8a10
Merge pull request #183 from superkostya/fix-mha-wrapper-class
Fix an incorrect input dimension
2024-05-26 17:09:19 -04:00
rasbt
050c8b7b73
update pr 2024-05-26 15:38:35 -05:00
Sebastian Raschka
e2aeb80fdb
Merge branch 'main' into fix-mha-wrapper-class 2024-05-26 16:20:06 -04:00
Sebastian Raschka
ffe8bb7489
Merge pull request #182 from rasbt/refine-duplicate-search
Refine duplicate search utility function
2024-05-26 16:19:17 -04:00
Kostyantyn Borysenko
76cdf5e299
Fix an incorrect input dimension 2024-05-26 13:05:07 -07:00
rasbt
fa757e5e48
fixes 2024-05-26 14:28:30 -05:00
rasbt
bd23d7e515
refine similarity search 2024-05-26 14:25:09 -05:00
Sebastian Raschka
82ede6010b
Merge pull request #180 from rasbt/model-eval
Add openai model eval utility code
2024-05-26 12:02:36 -04:00
rasbt
771992c486
Add openai model eval utility code 2024-05-26 10:44:15 -05:00
Sebastian Raschka
602bc07fce
Update requirements.txt 2024-05-26 07:02:13 -05:00
Sebastian Raschka
6f8b125a6e
Add recent bonus materials 2024-05-26 06:40:57 -05:00