702 Commits

Author SHA1 Message Date
Sebastian Raschka
e91718f1e7 Add eos token to each response (#205)
* add eos token to each response

* remove figure
2024-06-11 08:57:12 -05:00
rasbt
cbbd4c5600 add performance of llama 3 models for reference 2024-06-10 18:21:58 -05:00
Daniel Kleine
9a81230968 ch07 fixes (#204)
* updated .gitignore for ch07

* fixed extract_response()
2024-06-10 17:31:13 -05:00
rasbt
029efee920 reorg first section 2024-06-10 08:20:12 -05:00
rasbt
b9ed5811c3 fix gradient comment 2024-06-09 20:23:18 -05:00
Sebastian Raschka
c3c7e64a63 ch07 first draft (#203) 2024-06-09 10:35:26 -05:00
rasbt
f0e4c99bc3 fix typo in comment 2024-06-09 06:14:02 -05:00
rasbt
e1adeb14f3 add allowed_special={"<|endoftext|>"} 2024-06-09 06:04:02 -05:00
Sebastian Raschka
40ba3a4068 Remove leftover instances of self.tokenizer (#201)
* Remove leftover instances of self.tokenizer

* add endoftext token
2024-06-08 14:57:34 -05:00
Sebastian Raschka
98d23751f7 Explain value truncation in some figures (#199)
* clarify truncation

* typo fix
2024-06-08 13:24:37 -05:00
rasbt
d4dba08922 make error more explicit 2024-06-08 13:21:40 -05:00
rasbt
a6113fcd33 clarify truncation 2024-06-08 13:13:43 -05:00
rasbt
b80e7804b3 add instruction dataset 2024-06-08 10:38:41 -05:00
Sebastian Raschka
517d86e58e Add A.1 and A.2 solutions (#198)
* add A.1 and A.2 solutions

* fix links
2024-06-08 09:50:01 -05:00
rasbt
fbecc2b48b remove redundant file 2024-06-07 08:37:46 -05:00
Daniel Kleine
42ecfc1c81 fixed code (#197) 2024-06-07 06:52:05 -05:00
rasbt
017f73d50c update ollama instructions 2024-06-06 21:03:40 -05:00
Sebastian Raschka
de36026e5a correlation analysis (#196) 2024-06-06 09:15:08 -05:00
rasbt
9e257212b2 explain ollama serve command 2024-06-06 06:42:54 -05:00
Daniel Kleine
e637393056 updated Dockerfile and Additional Classification Finetuning Experiments (#195)
* accuracy to .2f

* added curl
2024-06-05 20:17:49 -05:00
rasbt
1efd9313b1 remove empty cell 2024-06-05 18:18:16 -05:00
Sebastian Raschka
429cde81b5 Merge pull request #193 from rasbt/ollama-eval
Ollama-based model evaluation
2024-06-05 08:26:06 -05:00
Sebastian Raschka
1c7c937602 Merge branch 'main' into ollama-eval 2024-06-05 08:23:45 -05:00
rasbt
9f8c3f2b35 Ollama-based model evaluation 2024-06-05 08:21:28 -05:00
Sebastian Raschka
cdb7bf71df remove redundant dependency 2024-06-04 20:54:19 -05:00
rasbt
c97c717a7b remove redundant import 2024-06-04 07:11:12 -05:00
rasbt
089dfb756a restore file 2024-06-03 07:17:56 -05:00
rasbt
d51099a9e7 add number of workers to data loader 2024-06-03 07:12:47 -05:00
rasbt
5a1e0eecce fix learning rate scheduler 2024-06-03 07:06:42 -05:00
rasbt
5adc6a8f69 easier to read tensor formatting 2024-06-02 21:08:35 -05:00
rasbt
20f1ef553c update figure 2.13 2024-06-01 09:38:33 -05:00
Sebastian Raschka
64fdb4a249 Merge pull request #189 from rasbt/kuutsav/main
Fixed possibly wrong token ids in ch05.ipynb plus update the loss
2024-05-31 08:06:57 -05:00
rasbt
f7e528fca6 update loss 2024-05-31 07:30:57 -05:00
Kumar Utsav
b48d436bfc Update ch05.ipynb
Fixed incorrect token ids
2024-05-29 20:34:23 +05:30
Sebastian Raschka
688df76bc0 Merge pull request #184 from rasbt/api-key-approach
Change API key retrieval approach
2024-05-27 08:47:04 -04:00
rasbt
c0f564ee87 update mha dim 2024-05-27 07:46:29 -05:00
rasbt
9a4861ee7f revert 2024-05-27 07:37:53 -05:00
rasbt
f86a929665 revert unnecessary changes 2024-05-27 07:37:06 -05:00
rasbt
134334ce21 Revert "Revert "newline""
This reverts commit 6aa2a587d22105910bd6f07c6c79a5abf83a5eb6.
2024-05-27 07:32:45 -05:00
rasbt
6aa2a587d2 Revert "newline"
This reverts commit 9eeeb67329f6ee0ee562a716586722bf00d68bb8.
2024-05-27 07:32:22 -05:00
rasbt
9eeeb67329 newline 2024-05-27 07:30:27 -05:00
rasbt
474ad17546 Update API approach and add progress bar 2024-05-27 07:29:06 -05:00
rasbt
b2ad4fb0d6 add comment 2024-05-27 07:18:07 -05:00
rasbt
36e169f3ab add keys 2024-05-27 07:13:59 -05:00
Sebastian Raschka
68123a5433 Merge pull request #183 from superkostya/fix-mha-wrapper-class
Fix an incorrect input dimension
2024-05-26 17:09:19 -04:00
rasbt
ec6e09136a update pr 2024-05-26 15:38:35 -05:00
Sebastian Raschka
2088d75966 Merge branch 'main' into fix-mha-wrapper-class 2024-05-26 16:20:06 -04:00
Sebastian Raschka
3853ce48c5 Merge pull request #182 from rasbt/refine-duplicate-search
Refine duplicate search utility function
2024-05-26 16:19:17 -04:00
Kostyantyn Borysenko
18b5724e75 Fix an incorrect input dimension 2024-05-26 13:05:07 -07:00
rasbt
5044ee2dda fixes 2024-05-26 14:28:30 -05:00