rasbt
|
7b2174b115
|
formatting updates
|
2024-06-17 07:40:04 -05:00 |
|
rasbt
|
f6274117b9
|
formatting
|
2024-06-16 08:27:25 -05:00 |
|
Sebastian Raschka
|
aba7ed2eb1
|
Updated ch07 (#213)
* Updated ch07
* fix links
* check links
|
2024-06-15 15:10:01 -05:00 |
|
Sebastian Raschka
|
7bf70baf10
|
Remove duplicated cell (#212)
* add a suggestion since code snippet has been repeated.
* remove duplicated cell
---------
Co-authored-by: Shuyib <benmainye@gmail.com>
|
2024-06-15 12:48:34 -05:00 |
|
rasbt
|
c6466990bb
|
explain truncation in ch05
|
2024-06-12 19:50:11 -05:00 |
|
rasbt
|
aaa54b10b3
|
dim-consistency
|
2024-06-12 19:43:25 -05:00 |
|
Sebastian Raschka
|
bcccda728b
|
check gpt files (#208)
|
2024-06-12 07:19:10 -05:00 |
|
Daniel Kleine
|
ef40f2f9ad
|
minor bug fixes (#207)
* fixed path arg for create_dataset_csvs()
* updated assign_check() to remove user warning
|
2024-06-12 06:27:56 -05:00 |
|
rasbt
|
e24fd98cdf
|
distinguish better between main chapter code and bonus materials
|
2024-06-11 21:07:42 -05:00 |
|
Daniel Kleine
|
dcbdc1d2e5
|
fixes for code (#206)
* updated .gitignore
* removed unused GELU import
* fixed model_configs, fixed all tensors on same device
* removed unused tiktoken
* update
* update hparam search
* remove redundant tokenizer argument
---------
Co-authored-by: rasbt <mail@sebastianraschka.com>
|
2024-06-11 20:59:48 -05:00 |
|
Sebastian Raschka
|
1a65020d81
|
Add eos token to each response (#205)
* add eos token to each response
* remove figure
|
2024-06-11 08:57:12 -05:00 |
|
rasbt
|
101f3b949a
|
add performance of llama 3 models for reference
|
2024-06-10 18:21:58 -05:00 |
|
Daniel Kleine
|
da9f64215a
|
ch07 fixes (#204)
* updated .gitignore for ch07
* fixed extract_response()
|
2024-06-10 17:31:13 -05:00 |
|
rasbt
|
888ce71796
|
reorg first section
|
2024-06-10 08:20:12 -05:00 |
|
rasbt
|
1d278c65da
|
fix gradient comment
|
2024-06-09 20:23:18 -05:00 |
|
Sebastian Raschka
|
4eea9ce12c
|
ch07 first draft (#203)
|
2024-06-09 10:35:26 -05:00 |
|
rasbt
|
1b1fd21d64
|
fix typo in comment
|
2024-06-09 06:14:02 -05:00 |
|
rasbt
|
39c4a887eb
|
add allowed_special={"<|endoftext|>"}
|
2024-06-09 06:04:02 -05:00 |
|
Sebastian Raschka
|
72a073bbbf
|
Remove leftover instances of self.tokenizer (#201)
* Remove leftover instances of self.tokenizer
* add endoftext token
|
2024-06-08 14:57:34 -05:00 |
|
Sebastian Raschka
|
c303a7f36d
|
Explain value truncation in some figures (#199)
* clarify truncation
* typo fix
|
2024-06-08 13:24:37 -05:00 |
|
rasbt
|
3b696a0b3c
|
make error more explicit
|
2024-06-08 13:21:40 -05:00 |
|
rasbt
|
1e12da90e6
|
clarify truncation
|
2024-06-08 13:13:43 -05:00 |
|
rasbt
|
4ac480c9ae
|
add instruction dataset
|
2024-06-08 10:38:41 -05:00 |
|
Sebastian Raschka
|
233c861930
|
Add A.1 and A.2 solutions (#198)
* add A.1 and A.2 solutions
* fix links
|
2024-06-08 09:50:01 -05:00 |
|
rasbt
|
f42290e83b
|
remove redundant file
|
2024-06-07 08:37:46 -05:00 |
|
Daniel Kleine
|
b178040b79
|
fixed code (#197)
|
2024-06-07 06:52:05 -05:00 |
|
rasbt
|
53060dfdfc
|
update ollama instructions
|
2024-06-06 21:03:40 -05:00 |
|
Sebastian Raschka
|
e27d9475cb
|
correlation analysis (#196)
|
2024-06-06 09:15:08 -05:00 |
|
rasbt
|
d271f317e0
|
explain ollama serve command
|
2024-06-06 06:42:54 -05:00 |
|
Daniel Kleine
|
ba1c1e74aa
|
updated Dockerfile and Additional Classification Finetuning Experiments (#195)
* accuracy to .2f
* added curl
|
2024-06-05 20:17:49 -05:00 |
|
rasbt
|
c6efa78325
|
remove empty cell
|
2024-06-05 18:18:16 -05:00 |
|
Sebastian Raschka
|
32251f27d5
|
Merge pull request #193 from rasbt/ollama-eval
Ollama-based model evaluation
|
2024-06-05 08:26:06 -05:00 |
|
Sebastian Raschka
|
ef580a0d57
|
Merge branch 'main' into ollama-eval
|
2024-06-05 08:23:45 -05:00 |
|
rasbt
|
30ebd7427c
|
Ollama-based model evaluation
|
2024-06-05 08:21:28 -05:00 |
|
Sebastian Raschka
|
6290dade88
|
remove redundant dependency
|
2024-06-04 20:54:19 -05:00 |
|
rasbt
|
f5c4e0778f
|
remove redundant import
|
2024-06-04 07:11:12 -05:00 |
|
rasbt
|
054cdfa4b1
|
restore file
|
2024-06-03 07:17:56 -05:00 |
|
rasbt
|
7fdbd16551
|
add number of workers to data loader
|
2024-06-03 07:12:47 -05:00 |
|
rasbt
|
6f0a5c320b
|
fix learning rate scheduler
|
2024-06-03 07:06:42 -05:00 |
|
rasbt
|
f95e0a910d
|
easier to read tensor formatting
|
2024-06-02 21:08:35 -05:00 |
|
rasbt
|
60f64bdc23
|
update figure 2.13
|
2024-06-01 09:38:33 -05:00 |
|
Sebastian Raschka
|
da26b54b82
|
Merge pull request #189 from rasbt/kuutsav/main
Fixed possibly wrong token ids in ch05.ipynb plus update the loss
|
2024-05-31 08:06:57 -05:00 |
|
rasbt
|
b352d9ef0a
|
update loss
|
2024-05-31 07:30:57 -05:00 |
|
Kumar Utsav
|
bc5d73857c
|
Update ch05.ipynb
Fixed incorrect token ids
|
2024-05-29 20:34:23 +05:30 |
|
Sebastian Raschka
|
c577f52bfc
|
Merge pull request #184 from rasbt/api-key-approach
Change API key retrieval approach
|
2024-05-27 08:47:04 -04:00 |
|
rasbt
|
71831890a0
|
update mha dim
|
2024-05-27 07:46:29 -05:00 |
|
rasbt
|
554cbd6bff
|
revert
|
2024-05-27 07:37:53 -05:00 |
|
rasbt
|
42af52fef4
|
revert unnecessary changes
|
2024-05-27 07:37:06 -05:00 |
|
rasbt
|
87c3e78dcb
|
Revert "Revert "newline""
This reverts commit a53ca10508aacc0c2a8da5467626f5fce33ef162.
|
2024-05-27 07:32:45 -05:00 |
|
rasbt
|
a53ca10508
|
Revert "newline"
This reverts commit 23982ed3fabc277ca50a998c325d304940eb78a5.
|
2024-05-27 07:32:22 -05:00 |
|