Sebastian Raschka
|
bd93790de3
|
remove unknown option from env
|
2024-06-18 07:52:54 -05:00 |
|
Sebastian Raschka
|
1aa1b44222
|
Update compute env
|
2024-06-18 07:51:54 -05:00 |
|
Sebastian Raschka
|
9f7a8c52d8
|
select computing env
|
2024-06-18 07:51:22 -05:00 |
|
Sebastian Raschka
|
bfe5219366
|
Update ask-a-question.md
|
2024-06-18 07:37:28 -05:00 |
|
rasbt
|
9f1c969a26
|
update bug report template
|
2024-06-18 07:34:43 -05:00 |
|
rasbt
|
97966efb79
|
indentation
|
2024-06-18 07:11:22 -05:00 |
|
rasbt
|
4a6043d16c
|
update template
|
2024-06-18 07:09:47 -05:00 |
|
rasbt
|
a85ee2759a
|
update template
|
2024-06-18 07:08:25 -05:00 |
|
rasbt
|
bbe8b4edf7
|
update template
|
2024-06-18 07:07:55 -05:00 |
|
Sebastian Raschka
|
4758b7bd03
|
Update issue templates
|
2024-06-18 07:03:55 -05:00 |
|
Sebastian Raschka
|
2c326213bb
|
Update year
|
2024-06-18 05:59:18 -05:00 |
|
Jinge Wang
|
8e2c8d0987
|
Fixed some typos in ch06.ipynb (#219)
|
2024-06-18 05:54:01 -05:00 |
|
rasbt
|
c8c0fd4fb5
|
fix spelling
|
2024-06-18 05:50:40 -05:00 |
|
rasbt
|
88ad21490c
|
replace figure
|
2024-06-18 05:46:36 -05:00 |
|
Sebastian Raschka
|
fde5bea1d9
|
Update dependency checker (#218)
* update environment checker
* update requirements.txt
|
2024-06-17 21:09:31 -05:00 |
|
rasbt
|
e2f0d936f8
|
update dotted line
|
2024-06-17 20:17:56 -05:00 |
|
Sebastian Raschka
|
d74689fc5f
|
dealing with numpy 2.0 (#216)
|
2024-06-17 10:22:36 -05:00 |
|
rasbt
|
339a7ce040
|
formating updates
|
2024-06-17 07:40:04 -05:00 |
|
rasbt
|
0ee9312662
|
formatting
|
2024-06-16 08:27:25 -05:00 |
|
Sebastian Raschka
|
232c4f338b
|
Updated ch07 (#213)
* Updated ch07
* fix links
* check links
|
2024-06-15 15:10:01 -05:00 |
|
Sebastian Raschka
|
fcf8bcab0d
|
Remove duplicated cell (#212)
* add a suggestion since code snippet has been repeated.
* remove duplicated cell
---------
Co-authored-by: Shuyib <benmainye@gmail.com>
|
2024-06-15 12:48:34 -05:00 |
|
rasbt
|
a796b9d657
|
explain truncation in ch05
|
2024-06-12 19:50:11 -05:00 |
|
rasbt
|
8fa64806fc
|
dim-consistency
|
2024-06-12 19:43:25 -05:00 |
|
Sebastian Raschka
|
8d3e58ff81
|
check gpt files (#208)
|
2024-06-12 07:19:10 -05:00 |
|
Daniel Kleine
|
e5c3c5ce99
|
minor bug fixes (#207)
* fixed path arg for create_dataset_csvs()
* updated assign_check() to remove user warning
|
2024-06-12 06:27:56 -05:00 |
|
rasbt
|
b2ff989174
|
distinguish better between main chapter code and bonus materials
|
2024-06-11 21:07:42 -05:00 |
|
Daniel Kleine
|
79210eb393
|
fixes for code (#206)
* updated .gitignore
* removed unused GELU import
* fixed model_configs, fixed all tensors on same device
* removed unused tiktoken
* update
* update hparam search
* remove redundant tokenizer argument
---------
Co-authored-by: rasbt <mail@sebastianraschka.com>
|
2024-06-11 20:59:48 -05:00 |
|
Sebastian Raschka
|
e91718f1e7
|
Add eos token to each response (#205)
* add eos token to each response
* remove figure
|
2024-06-11 08:57:12 -05:00 |
|
rasbt
|
cbbd4c5600
|
add performance of llama 3 models for reference
|
2024-06-10 18:21:58 -05:00 |
|
Daniel Kleine
|
9a81230968
|
ch07 fixes (#204)
* updated .gitginore for ch07
* fixed extract_response()
|
2024-06-10 17:31:13 -05:00 |
|
rasbt
|
029efee920
|
reorg first section
|
2024-06-10 08:20:12 -05:00 |
|
rasbt
|
b9ed5811c3
|
fix gradient comment
|
2024-06-09 20:23:18 -05:00 |
|
Sebastian Raschka
|
c3c7e64a63
|
ch07 first draft (#203)
|
2024-06-09 10:35:26 -05:00 |
|
rasbt
|
f0e4c99bc3
|
fix typo in comment
|
2024-06-09 06:14:02 -05:00 |
|
rasbt
|
e1adeb14f3
|
add allowed_special={"<|endoftext|>"}
|
2024-06-09 06:04:02 -05:00 |
|
Sebastian Raschka
|
40ba3a4068
|
Remove leftover instances of self.tokenizer (#201)
* Remove leftover instances of self.tokenizer
* add endoftext token
|
2024-06-08 14:57:34 -05:00 |
|
Sebastian Raschka
|
98d23751f7
|
Explain value truncation in some figures (#199)
* clarify truncation
* typo fix
|
2024-06-08 13:24:37 -05:00 |
|
rasbt
|
d4dba08922
|
make error more explicit
|
2024-06-08 13:21:40 -05:00 |
|
rasbt
|
a6113fcd33
|
clarify truncation
|
2024-06-08 13:13:43 -05:00 |
|
rasbt
|
b80e7804b3
|
add instruction dataset
|
2024-06-08 10:38:41 -05:00 |
|
Sebastian Raschka
|
517d86e58e
|
Add A.1 and A.2 solutions (#198)
* add A.1 and A.2 solutions
* fix links
|
2024-06-08 09:50:01 -05:00 |
|
rasbt
|
fbecc2b48b
|
remove redundant file
|
2024-06-07 08:37:46 -05:00 |
|
Daniel Kleine
|
42ecfc1c81
|
fixed code (#197)
|
2024-06-07 06:52:05 -05:00 |
|
rasbt
|
017f73d50c
|
update ollama instructions
|
2024-06-06 21:03:40 -05:00 |
|
Sebastian Raschka
|
de36026e5a
|
correlation analysis (#196)
|
2024-06-06 09:15:08 -05:00 |
|
rasbt
|
9e257212b2
|
explain ollama serve command
|
2024-06-06 06:42:54 -05:00 |
|
Daniel Kleine
|
e637393056
|
updated Dockerfile and Additional Classification Finetuning Experiments (#195)
* accuracy to .2f
* added curl
|
2024-06-05 20:17:49 -05:00 |
|
rasbt
|
1efd9313b1
|
remove empty cell
|
2024-06-05 18:18:16 -05:00 |
|
Sebastian Raschka
|
429cde81b5
|
Merge pull request #193 from rasbt/ollama-eval
Ollama-based model evaluation
|
2024-06-05 08:26:06 -05:00 |
|
Sebastian Raschka
|
1c7c937602
|
Merge branch 'main' into ollama-eval
|
2024-06-05 08:23:45 -05:00 |
|