rasbt
|
e1046746e8
|
remove redundant line
|
2024-06-20 10:12:28 -05:00 |
|
rasbt
|
cb194fa8fa
|
fix device loading
|
2024-06-20 08:07:00 -05:00 |
|
Sebastian Raschka
|
d440eb17bc
|
Add standalone instruction finetuning script (#233)
|
2024-06-20 07:37:47 -05:00 |
|
rasbt
|
bebd3f453f
|
example code to use the finetuned model
|
2024-06-19 20:21:14 -05:00 |
|
rasbt
|
3ba51abf53
|
consistency
|
2024-06-19 19:47:31 -05:00 |
|
rasbt
|
c1f9361428
|
add main and optional sections
|
2024-06-19 17:48:25 -05:00 |
|
rasbt
|
eb1da36e98
|
note about dropout
|
2024-06-19 17:37:48 -05:00 |
|
Daniel Kleine
|
73be1c592f
|
fixed num_workers (#229)
* fixed num_workers
* ch06 & ch07: added num_workers to create_dataloader_v1
|
2024-06-19 17:36:46 -05:00 |
|
Sebastian Raschka
|
c935725a26
|
Add pytest retry for link checks (#228)
* add pytest retry for link checks
* Update .github/workflows/check-links.yml
* newline
|
2024-06-19 07:37:41 -05:00 |
|
Daniel Kleine
|
49c77d9724
|
minor: fixed API name (#227)
* fixed api name
* Update ch07/01_main-chapter-code/ch07.ipynb
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-06-19 07:09:32 -05:00 |
|
Jinge Wang
|
605ec00a2a
|
Fix some typos in ch07.ipynb (#224)
* Fixed some typos in ch06.ipynb
* Fix some typos in ch07.ipynb
|
2024-06-19 06:14:25 -05:00 |
|
Sebastian Raschka
|
f4c8bb024c
|
add mps runtime (#223)
|
2024-06-18 20:58:59 -05:00 |
|
Daniel Kleine
|
b114053378
|
minor fixes (#222)
* fixed labels
* fixed typo
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-06-18 19:37:26 -05:00 |
|
rasbt
|
3874c477fa
|
update report template
|
2024-06-18 19:21:00 -05:00 |
|
rasbt
|
f0d2af06de
|
update report template
|
2024-06-18 19:20:45 -05:00 |
|
Sebastian Raschka
|
bd93790de3
|
remove unknown option from env
|
2024-06-18 07:52:54 -05:00 |
|
Sebastian Raschka
|
1aa1b44222
|
Update compute env
|
2024-06-18 07:51:54 -05:00 |
|
Sebastian Raschka
|
9f7a8c52d8
|
select computing env
|
2024-06-18 07:51:22 -05:00 |
|
Sebastian Raschka
|
bfe5219366
|
Update ask-a-question.md
|
2024-06-18 07:37:28 -05:00 |
|
rasbt
|
9f1c969a26
|
update bug report template
|
2024-06-18 07:34:43 -05:00 |
|
rasbt
|
97966efb79
|
indentation
|
2024-06-18 07:11:22 -05:00 |
|
rasbt
|
4a6043d16c
|
update template
|
2024-06-18 07:09:47 -05:00 |
|
rasbt
|
a85ee2759a
|
update template
|
2024-06-18 07:08:25 -05:00 |
|
rasbt
|
bbe8b4edf7
|
update template
|
2024-06-18 07:07:55 -05:00 |
|
Sebastian Raschka
|
4758b7bd03
|
Update issue templates
|
2024-06-18 07:03:55 -05:00 |
|
Sebastian Raschka
|
2c326213bb
|
Update year
|
2024-06-18 05:59:18 -05:00 |
|
Jinge Wang
|
8e2c8d0987
|
Fixed some typos in ch06.ipynb (#219)
|
2024-06-18 05:54:01 -05:00 |
|
rasbt
|
c8c0fd4fb5
|
fix spelling
|
2024-06-18 05:50:40 -05:00 |
|
rasbt
|
88ad21490c
|
replace figure
|
2024-06-18 05:46:36 -05:00 |
|
Sebastian Raschka
|
fde5bea1d9
|
Update dependency checker (#218)
* update environment checker
* update requirements.txt
|
2024-06-17 21:09:31 -05:00 |
|
rasbt
|
e2f0d936f8
|
update dotted line
|
2024-06-17 20:17:56 -05:00 |
|
Sebastian Raschka
|
d74689fc5f
|
dealing with numpy 2.0 (#216)
|
2024-06-17 10:22:36 -05:00 |
|
rasbt
|
339a7ce040
|
formating updates
|
2024-06-17 07:40:04 -05:00 |
|
rasbt
|
0ee9312662
|
formatting
|
2024-06-16 08:27:25 -05:00 |
|
Sebastian Raschka
|
232c4f338b
|
Updated ch07 (#213)
* Updated ch07
* fix links
* check links
|
2024-06-15 15:10:01 -05:00 |
|
Sebastian Raschka
|
fcf8bcab0d
|
Remove duplicated cell (#212)
* add a suggestion since code snippet has been repeated.
* remove duplicated cell
---------
Co-authored-by: Shuyib <benmainye@gmail.com>
|
2024-06-15 12:48:34 -05:00 |
|
rasbt
|
a796b9d657
|
explain truncation in ch05
|
2024-06-12 19:50:11 -05:00 |
|
rasbt
|
8fa64806fc
|
dim-consistency
|
2024-06-12 19:43:25 -05:00 |
|
Sebastian Raschka
|
8d3e58ff81
|
check gpt files (#208)
|
2024-06-12 07:19:10 -05:00 |
|
Daniel Kleine
|
e5c3c5ce99
|
minor bug fixes (#207)
* fixed path arg for create_dataset_csvs()
* updated assign_check() to remove user warning
|
2024-06-12 06:27:56 -05:00 |
|
rasbt
|
b2ff989174
|
distinguish better between main chapter code and bonus materials
|
2024-06-11 21:07:42 -05:00 |
|
Daniel Kleine
|
79210eb393
|
fixes for code (#206)
* updated .gitignore
* removed unused GELU import
* fixed model_configs, fixed all tensors on same device
* removed unused tiktoken
* update
* update hparam search
* remove redundant tokenizer argument
---------
Co-authored-by: rasbt <mail@sebastianraschka.com>
|
2024-06-11 20:59:48 -05:00 |
|
Sebastian Raschka
|
e91718f1e7
|
Add eos token to each response (#205)
* add eos token to each response
* remove figure
|
2024-06-11 08:57:12 -05:00 |
|
rasbt
|
cbbd4c5600
|
add performance of llama 3 models for reference
|
2024-06-10 18:21:58 -05:00 |
|
Daniel Kleine
|
9a81230968
|
ch07 fixes (#204)
* updated .gitginore for ch07
* fixed extract_response()
|
2024-06-10 17:31:13 -05:00 |
|
rasbt
|
029efee920
|
reorg first section
|
2024-06-10 08:20:12 -05:00 |
|
rasbt
|
b9ed5811c3
|
fix gradient comment
|
2024-06-09 20:23:18 -05:00 |
|
Sebastian Raschka
|
c3c7e64a63
|
ch07 first draft (#203)
|
2024-06-09 10:35:26 -05:00 |
|
rasbt
|
f0e4c99bc3
|
fix typo in comment
|
2024-06-09 06:14:02 -05:00 |
|
rasbt
|
e1adeb14f3
|
add allowed_special={"<|endoftext|>"}
|
2024-06-09 06:04:02 -05:00 |
|