62 Commits

Author SHA1 Message Date
Sebastian Raschka
541b237eff Add utility to prevent double execution of certain cells (#437) 2024-11-14 19:56:49 +09:00
rasbt
59a5c83726 remove redundant code line 2024-10-13 15:58:11 -05:00
Sebastian Raschka
68505fab64 Fix truncation issue in classify_review function (#373) 2024-09-25 19:54:36 -05:00
Sebastian Raschka
081676d8dd Add missing bullet point 2024-09-21 12:59:12 -05:00
Mingyuan Xu
21e6971b11 Run generate example in ch06 optionally on GPU (#352)
* model.to("cuda")

model.to("cuda")

* update device placement

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2024-09-13 08:01:52 -05:00
Sebastian Raschka
a82169290e Note about MPS in ch06 and ch07 (#325) 2024-08-19 08:11:33 -05:00
TITC
0b998dff97 track tokens seen in chapter5, track examples seen in chapter6 (#319) 2024-08-13 07:09:05 -05:00
Sebastian Raschka
6dd8666d9c Test code in pytorch 2.4 (#285)
* test code in pytorch 2.4

* update
2024-07-24 21:53:41 -05:00
Sebastian Raschka
d0f3b034d8 Add download help message (#274) 2024-07-19 08:29:29 -05:00
Sebastian Raschka
2ce1d16de0 show how to use the finetuned model 2024-07-09 06:43:26 -07:00
Sebastian Raschka
2d8eacb0fa Fix links in summary sections (#254) 2024-06-29 07:51:31 -05:00
rasbt
5e24a042c1 add links to summary sections 2024-06-29 07:33:26 -05:00
Daniel Kleine
fb4e37ae15 fixed minor issues (#252)
* fixed typo

* fixed var name in md text
2024-06-29 06:38:25 -05:00
Daniel Kleine
e387742b77 minor markdown fixes (#236) 2024-06-21 13:55:34 -05:00
Sebastian Raschka
87deec0f5f Add standalone finetuning and evaluation scripts for chapter 7 (#234)
* add finetuning and eval scripts

* update link

* update links

* fix link
2024-06-21 05:23:24 -05:00
rasbt
c1f9361428 add main and optional sections 2024-06-19 17:48:25 -05:00
Daniel Kleine
73be1c592f fixed num_workers (#229)
* fixed num_workers

* ch06 & ch07: added num_workers to create_dataloader_v1
2024-06-19 17:36:46 -05:00
Jinge Wang
8e2c8d0987 Fixed some typos in ch06.ipynb (#219) 2024-06-18 05:54:01 -05:00
rasbt
c8c0fd4fb5 fix spelling 2024-06-18 05:50:40 -05:00
rasbt
88ad21490c replace figure 2024-06-18 05:46:36 -05:00
Daniel Kleine
79210eb393 fixes for code (#206)
* updated .gitignore

* removed unused GELU import

* fixed model_configs, fixed all tensors on same device

* removed unused tiktoken

* update

* update hparam search

* remove redundant tokenizer argument

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2024-06-11 20:59:48 -05:00
rasbt
f0e4c99bc3 fix typo in comment 2024-06-09 06:14:02 -05:00
Sebastian Raschka
40ba3a4068 Remove leftover instances of self.tokenizer (#201)
* Remove leftover instances of self.tokenizer

* add endoftext token
2024-06-08 14:57:34 -05:00
rasbt
fe8bb9291e update formatting 2024-05-24 07:20:37 -05:00
rasbt
c35cf65dbf add assertion about data set length 2024-05-23 06:50:43 -05:00
rasbt
c4cd48475c Fix device setting 2024-05-22 17:51:51 -05:00
rasbt
3b72e55c26 remove duplicated text 2024-05-19 11:34:47 -05:00
rasbt
5541f7c8fe add test mode for dataset download 2024-05-18 17:38:19 -05:00
rasbt
87bf79e888 tokens seen -> examples seen 2024-05-13 20:08:48 -05:00
rasbt
d9e364c04a spelling 2024-05-13 20:06:38 -05:00
rasbt
b350daaa93 add readme 2024-05-13 08:50:55 -05:00
rasbt
c95abad6d1 pep8 fixes 2024-05-13 07:50:51 -05:00
rasbt
13e4282567 tests and exercises 2024-05-13 07:45:59 -05:00
rasbt
c8bcdf5206 fix tests 2024-05-12 19:03:14 -05:00
rasbt
37c33d6fee add chapter 6 unit test 2024-05-12 18:51:28 -05:00
rasbt
6b5bc7a1cd add missing figure 2024-05-12 18:37:02 -05:00
rasbt
ccb862cc36 chapter 06 summary file 2024-05-12 18:27:50 -05:00
rasbt
98c0723b3d update dataset naming 2024-05-12 09:22:42 -05:00
rasbt
beeaf323f1 rename download_and_unzip to make it more specific 2024-05-12 08:36:24 -05:00
rasbt
84edcfaf43 use spam / not spam labels 2024-05-11 13:42:18 -05:00
rasbt
c94f24e759 reorder section 6.6 2024-05-11 08:27:07 -05:00
rasbt
db29f5c685 explain how class labels are obtained 2024-05-11 07:42:13 -05:00
rasbt
774974de97 6 -> 4 2024-05-10 07:02:14 -05:00
rasbt
dadd0f7ea3 clarify overfitting 2024-05-09 09:09:26 -05:00
rasbt
1638dc8b7f spelling improvements 2024-05-09 07:25:52 -05:00
rasbt
1e34f5a429 add note about worker number 2024-05-08 21:20:43 -05:00
rasbt
1e7d1f3bcb update figure 6.6 2024-05-08 20:46:54 -05:00
rasbt
a31d571625 text -> dataset 2024-05-08 08:14:03 -05:00
rasbt
6cc9cf9f4e make spam spelling consistent 2024-05-08 06:48:28 -05:00
rasbt
7082ecac80 formatting improvements 2024-05-06 20:35:51 -05:00