46 Commits

Author SHA1 Message Date
Sebastian Raschka
4fad4695f6 Fix timeout issue related to spam data backup url (#544)
* Add backup url for Spam Dataset

* import urllib

* fix url

* fix timeout issue
2025-02-20 09:26:23 -06:00
Sebastian Raschka
eb6787397c Add backup url for Spam Dataset (#543)
* Add backup url for Spam Dataset

* import urllib

* fix url
2025-02-20 08:08:28 -06:00
Sebastian Raschka
5016499d1d Uv workflow improvements (#531)
* Uv workflow improvements

* Uv workflow improvements

* linter improvements

* pytproject.toml fixes

* pytproject.toml fixes

* pytproject.toml fixes

* pytproject.toml fixes

* pytproject.toml fixes

* pytproject.toml fixes

* windows fixes

* windows fixes

* windows fixes

* windows fixes

* windows fixes

* windows fixes

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix
2025-02-16 13:16:51 -06:00
Sebastian Raschka
9dce43ec31 Upgrade to NumPy 2.0 (#520)
* Upgrade to NumPy 2.0

* bump pytorch

* bump pytorch

* bump pytorch

* bump pytorch

* bump pytorch

* update

* update packages
2025-02-09 06:21:58 -06:00
Sebastian Raschka
7e2092dd01 More pythonic way to find the longest sequence (#512)
* More pythonic way to find the longest sequence

* pep8 fix
2025-02-01 10:22:47 -06:00
Sebastian Raschka
541b237eff Add utility to prevent double execution of certain cells (#437) 2024-11-14 19:56:49 +09:00
Sebastian Raschka
68505fab64 Fix truncation issue in classify_review function (#373) 2024-09-25 19:54:36 -05:00
Sebastian Raschka
a82169290e Note about MPS in ch06 and ch07 (#325) 2024-08-19 08:11:33 -05:00
TITC
0b998dff97 track tokens seen in chapter5, track examples seen in chapter6 (#319) 2024-08-13 07:09:05 -05:00
Sebastian Raschka
6dd8666d9c Test code in pytorch 2.4 (#285)
* test code in pytorch 2.4

* update
2024-07-24 21:53:41 -05:00
Sebastian Raschka
2ce1d16de0 show how to use the finetuned model 2024-07-09 06:43:26 -07:00
Sebastian Raschka
2d8eacb0fa Fix links in summary sections (#254) 2024-06-29 07:51:31 -05:00
rasbt
5e24a042c1 add links to summary sections 2024-06-29 07:33:26 -05:00
Daniel Kleine
fb4e37ae15 fixed minor issues (#252)
* fixed typo

* fixed var name in md text
2024-06-29 06:38:25 -05:00
Jinge Wang
8e2c8d0987 Fixed some typos in ch06.ipynb (#219) 2024-06-18 05:54:01 -05:00
rasbt
c8c0fd4fb5 fix spelling 2024-06-18 05:50:40 -05:00
rasbt
88ad21490c replace figure 2024-06-18 05:46:36 -05:00
Daniel Kleine
79210eb393 fixes for code (#206)
* updated .gitignore

* removed unused GELU import

* fixed model_configs, fixed all tensors on same device

* removed unused tiktoken

* update

* update hparam search

* remove redundant tokenizer argument

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2024-06-11 20:59:48 -05:00
rasbt
f0e4c99bc3 fix typo in comment 2024-06-09 06:14:02 -05:00
rasbt
fe8bb9291e update formatting 2024-05-24 07:20:37 -05:00
rasbt
c35cf65dbf add assertion about data set length 2024-05-23 06:50:43 -05:00
rasbt
3b72e55c26 remove duplicated text 2024-05-19 11:34:47 -05:00
rasbt
87bf79e888 tokens seen -> examples seen 2024-05-13 20:08:48 -05:00
rasbt
6b5bc7a1cd add missing figure 2024-05-12 18:37:02 -05:00
rasbt
98c0723b3d update dataset naming 2024-05-12 09:22:42 -05:00
rasbt
beeaf323f1 rename download_and_unzip to make it more specific 2024-05-12 08:36:24 -05:00
rasbt
84edcfaf43 use spam / not spam labels 2024-05-11 13:42:18 -05:00
rasbt
c94f24e759 reorder section 6.6 2024-05-11 08:27:07 -05:00
rasbt
db29f5c685 explain how class labels are obtained 2024-05-11 07:42:13 -05:00
rasbt
774974de97 6 -> 4 2024-05-10 07:02:14 -05:00
rasbt
dadd0f7ea3 clarify overfitting 2024-05-09 09:09:26 -05:00
rasbt
1638dc8b7f spelling improvements 2024-05-09 07:25:52 -05:00
rasbt
1e34f5a429 add note about worker number 2024-05-08 21:20:43 -05:00
rasbt
1e7d1f3bcb update figure 6.6 2024-05-08 20:46:54 -05:00
rasbt
a31d571625 text -> dataset 2024-05-08 08:14:03 -05:00
rasbt
6cc9cf9f4e make spam spelling consistent 2024-05-08 06:48:28 -05:00
rasbt
7082ecac80 formatting improvements 2024-05-06 20:35:51 -05:00
rasbt
0448162fdc show downloads 2024-05-06 07:40:09 -05:00
rasbt
78829f28e9 tokenizing example 2024-05-06 07:16:40 -05:00
rasbt
c6528ede9e ch06 dataset 2024-05-06 06:55:56 -05:00
rasbt
e574d04eba classfication -> classification 2024-05-06 06:50:38 -05:00
Ikko Eltociear Ashimine
d361cef65f Update ch06.ipynb (#143)
ouput -> output
2024-05-05 12:18:20 -05:00
rasbt
a63b0f626c make code more general for larger models 2024-05-05 10:18:46 -05:00
Sebastian Raschka
c6fcadb087 Add figures for ch06 (#141) 2024-05-05 07:10:04 -05:00
rasbt
97106950c1 add description 2024-05-04 07:34:29 -05:00
Sebastian Raschka
004b0614fc Ch06 draft (#138)
* Ch06 first draft

* add utility files
2024-05-03 08:37:58 -05:00