816 Commits

Author SHA1 Message Date
Sebastian Raschka
5fc78ff9fd
Add a note about "zsh: illegal hardware instruction python" error (#555) 2025-03-02 15:18:24 -06:00
Sebastian Raschka
e9ad6cf86d
add link to supplementary ch02 video (#553) 2025-03-02 13:17:42 -06:00
Sebastian Raschka
e7740b3312
Use correct ch02 title (#551) 2025-02-28 10:16:21 -06:00
Sebastian Raschka
b1773897d3
Add BPE from scratch link (#550) 2025-02-28 09:57:41 -06:00
Greg Gandenberger
b92c0dff89
Add note about context_length (#549)
* Add note about context_length

* update note

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2025-02-27 08:36:41 -06:00
Sebastian Raschka
8939fdc846
Add setup video tutorial (#547)
* Add setup video tutorial

* updated link checks

* updated link checks
2025-02-26 13:53:22 -06:00
Sebastian Raschka
cd5cf8112b
Consistent spacing (#546) 2025-02-23 17:52:00 -06:00
Sebastian Raschka
f12b899d96
GitHub markdown updates (#545)
* GitHub markdown updates

* Apply suggestions from code review

* Apply suggestions from code review
2025-02-23 12:25:44 -06:00
Sebastian Raschka
bca82a9443
Simplify the setup in CI (#542)
* Simplify the setup in CI

* windows --no-binary option

* windows uv workflow

* add path

* some fixes

* some fixes

* some fixes

* cleanup

* cleanup

* cleanup

* cleanup

* cleanup

* more windows issues

* more windows issues

* more windows issues

* more windows issues

* more windows issues

* more windows issues

* more windows issues

* more windows issues

* revert
2025-02-20 11:56:52 -06:00
Sebastian Raschka
d1e99f6092
Fix timeout issue related to spam data backup url (#544)
* Add backup url for Spam Dataset

* import urllib

* fix url

* fix timeout issue
2025-02-20 09:26:23 -06:00
Sebastian Raschka
c39aa32ef5
Add backup url for Spam Dataset (#543)
* Add backup url for Spam Dataset

* import urllib

* fix url
2025-02-20 08:08:28 -06:00
Tim Hopper
1039bf9b80
Don't explicitly install Python. Use uv sync instead of uv add. (#540)
* Don't explicitly install Python. Use uv sync instead of uv add.

* updates

* update

* updated 3.11 workflows

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2025-02-19 10:42:13 -06:00
Kasen
7bd36dccb4
Improve BPE vocabulary saving and pair frequency handling (#539) 2025-02-19 09:51:04 -06:00
Daniel Kleine
3098b4f889
added uv to Dockerfile (#537)
* added uv to Dockerfile

* Update Dockerfile

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2025-02-18 14:53:19 -06:00
Kasen
b47884ced0
Fix incorrect indentation (#536) 2025-02-18 14:47:31 -06:00
Sebastian Raschka
67c226bf67
Badge url updates 2025-02-17 12:07:47 -06:00
rasbt
9ccecd13ae
update badges 2025-02-17 12:02:06 -06:00
rasbt
24f78865df
update badges 2025-02-17 12:00:46 -06:00
rasbt
2f67cbca0b
update readme badges 2025-02-17 11:49:41 -06:00
Matthew Feickert
a8b8eb4731
feat: Add pixi environment (#534)
* feat: Add pixi environment

* Add pixi manifest pixi.toml for Linux x86, macOS arm64, Windows 64.

* ci: Update CI workflow and unify to one

* Enable workflow dispatch.
* Add concurrency limits.
* Use pixi for CI workflow.
* Unify to a single workflow for all OS tested

* feat: Add pixi lock file

* Ensure tensorflow-cpu installed on Windows

* fix package check

* fix package check

* simplification plus uv and pip runners

* some fixes to pixi and pip

* create pixi.lock

* fix pixi.lock issue

* another attempt trying to fix get_packages

* another attempt trying to fix get_packages

* clean up python_environment_check.py

* updated runner and docs

* use bash

* proper env activation

* proper env activation

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2025-02-17 11:33:53 -06:00
Sebastian Raschka
16738b61fd
note about google colab (#535) 2025-02-16 21:12:36 -06:00
Sebastian Raschka
bacb7aa90c
Update README.md 2025-02-16 13:37:32 -06:00
ridhachahed
03d72e6eb5
Fixing small typo in package description of the pyproject.toml file (#533)
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2025-02-16 13:19:26 -06:00
Sebastian Raschka
a08d7aaa84
Uv workflow improvements (#531)
* Uv workflow improvements

* Uv workflow improvements

* linter improvements

* pyproject.toml fixes

* pyproject.toml fixes

* pyproject.toml fixes

* pyproject.toml fixes

* pyproject.toml fixes

* pyproject.toml fixes

* windows fixes

* windows fixes

* windows fixes

* windows fixes

* windows fixes

* windows fixes

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix

* win32 fix
2025-02-16 13:16:51 -06:00
rasbt
29353c74d8
reduce redundancies 2025-02-15 21:31:22 -06:00
Sebastian Raschka
fc11940ace
Add performance comparison 2025-02-15 21:16:36 -06:00
rasbt
7e335f8af0
formatting 2025-02-15 21:05:58 -06:00
rasbt
61ca01c7c5
cosmetics 2025-02-15 20:53:26 -06:00
Sebastian Raschka
3e3dc3c5dc
Native uv docs (#530)
* Replace pip by more modern uv

* uv tests

* Native uv docs

* resolve merge conflicts

* resolve merge conflicts
2025-02-15 20:35:23 -06:00
Sebastian Raschka
e9c4dac3ad
Update README.md 2025-02-15 13:17:43 -06:00
Sebastian Raschka
88fd849b88
Switch from pip to uv (#529)
* Replace pip by more modern uv

* uv tests

* update yaml

* update yaml

* update yaml

* update flake8

* update windows commands

* fix windows test

* windows fix

* windows fix

* windows fix

* windows fix

* windows fix

* windows fix

* windows fix

* windows fix

* windows fix

* windows fix
2025-02-15 13:13:13 -06:00
Sebastian Raschka
074a6efb33
Update link to vocab size increase (#526)
* Update link to vocab size increase

* Update ch05/10_llm-training-speed/README.md

* Update ch05/10_llm-training-speed/README.md
2025-02-14 08:03:01 -06:00
Sebastian Raschka
908dd2f71e
PyTorch tips for better training performance (#525)
* PyTorch tips for better training performance

* formatting

* pep 8
2025-02-12 16:10:34 -06:00
Sebastian Raschka
3c29b67cd0
Add torchrun bonus code (#524) 2025-02-11 17:01:09 -06:00
Victor Skvortsov
f90bec7dfb
Comment that DDP-script.py does not work with GPUs > 2 (#523) 2025-02-11 13:23:49 -06:00
Sebastian Raschka
a6cc574605
Upgrade to NumPy 2.0 (#520)
* Upgrade to NumPy 2.0

* bump pytorch

* bump pytorch

* bump pytorch

* bump pytorch

* bump pytorch

* update

* update packages
2025-02-09 06:21:58 -06:00
Sebastian Raschka
68e2efe1c9
Mention small discrepancy due to Dropout non-reproducibility in PyTorch (#519)
* Mention small discrepancy due to Dropout non-reproducibility in PyTorch

* bump pytorch version
2025-02-06 14:59:52 -06:00
Daniel Kleine
bd8f7522cb
fixed indention and enumeration for nvct (#518) 2025-02-06 08:17:12 -06:00
Sebastian Raschka
2dc46bedc6
Fix typo in Ch02 comments (#516) 2025-02-04 20:16:07 -06:00
Sebastian Raschka
8cfa52bf1d
More pythonic way to find the longest sequence (#512)
* More pythonic way to find the longest sequence

* pep8 fix
2025-02-01 10:22:47 -06:00
Sebastian Raschka
0e14c76dee
Test PyTorch nightly releases (#509) 2025-01-30 12:45:48 -06:00
Sebastian Raschka
25ea71e713
Alternative weight loading via .safetensors (#507) 2025-01-29 08:15:29 -06:00
Sebastian Raschka
9daa7e7511
Fix default argument in ex 7.2 (#506) 2025-01-25 10:46:48 -06:00
Sebastian Raschka
fd8d77a79d
A few cosmetic updates (#504) 2025-01-23 09:38:55 -06:00
Sebastian Raschka
0911e71497
Test for PyTorch 2.6 release candidate (#500)
* Test for PyTorch 2.6 release candidate

* update

* update

* remove extra added file
2025-01-22 18:37:48 -06:00
Sebastian Raschka
a22d612be6
Bonus material: extending tokenizers (#496)
* Bonus material: extending tokenizers

* small wording update
2025-01-22 09:26:54 -06:00
Daniel Kleine
dce46038da
add GPT2TokenizerFast to BPE comparison (#498)
* added HF BPE Fast

* update benchmarks

* add note about performance

* revert accidental changes

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2025-01-22 09:26:44 -06:00
Austin Welch
0f35e370ed
fix: preserve newline tokens in BPE encoder (#495)
* fix: preserve newline tokens in BPE encoder

* further fixes

* more fixes

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2025-01-21 12:47:15 -06:00
Daniel Kleine
60acb94894
BPE: fixed typo (#492)
* fixed typo

* use rel path if exists

* mod gitignore and use existing vocab files

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2025-01-20 20:49:53 -06:00
Sebastian Raschka
0d4967eda6
Implementing the BPE Tokenizer from Scratch (#487) 2025-01-17 12:22:00 -06:00