12 Commits

Author SHA1 Message Date
Sebastian Raschka
6aec412421 Fix BPE bonus materials (#561)
* Fix BPE bonus materials

* fix bpe implementation

* update

* Add 'Hello, world. Is this-- a test?' test case

* update link to test file

* update path handling

* update path handling

* fix pytest paths
2025-03-08 17:21:30 -06:00
Daniel Kleine
9175590ea4 add GPT2TokenizerFast to BPE comparison (#498)
* added HF BPE Fast

* update benchmarks

* add note about performance

* revert accidental changes

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2025-01-22 09:26:44 -06:00
Sebastian Raschka
b17d097742 Implementingthe BPE Tokenizer from Scratch (#487) 2025-01-17 12:22:00 -06:00
rasbt
fe8bb9291e update formatting 2024-05-24 07:20:37 -05:00
rasbt
edcae09884 improve importlib experience for windows users 2024-04-03 06:31:15 -05:00
Sebastian Raschka
cf39abac04 Add and link bonus material (#84) 2024-03-23 07:27:43 -05:00
Sebastian Raschka
a2cd8436cb Ch05 supplementary code (#81) 2024-03-19 09:26:26 -05:00
Sebastian Raschka
ca96abac8a Set up basic test gh worklows (#79)
* Set up basic test gh worklows

* update file paths

* env check

* add env check

* Update requirements.txt

* simplify

* upd
2024-03-18 11:58:37 -05:00
Ikko Eltociear Ashimine
47519f4d14 Update compare-bpe-tiktoken.ipynb
HuggingFace -> Hugging Face
2024-03-10 01:11:35 +09:00
rasbt
f526a8d7fb add requirements file for bonus notebook 2024-03-02 16:54:24 -06:00
rasbt
0074c98968 add download utilities for vocab and encoder files 2024-01-15 17:08:04 -06:00
rasbt
d1d29d0555 ch02 code 2023-09-28 07:08:50 -05:00