LLMs-from-scratch/ch02/05_bpe-from-scratch
Sebastian Raschka 02ca4ac42d BPE cosmetics (#629)
* Llama3 from scratch improvements

* Cosmetic BPE improvements

* restore

* Update ch02/05_bpe-from-scratch/bpe-from-scratch.ipynb

* Update ch02/05_bpe-from-scratch/bpe-from-scratch.ipynb

* endoftext whitespace
2025-04-18 18:57:09 -05:00
..
2025-03-08 17:21:30 -06:00
2025-04-18 18:57:09 -05:00

Byte Pair Encoding (BPE) Tokenizer From Scratch

  • bpe-from-scratch.ipynb contains optional (bonus) code that explains and shows how the BPE tokenizer works under the hood.