LLMs-from-scratch/ch05/09_extending-tokenizers
Sebastian Raschka a22d612be6
Bonus material: extending tokenizers (#496)
* Bonus material: extending tokenizers

* small wording update
2025-01-22 09:26:54 -06:00

Extending the Tiktoken BPE Tokenizer with New Tokens

  • extend-tiktoken.ipynb contains optional (bonus) code that shows how to add special tokens to a tokenizer implemented via tiktoken and how to update the LLM to account for the larger vocabulary