mirror of
https://github.com/rasbt/LLMs-from-scratch.git
synced 2025-10-29 00:39:58 +00:00
* added HF BPE Fast * update benchmarks * add note about performance * revert accidental changes --------- Co-authored-by: rasbt <mail@sebastianraschka.com>
Chapter 2: Working with Text Data
- compare-bpe-tiktoken.ipynb benchmarks various byte pair encoding implementations
- bpe_openai_gpt2.py is the original bytepair encoder code used by OpenAI