mirror of
https://github.com/rasbt/LLMs-from-scratch.git
synced 2025-10-06 13:36:18 +00:00

* updated .gitignore * removed unused GELU import * fixed model_configs, fixed all tensors on same device * removed unused tiktoken * update * update hparam search * remove redundant tokenizer argument --------- Co-authored-by: rasbt <mail@sebastianraschka.com>
Chapter 4: Implementing a GPT Model from Scratch to Generate Text
- 01_main-chapter-code contains the main chapter code.
- 02_performance-analysis contains optional code analyzing the performance of the GPT model(s) implemented in the main chapter.