mirror of
https://github.com/rasbt/LLMs-from-scratch.git
synced 2025-08-09 17:23:06 +00:00

Chapter 4: Implementing a GPT Model from Scratch To Generate Text
- flops-analysis.ipynb analyses the floating point operations (FLOPs) required by the GPT model(s) implemented in the main chapter.
- previous_chapters.py is a Python module containing the GPTModel code we implemented in chapter 4 and other code implemented in previous chapters, which we import in the analysis notebook.
- requirements-extra.txt includes additional Python libraries that need to be installed (via pip install -r requirements-extra.txt).
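
To give a rough feel for the kind of number such an analysis produces, here is a minimal back-of-the-envelope sketch in plain Python. It is not necessarily the method used in flops-analysis.ipynb. It assumes a GPT-2-small-style configuration (the CONFIG dictionary, its key names, and both helper functions are illustrative assumptions, not taken from the notebook) and uses the common rule of thumb that each weight in a matrix multiplication costs about two floating point operations (one multiply, one add) per token in a forward pass.

```python
# Assumed GPT-2-small-style settings; key names are illustrative only.
CONFIG = {
    "vocab_size": 50257,
    "context_length": 1024,
    "emb_dim": 768,
    "n_layers": 12,
}


def count_linear_weights(cfg):
    """Count weights used in matrix multiplications.

    Biases, layer norms, and embedding lookups are ignored because
    they contribute comparatively few floating point operations.
    """
    d = cfg["emb_dim"]
    attn = 4 * d * d                  # query, key, value, and output projections
    ffn = 2 * d * (4 * d)             # two feed-forward layers, assuming 4x expansion
    out_head = d * cfg["vocab_size"]  # final projection to vocabulary logits
    return cfg["n_layers"] * (attn + ffn) + out_head


def estimate_forward_flops(cfg, batch_size, seq_len):
    """Approximate the FLOPs of one forward pass (an estimate, not a measurement)."""
    d = cfg["emb_dim"]
    tokens = batch_size * seq_len
    # About 2 FLOPs (multiply + add) per matmul weight per token.
    weight_flops = 2 * count_linear_weights(cfg) * tokens
    # Attention scores (Q @ K^T) and the weighted sum over values:
    # each costs about 2 * seq_len * seq_len * d FLOPs per layer per sequence.
    attn_flops = cfg["n_layers"] * batch_size * 2 * (2 * seq_len * seq_len * d)
    return weight_flops + attn_flops


if __name__ == "__main__":
    flops = estimate_forward_flops(CONFIG, batch_size=2, seq_len=1024)
    print(f"Estimated forward-pass FLOPs: {flops:.1e}")
```

Profiling tools may count operations differently (for example, by skipping the attention score matmuls or by including biases and normalization layers), so treat the result only as an order-of-magnitude check.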