LLMs-from-scratch/ch04/02_performance-analysis
Daniel Kleine dcbdc1d2e5
fixes for code (#206)
* updated .gitignore

* removed unused GELU import

* fixed model_configs, fixed all tensors on same device

* removed unused tiktoken

* update

* update hparam search

* remove redundant tokenizer argument

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2024-06-11 20:59:48 -05:00
..
2024-06-11 20:59:48 -05:00
2024-05-23 20:35:41 -05:00
2024-05-23 20:35:41 -05:00

Chapter 4: Implementing a GPT Model from Scratch To Generate Text

  • flops-analysis.ipynb analyses the floating point operations per second (FLOPS) of the GPT model(s) implemented in the main chapter.
  • previous_chapters.py is a Python module containing the GPTModel code we implemented in chapter 4 and other code implemented in previous chapters, which we import in the analysis notebook.
  • requirements-extra.txt includes additional Python libraries that need to be installed (via pip install -r requirements-extra.txt.