mirror of
https://github.com/rasbt/LLMs-from-scratch.git
synced 2025-07-24 17:33:51 +00:00

* updated .gitignore * removed unused GELU import * fixed model_configs, fixed all tensors on same device * removed unused tiktoken * update * update hparam search * remove redundant tokenizer argument --------- Co-authored-by: rasbt <mail@sebastianraschka.com>
Chapter 4: Implementing a GPT Model from Scratch To Generate Text
- ch04.ipynb contains all the code as it appears in the chapter
- previous_chapters.py is a Python module that contains the
MultiHeadAttention
module from the previous chapter, which we import in ch04.ipynb to create the GPT model - gpt.py is a standalone Python script file with the code that we implemented thus far, including the GPT model we coded in this chapter