mirror of https://github.com/rasbt/LLMs-from-scratch.git synced 2025-12-13 16:01:34 +00:00

History

* updated .gitignore

* removed unused GELU import

* fixed model_configs, fixed all tensors on same device

* removed unused tiktoken

* update

* update hparam search

* remove redundant tokenizer argument

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>

2024-06-11 20:59:48 -05:00

ch04.ipynb

update formatting

2024-05-24 07:20:37 -05:00

exercise-solutions.ipynb

fixes for code (#206 )

2024-06-11 20:59:48 -05:00

gpt.py

add allowed_special={"<|endoftext|>"}

2024-06-09 06:04:02 -05:00

previous_chapters.py

Remove leftover instances of self.tokenizer (#201 )

2024-06-08 14:57:34 -05:00

README.md

flops analysis

2024-05-23 20:35:41 -05:00

tests.py

Ch05 supplementary code (#81 )

2024-03-19 09:26:26 -05:00

README.md

Chapter 4: Implementing a GPT Model from Scratch To Generate Text

ch04.ipynb contains all the code as it appears in the chapter
previous_chapters.py is a Python module that contains the MultiHeadAttention module from the previous chapter, which we import in ch04.ipynb to create the GPT model
gpt.py is a standalone Python script file with the code that we implemented thus far, including the GPT model we coded in this chapter