mirror of
https://github.com/rasbt/LLMs-from-scratch.git
synced 2025-07-03 15:10:23 +00:00
Chapter 5: Pretraining on Unlabeled Data
- ch05.ipynb contains all the code as it appears in the chapter
- previous_chapters.py is a Python module that contains the
MultiHeadAttention
module from the previous chapter, which we import in ch05.ipynb to pretrain the GPT model - train.py is a standalone Python script file with the code that we implemented in ch05.ipynb to train the GPT model
- generate.py is a standalone Python script file with the code that we implemented in ch05.ipynb to load and use the pretrained model weights from OpenAI