LLMs-from-scratch/ch05/01_main-chapter-code
Sebastian Raschka e757091301
Organized setup instructions (#115)
* Organized setup instructions

* update tets

* link checker action

* raise error upon broken link

* fix links

* fix links

* delete duplicated paragraph
2024-04-10 22:09:46 -04:00
..
2024-04-10 22:09:46 -04:00
2024-04-07 06:03:41 -05:00
2024-04-05 07:24:46 -05:00
2024-04-10 22:09:46 -04:00
2024-04-05 07:24:46 -05:00

Chapter 5: Pretraining on Unlabeled Data

  • ch05.ipynb contains all the code as it appears in the chapter
  • previous_chapters.py is a Python module that contains the MultiHeadAttention module from the previous chapter, which we import in ch05.ipynb to pretrain the GPT model
  • gpt_train.py is a standalone Python script file with the code that we implemented in ch05.ipynb to train the GPT model
  • gpt_generate.py is a standalone Python script file with the code that we implemented in ch05.ipynb to load and use the pretrained model weights from OpenAI