LLMs-from-scratch/ch05/01_main-chapter-code
Sebastian Raschka cf0df54d7d
Show epochs as integers on x-axis (#241)
* Show epochs as integers on x-axis

* Update ch07/01_main-chapter-code/previous_chapters.py

* remove extra s

* modify exercise plots

* update chapter 7 plot

* resave ch07 for better file diff
2024-06-23 07:41:25 -05:00
..
2024-06-21 08:40:54 -05:00
2024-04-22 07:05:40 -05:00
2024-06-09 06:14:02 -05:00
2024-06-19 17:36:46 -05:00
2024-06-19 17:48:25 -05:00
2024-06-12 07:19:10 -05:00

Chapter 5: Pretraining on Unlabeled Data

Main Chapter Code

  • ch05.ipynb contains all the code as it appears in the chapter
  • previous_chapters.py is a Python module that contains the MultiHeadAttention module and GPTModel class from the previous chapters, which we import in ch05.ipynb to pretrain the GPT model
  • gpt_download.py contains the utility functions for downloading the pretrained GPT model weights
  • exercise-solutions.ipynb contains the exercise solutions for this chapter

Optional Code

  • gpt_train.py is a standalone Python script file with the code that we implemented in ch05.ipynb to train the GPT model (you can think of it as a code file summarizing this chapter)
  • gpt_generate.py is a standalone Python script file with the code that we implemented in ch05.ipynb to load and use the pretrained model weights from OpenAI