mirror of
https://github.com/rasbt/LLMs-from-scratch.git
synced 2025-07-24 01:13:44 +00:00
10 lines
979 B
Markdown
10 lines
979 B
Markdown
# Chapter 5: Pretraining on Unlabeled Data
|
|
|
|
- [ch05.ipynb](ch05.ipynb) contains all the code as it appears in the chapter
|
|
- [previous_chapters.py](previous_chapters.py) is a Python module that contains the `MultiHeadAttention` module and `GPTModel` class from the previous chapters, which we import in [ch05.ipynb](ch05.ipynb) to pretrain the GPT model
|
|
- [gpt_train.py](gpt_train.py) is a standalone Python script file with the code that we implemented in [ch05.ipynb](ch05.ipynb) to train the GPT model (you can think of it as a code file summarizing this chapter)
|
|
- [gpt_generate.py](gpt_generate.py) is a standalone Python script file with the code that we implemented in [ch05.ipynb](ch05.ipynb) to load and use the pretrained model weights from OpenAI
|
|
- [gpt_download.py](gpt_download.py) contains the utility functions for downloading the pretrained GPT model weights
|
|
- [exercise-solutions.ipynb](exercise-solutions.ipynb) contains the exercise solutions for this chapter
|
|
|