mirror of
				https://github.com/rasbt/LLMs-from-scratch.git
				synced 2025-10-31 09:50:23 +00:00 
			
		
		
		
	 fcf8bcab0d
			
		
	
	
		fcf8bcab0d
		
	
	
	
	
		
			
			* add a suggestion since code snippet has been repeated. * remove duplicated cell --------- Co-authored-by: Shuyib <benmainye@gmail.com>
Chapter 5: Pretraining on Unlabeled Data
- ch05.ipynb contains all the code as it appears in the chapter
- previous_chapters.py is a Python module that contains the MultiHeadAttentionmodule andGPTModelclass from the previous chapters, which we import in ch05.ipynb to pretrain the GPT model
- gpt_train.py is a standalone Python script file with the code that we implemented in ch05.ipynb to train the GPT model (you can think of it as a code file summarizing this chapter)
- gpt_generate.py is a standalone Python script file with the code that we implemented in ch05.ipynb to load and use the pretrained model weights from OpenAI
- gpt_download.py contains the utility functions for downloading the pretrained GPT model weights
- exercise-solutions.ipynb contains the exercise solutions for this chapter