LLMs-from-scratch/ch06/01_main-chapter-code
Daniel Kleine dcbdc1d2e5
fixes for code (#206)
* updated .gitignore

* removed unused GELU import

* fixed model_configs, fixed all tensors on same device

* removed unused tiktoken

* update

* update hparam search

* remove redundant tokenizer argument

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2024-06-11 20:59:48 -05:00
..
2024-06-11 20:59:48 -05:00
2024-05-24 07:20:37 -05:00
2024-05-03 08:37:58 -05:00
2024-06-09 06:14:02 -05:00
2024-05-13 08:50:55 -05:00
2024-05-13 07:45:59 -05:00

Chapter 6: Finetuning for Classification

  • ch06.ipynb contains all the code as it appears in the chapter
  • previous_chapters.py is a Python module that contains the GPT model we coded and trained in previous chapters, alongside many utility functions, which we reuse in this chapter
  • gpt-class-finetune.py is a standalone Python script file with the code that we implemented in ch06.ipynb to finetune the GPT model (you can think of it as a chapter summary)
  • gpt_download.py contains the utility functions for downloading the pretrained GPT model weights
  • exercise-solutions.ipynb contains the exercise solutions for this chapter