Chapter 6: Finetuning for Classification
Main Chapter Code
- 01_main-chapter-code contains the main chapter code
Bonus Materials
- 02_bonus_additional-experiments includes additional experiments (e.g., training on the last vs. first output token, extending the input length, etc.); the last-vs.-first-token idea is illustrated in the sketch after this list
- 03_bonus_imdb-classification compares the LLM from chapter 6 with other models on a 50k IMDB movie review sentiment classification dataset
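
The "last vs. first token" experiment refers to which token position's hidden state feeds the classification head. The snippet below is a minimal sketch of that idea, not the repository's code; it assumes a GPT-style model whose final transformer block outputs hidden states of shape `(batch, num_tokens, emb_dim)` and a two-class head, with all shapes chosen arbitrarily for illustration:

```python
import torch

# Hypothetical final-layer hidden states: batch of 2 texts, 6 tokens, 768-dim embeddings
hidden_states = torch.randn(2, 6, 768)

# Classification head mapping one token's hidden state to 2 class logits
classifier = torch.nn.Linear(768, 2)

# Last-token variant: in a decoder-only (causal) model, only the last token
# attends to the full sequence, so its hidden state is the usual choice
logits_last = classifier(hidden_states[:, -1, :])

# First-token variant: the alternative compared in the bonus experiments
logits_first = classifier(hidden_states[:, 0, :])

print(logits_last.shape, logits_first.shape)  # torch.Size([2, 2]) torch.Size([2, 2])
```

Because of the causal attention mask, the first token's hidden state only encodes that single token, which is why the last-token variant is expected to work better for classification.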