LLMs-from-scratch

mirror of https://github.com/rasbt/LLMs-from-scratch.git synced 2025-10-20 04:20:13 +00:00

Author	SHA1	Message	Date
rasbt	2ae4ad15ba	add section numbers	2024-09-30 08:42:22 -05:00
rasbt	58d0ce83a4	llama note	2024-09-26 07:41:11 -05:00
Sebastian Raschka	b8497c1bf5	Add llama2 unit tests (#372) * add llama2 unit tests * update * updates * updates * update file path * update requirements file * rmsnorm test * update	2024-09-25 19:40:36 -05:00
rasbt	a23fca84d5	improve formatting	2024-09-24 18:49:17 -05:00
Daniel Kleine	4541177063	ch05/07 gpt_to_llama text improvements (#369) * fixed typo * fixed RMSnorm formula * fixed SwiGLU formula * temperature=0 for untrained model for reproducibility * added extra info hf token	2024-09-24 18:45:49 -05:00
rasbt	941629d2c7	add json import	2024-09-23 09:12:35 -05:00
rasbt	835832a0f9	move access token to config.json	2024-09-23 08:56:16 -05:00
rasbt	5e6c7230ac	add llama3 comparison	2024-09-23 08:17:10 -05:00
Sebastian Raschka	c38b003aa9	GPT to Llama (#368) * GPT to Llama * fix urls	2024-09-23 07:34:06 -05:00
Sebastian Raschka	7a9a17608d	Add user interface to ch06 and ch07 (#366) * Add user interface to ch06 and ch07 * pep8 * fix url	2024-09-21 20:33:00 -05:00
rasbt	0f395921d7	remove unused function from user interface	2024-09-21 14:17:35 -05:00
Daniel Kleine	92ad9570e4	Chainlit bonus material fixes (#361) * fix cmd * moved idx to device * improved code with clone().detach() * fixed path * fix: added extra line for pep8 * updated .gitginore * Update ch05/06_user_interface/app_orig.py * Update ch05/06_user_interface/app_own.py * Apply suggestions from code review --------- Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>	2024-09-18 08:08:50 -07:00
Sebastian Raschka	1bc560fb13	Add chatpgpt-like user interface (#360) * Add chatpgpt-like user interface * fixes	2024-09-17 08:26:44 -05:00
Sebastian Raschka	092b5b5429	topk comment	2024-08-20 20:44:15 -05:00
rasbt	f4e45a3f40	add note about duplicated cell	2024-08-19 21:04:18 -05:00
Sebastian Raschka	11e2f56af5	Note about MPS devices (#329)	2024-08-19 20:58:45 -05:00
Sebastian Raschka	0991c1ff24	Note about ch05 mps support (#324)	2024-08-19 07:40:24 -05:00
rasbt	742525cb28	remove redundant indentation	2024-08-16 07:54:02 -05:00
rasbt	6533ce63c1	fix code cell ordering	2024-08-12 19:04:05 -05:00
Sebastian Raschka	8d79fb13b0	Update README.md	2024-08-10 07:54:51 -05:00
Daniel Kleine	c91999b9f4	fixed bash command (#305) Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>	2024-08-09 21:29:04 -05:00
TITC	3067ed83dc	remove all non-English texts and notice (#304) * remove all non-English texts and notice 1. almost 18GB txt left after `is_english` filtered. 2. remove notice use gutenberg's strip_headers 3. after re-run get_data.py, seems all data are under `gutenberg/data/.mirror` folder. * some improvements * update readme --------- Co-authored-by: rasbt <mail@sebastianraschka.com>	2024-08-09 17:09:14 -05:00
TITC	7374d617b4	total training iters may equal to warmup_iters (#301) With total_training_iters=20 and warmup_iters=20 (len(train_loader)=4 multiplied by n_epochs=5), a ZeroDivisionError occurred. ```shell Traceback (most recent call last): File "LLMs-from-scratch/ch05/05_bonus_hparam_tuning/hparam_search.py", line 191, in <module> train_loss, val_loss = train_model( File "/mnt/raid1/docker/ai/LLMs-from-scratch/ch05/05_bonus_hparam_tuning/hparam_search.py", line 90, in train_model progress = (global_step - warmup_iters) / (total_training_iters - warmup_iters) ZeroDivisionError: division by zero ```	2024-08-06 07:10:05 -05:00
SSebo	22681878a8	Update ch05.ipynb (#297) typo	2024-08-05 07:12:27 -05:00
Sebastian Raschka	6dd8666d9c	Test code in pytorch 2.4 (#285) * test code in pytorch 2.4 * update	2024-07-24 21:53:41 -05:00
TITC	bce3a708f9	47,678-->48,725 (#281)	2024-07-22 21:24:57 -05:00
Sebastian Raschka	d0f3b034d8	Add download help message (#274)	2024-07-19 08:29:29 -05:00
rasbt	5e24a042c1	add links to summary sections	2024-06-29 07:33:26 -05:00
rasbt	0f43890a15	refresh cross entropy figure	2024-06-29 07:22:23 -05:00
Daniel Kleine	fb4e37ae15	fixed minor issues (#252) * fixed typo * fixed var name in md text	2024-06-29 06:38:25 -05:00
Daniel Kleine	7a54d383e7	minor fixes (#246) * removed duplicated white spaces * Update ch07/01_main-chapter-code/ch07.ipynb * Update ch07/05_dataset-generation/llama3-ollama.ipynb * removed duplicated white spaces * fixed title again --------- Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>	2024-06-25 17:30:30 -05:00
Sebastian Raschka	def84a039c	Show epochs as integers on x-axis (#241) * Show epochs as integers on x-axis * Update ch07/01_main-chapter-code/previous_chapters.py * remove extra s * modify exercise plots * update chapter 7 plot * resave ch07 for better file diff	2024-06-23 07:41:25 -05:00
rasbt	0026e6206b	update generate to match output in main chapter	2024-06-22 12:01:51 -05:00
Daniel Kleine	7e0c5c0975	minor fixes (#235) * removed unnecessary imports * removed unnecessary semicolons * format markdown * format markdown * fixed markdown	2024-06-21 08:40:54 -05:00
rasbt	e1046746e8	remove redundant line	2024-06-20 10:12:28 -05:00
rasbt	cb194fa8fa	fix device loading	2024-06-20 08:07:00 -05:00
rasbt	c1f9361428	add main and optional sections	2024-06-19 17:48:25 -05:00
rasbt	eb1da36e98	note about dropout	2024-06-19 17:37:48 -05:00
Daniel Kleine	73be1c592f	fixed num_workers (#229) * fixed num_workers * ch06 & ch07: added num_workers to create_dataloader_v1	2024-06-19 17:36:46 -05:00
Sebastian Raschka	fcf8bcab0d	Remove duplicated cell (#212) * add a suggestion since code snippet has been repeated. * remove duplicated cell --------- Co-authored-by: Shuyib <benmainye@gmail.com>	2024-06-15 12:48:34 -05:00
rasbt	a796b9d657	explain truncation in ch05	2024-06-12 19:50:11 -05:00
Sebastian Raschka	8d3e58ff81	check gpt files (#208)	2024-06-12 07:19:10 -05:00
Daniel Kleine	e5c3c5ce99	minor bug fixes (#207) * fixed path arg for create_dataset_csvs() * updated assign_check() to remove user warning	2024-06-12 06:27:56 -05:00
rasbt	b2ff989174	distinguish better between main chapter code and bonus materials	2024-06-11 21:07:42 -05:00
Daniel Kleine	79210eb393	fixes for code (#206) * updated .gitignore * removed unused GELU import * fixed model_configs, fixed all tensors on same device * removed unused tiktoken * update * update hparam search * remove redundant tokenizer argument --------- Co-authored-by: rasbt <mail@sebastianraschka.com>	2024-06-11 20:59:48 -05:00
rasbt	f0e4c99bc3	fix typo in comment	2024-06-09 06:14:02 -05:00
Sebastian Raschka	40ba3a4068	Remove leftover instances of self.tokenizer (#201) * Remove leftover instances of self.tokenizer * add endoftext token	2024-06-08 14:57:34 -05:00
rasbt	5a1e0eecce	fix learning rate scheduler	2024-06-03 07:06:42 -05:00
rasbt	f7e528fca6	update loss	2024-05-31 07:30:57 -05:00
Kumar Utsav	b48d436bfc	Update ch05.ipynb Fixed incorrect token ids	2024-05-29 20:34:23 +05:30

120 Commits