LLMs-from-scratch

mirror of https://github.com/rasbt/LLMs-from-scratch.git synced 2025-12-21 04:02:26 +00:00

Author	SHA1	Message	Date
Sebastian Raschka	7659af7cdd	Add backup URL for gpt2 weights (#469) * Add backup URL for gpt2 weights * newline	2025-01-05 11:28:09 -06:00
Sebastian Raschka	3c3dae0967	Add mean pooling experiment to classifier bonus experiments (#406) * Add mean pooling experiment to classifier bonus experiments * formatting * add average embeddings option * pep8	2024-10-20 11:04:18 -05:00
Daniel Kleine	95926535f8	ch06/03 fixes (#336) * fixed bash commands * fixed help docstrings * added missing logreg bash cmd * Update train_bert_hf.py * Update train_bert_hf_spam.py * Update README.md --------- Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>	2024-08-27 08:23:25 +02:00
rasbt	8eb6fc0ad0	sklearn baseline and roberta-large update	2024-08-26 10:31:54 +02:00
TITC	5acab58d41	add RoBERTa and params frozen (#335) * add roberta experiment result * add roberta & params frozen * Update README.md * modify lr * modify lr --------- Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>	2024-08-26 10:27:09 +02:00
Sebastian Raschka	296a91afb8	add BERT experiment results (#333) * add BERT experiment results * cleanup * formatting	2024-08-23 08:40:40 -05:00
Sebastian Raschka	d0f3b034d8	Add download help message (#274)	2024-07-19 08:29:29 -05:00
Daniel Kleine	87f47a281a	fixed spelling typos (#258)	2024-07-03 07:47:33 -05:00
Daniel Kleine	73be1c592f	fixed num_workers (#229) * fixed num_workers * ch06 & ch07: added num_workers to create_dataloader_v1	2024-06-19 17:36:46 -05:00
Daniel Kleine	79210eb393	fixes for code (#206) * updated .gitignore * removed unused GELU import * fixed model_configs, fixed all tensors on same device * removed unused tiktoken * update * update hparam search * remove redundant tokenizer argument --------- Co-authored-by: rasbt <mail@sebastianraschka.com>	2024-06-11 20:59:48 -05:00
rasbt	f0e4c99bc3	fix typo in comment	2024-06-09 06:14:02 -05:00
Sebastian Raschka	40ba3a4068	Remove leftover instances of self.tokenizer (#201) * Remove leftover instances of self.tokenizer * add endoftext token	2024-06-08 14:57:34 -05:00
rasbt	fe8bb9291e	update formatting	2024-05-24 07:20:37 -05:00
Daniel Kleine	4b0fdab1de	removed empty line	2024-05-22 16:15:13 +00:00
rasbt	05738f8be6	fix link	2024-05-17 08:20:35 -05:00
Sebastian Raschka	47b3ff15ec	improve bonus code in chapter 06	2024-05-14 20:35:50 -04:00
Sebastian Raschka	30010c7a91	Merge branch 'main' into main	2024-05-14 08:28:02 -05:00
rasbt	6aff47ba60	fix file path name	2024-05-14 08:27:46 -05:00
Sebastian Raschka	2f1e1a3d4b	Merge branch 'main' into main	2024-05-14 08:12:19 -05:00
rasbt	0b176bb1fc	add previous chapters file	2024-05-14 08:11:58 -05:00
Sebastian Raschka	d499c90903	Merge branch 'main' into main	2024-05-14 08:07:58 -05:00
rasbt	df4c59cf6e	add missing gpt-download.py	2024-05-14 08:05:56 -05:00
Daniel Kleine	c754b14a79	added missing python run statement	2024-05-14 12:17:09 +00:00
rasbt	73e1c68f45	use validation path	2024-05-12 09:41:46 -05:00
rasbt	1c13810d30	use path	2024-05-12 09:36:35 -05:00
rasbt	a0adf0d5d3	basepath	2024-05-12 09:27:38 -05:00
rasbt	913662ebeb	basepath	2024-05-12 09:25:56 -05:00
rasbt	98c0723b3d	update dataset naming	2024-05-12 09:22:42 -05:00
rasbt	beeaf323f1	rename download_and_unzip to make it more specific	2024-05-12 08:36:24 -05:00
Sebastian Raschka	49306b271f	add header	2024-05-11 14:37:21 -05:00
rasbt	75545e4c1b	experiments with largest model	2024-05-09 07:40:09 -05:00
rasbt	9457676640	ouput -> output	2024-05-05 12:21:10 -05:00
rasbt	354bb35726	use training set len	2024-04-29 21:50:07 -05:00
Sebastian Raschka	d1edfcb63f	add roberta option (#135)	2024-04-28 13:57:36 -05:00
Sebastian Raschka	4bbd476e7a	IMDB experiments (#128) * IMDB experiments * style fixes * Update README.md	2024-04-25 07:20:53 -05:00

35 Commits