LLMs-from-scratch

mirror of https://github.com/rasbt/LLMs-from-scratch.git synced 2025-08-28 18:40:01 +00:00

Author	SHA1	Message	Date
Sebastian Raschka	07c3122b5c	Qwen3 and Llama3 equivalency teests with HF transformers (#768 ) * Qwen3 and Llama3 equivalency teests with HF transformers * update	2025-08-14 18:36:07 -05:00
Sebastian Raschka	d23b1f07b8	Add more sophisticated Qwen3 tokenizer (#729 )	2025-07-09 13:16:26 -05:00
Sebastian Raschka	e9ffdbace4	CPU compile performance for Qwen3 models (#704 ) * Ch06 classifier function asserts * Qwen3 cpu compilation perf	2025-06-23 11:06:10 -05:00
Daniel Kleine	2a530b49fe	added pkg fixes (#676 ) Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>	2025-06-21 16:07:50 -05:00
Daniel Kleine	479b0e2aa9	fixed gqa qkv code comments (#660 )	2025-06-13 08:21:28 -05:00
Sebastian Raschka	a3c4c33347	Reduce Llama 3 RoPE memory requirements (#658 ) * Llama3 from scratch improvements * Fix Llama 3 expensive RoPE memory issue * updates * update package * benchmark * remove unused rescale_theta	2025-06-12 11:08:02 -05:00
Sebastian Raschka	2dc2df593a	Llama3Fast (#593 ) * Llama3Fast * Update pkg/llms_from_scratch/tests/test_llama3.py	2025-04-01 12:56:11 -05:00
Sebastian Raschka	4128a91c1d	Add Llama 3.2 to pkg (#591 ) * Add Llama 3.2 to pkg * remove redundant attributes * update tests * updates * updates * updates * fix link * fix link	2025-03-31 18:59:47 -05:00