| 
							
							
								 casinca | bb31de8999 | [minor] typo & comments (#441) * typo & comment
- safe -> save
- commenting code: batch_size, seq_len = in_idx.shape
* comment
- adding # NEW for assert num_heads % num_kv_groups == 0
* update memory wording
---------
Co-authored-by: rasbt <mail@sebastianraschka.com> | 2024-11-18 19:52:42 +09:00 |  | 
			
				
					| 
							
							
								 Daniel Kleine | 81eed9afe2 | updated RoPE statement (#423) * updated RoPE statement
* updated .gitignore
* Update ch05/07_gpt_to_llama/converting-gpt-to-llama2.ipynb
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com> | 2024-10-30 08:00:08 -05:00 |  | 
			
				
					| 
							
							
								 ROHAN WINSOR | cd24a27161 | Fix argument name in LlamaTokenizer constructor (#421) This PR addresses an oversight in the LlamaTokenizer class by changing the constructor argument from filepath to tokenizer_file. | 2024-10-29 18:01:36 -05:00 |  | 
			
				
					| 
							
							
								 Daniel Kleine | d38083c401 | Updated Llama 2 to 3 paths (#413) * llama 2 and 3 path fixes
* updated llama 3, 3.1 and 3.2 paths
* updated .gitignore
* Typo fix
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com> | 2024-10-24 07:40:08 -05:00 |  | 
			
				
					| 
							
							
								 Sebastian Raschka | 7cd6a670ed | RoPE updates (#412) * RoPE updates
* Apply suggestions from code review
* updates
* updates
* updates | 2024-10-23 18:07:49 -05:00 |  | 
			
				
					| 
							
							
								 Sebastian Raschka | b44096acef | Implement Llama 3.2 (#383) | 2024-10-05 07:30:47 -05:00 |  | 
			
				
					| 
							
							
								 Sebastian Raschka | a5405c255d | Cos-sin fix in Llama 2 bonus notebook (#381) | 2024-10-03 20:45:40 -05:00 |  | 
			
				
					| 
							
							
								 Sebastian Raschka | b993c2b25b | Improve rope settings for llama3 (#380) | 2024-10-03 08:29:54 -05:00 |  | 
			
				
					| 
							
							
								 rasbt | 278a50a348 | add section numbers | 2024-09-30 08:42:22 -05:00 |  | 
			
				
					| 
							
							
								 Sebastian Raschka | b56d0b2942 | Add llama2 unit tests (#372) * add llama2 unit tests
* update
* updates
* updates
* update file path
* update requirements file
* rmsnorm test
* update | 2024-09-25 19:40:36 -05:00 |  | 
			
				
					| 
							
							
								 rasbt | a6d8e93da3 | improve formatting | 2024-09-24 18:49:17 -05:00 |  | 
			
				
					| 
							
							
								 Daniel Kleine | ff31b345b0 | ch05/07 gpt_to_llama text improvements (#369) * fixed typo
* fixed RMSnorm formula
* fixed SwiGLU formula
* temperature=0 for untrained model for reproducibility
* added extra info hf token | 2024-09-24 18:45:49 -05:00 |  | 
			
				
					| 
							
							
								 rasbt | d144bd5b7a | add json import | 2024-09-23 09:12:35 -05:00 |  | 
			
				
					| 
							
							
								 rasbt | 6bc3de165c | move access token to config.json | 2024-09-23 08:56:16 -05:00 |  | 
			
				
					| 
							
							
								 rasbt | 58df945ed4 | add llama3 comparison | 2024-09-23 08:17:10 -05:00 |  | 
			
				
					| 
							
							
								 Sebastian Raschka | 0467c8289b | GPT to Llama (#368) * GPT to Llama
* fix urls | 2024-09-23 07:34:06 -05:00 |  |