| 
							
							
								 casinca | 57fdd94358 | [minor] typo & comments (#441) * typo & comment
- safe -> save
- commenting code: batch_size, seq_len = in_idx.shape
* comment
- adding # NEW for assert num_heads % num_kv_groups == 0
* update memory wording
---------
Co-authored-by: rasbt <mail@sebastianraschka.com> | 2024-11-18 19:52:42 +09:00 |  | 
			
				
					| 
							
							
								 Daniel Kleine | 7e6f8ce020 | updated RoPE statement (#423) * updated RoPE statement
* updated .gitignore
* Update ch05/07_gpt_to_llama/converting-gpt-to-llama2.ipynb
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com> | 2024-10-30 08:00:08 -05:00 |  | 
			
				
					| 
							
							
								 ROHAN WINSOR | e85d154522 | Fix argument name in LlamaTokenizer constructor (#421) This PR addresses an oversight in the LlamaTokenizer class by changing the constructor argument from filepath to tokenizer_file. | 2024-10-29 18:01:36 -05:00 |  | 
			
				
					| 
							
							
								 Daniel Kleine | 8b60460319 | Updated Llama 2 to 3 paths (#413) * llama 2 and 3 path fixes
* updated llama 3, 3.1 and 3.2 paths
* updated .gitignore
* Typo fix
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com> | 2024-10-24 07:40:08 -05:00 |  | 
			
				
					| 
							
							
								 Sebastian Raschka | f8bdfe12e1 | RoPE updates (#412) * RoPE updates
* Apply suggestions from code review
* updates
* updates
* updates | 2024-10-23 18:07:49 -05:00 |  | 
			
				
					| 
							
							
								 Sebastian Raschka | 6f86c78763 | Implement Llama 3.2 (#383) | 2024-10-05 07:30:47 -05:00 |  | 
			
				
					| 
							
							
								 Sebastian Raschka | d313f61c86 | Cos-sin fix in Llama 2 bonus notebook (#381) | 2024-10-03 20:45:40 -05:00 |  | 
			
				
					| 
							
							
								 Sebastian Raschka | feb0647c79 | Improve rope settings for llama3 (#380) | 2024-10-03 08:29:54 -05:00 |  | 
			
				
					| 
							
							
								 rasbt | 2ae4ad15ba | add section numbers | 2024-09-30 08:42:22 -05:00 |  | 
			
				
					| 
							
							
								 Sebastian Raschka | b8497c1bf5 | Add llama2 unit tests (#372) * add llama2 unit tests
* update
* updates
* updates
* update file path
* update requirements file
* rmsnorm test
* update | 2024-09-25 19:40:36 -05:00 |  | 
			
				
					| 
							
							
								 rasbt | a23fca84d5 | improve formatting | 2024-09-24 18:49:17 -05:00 |  | 
			
				
					| 
							
							
								 Daniel Kleine | 4541177063 | ch05/07 gpt_to_llama text improvements (#369) * fixed typo
* fixed RMSnorm formula
* fixed SwiGLU formula
* temperature=0 for untrained model for reproducibility
* added extra info hf token | 2024-09-24 18:45:49 -05:00 |  | 
			
				
					| 
							
							
								 rasbt | 941629d2c7 | add json import | 2024-09-23 09:12:35 -05:00 |  | 
			
				
					| 
							
							
								 rasbt | 835832a0f9 | move access token to config.json | 2024-09-23 08:56:16 -05:00 |  | 
			
				
					| 
							
							
								 rasbt | 5e6c7230ac | add llama3 comparison | 2024-09-23 08:17:10 -05:00 |  | 
			
				
					| 
							
							
								 Sebastian Raschka | c38b003aa9 | GPT to Llama (#368) * GPT to Llama
* fix urls | 2024-09-23 07:34:06 -05:00 |  |