Sebastian Raschka
							
						 
					 | 
					
						
						
						
						
							
						
						
							6dd8666d9c
							
						
					 | 
					
						
						
							
							Test code in pytorch 2.4 (#285)
						
						
						
						
						
						
						
						* test code in pytorch 2.4
* update 
						
						
					 | 
					
						2024-07-24 21:53:41 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Daniel Kleine
							
						 
					 | 
					
						
						
						
						
							
						
						
							79210eb393
							
						
					 | 
					
						
						
							
							fixes for code (#206)
						
						
						
						
						
						
						
						* updated .gitignore
* removed unused GELU import
* fixed model_configs, fixed all tensors on same device
* removed unused tiktoken
* update
* update hparam search
* remove redundant tokenizer argument
---------
Co-authored-by: rasbt <mail@sebastianraschka.com> 
						
						
					 | 
					
						2024-06-11 20:59:48 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								rasbt
							
						 
					 | 
					
						
						
						
						
							
						
						
							fe8bb9291e
							
						
					 | 
					
						
						
							
							update formatting
						
						
						
						
						
						
					 | 
					
						2024-05-24 07:20:37 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Sebastian Raschka
							
						 
					 | 
					
						
						
						
						
							
						
						
							a5b353667d
							
						
					 | 
					
						
						
							
							Rename drop_resid to drop_shortcut (#136)
						
						
						
						
						
						
					 | 
					
						2024-04-28 14:31:27 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Sebastian Raschka
							
						 
					 | 
					
						
						
						
						
							
						
						
							ccd7cebbb3
							
						
					 | 
					
						
						
							
							Rename variable to context_length to make it easier on readers (#106)
						
						
						
						
						
						
						
						* rename to context length
* fix spacing 
						
						
					 | 
					
						2024-04-04 07:27:41 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Sebastian Raschka
							
						 
					 | 
					
						
						
						
						
							
						
						
							5beff4e25a
							
						
					 | 
					
						
						
							
							Remove reundant dropout in MLP module (#105)
						
						
						
						
						
						
					 | 
					
						2024-04-03 20:19:08 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Sebastian Raschka
							
						 
					 | 
					
						
						
						
						
							
						
						
							a2cd8436cb
							
						
					 | 
					
						
						
							
							Ch05 supplementary code (#81)
						
						
						
						
						
						
					 | 
					
						2024-03-19 09:26:26 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								rasbt
							
						 
					 | 
					
						
						
						
						
							
						
						
							e0df4df433
							
						
					 | 
					
						
						
							
							add dropout for embedding layers
						
						
						
						
						
						
					 | 
					
						2024-03-04 07:05:06 -06:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								rasbt
							
						 
					 | 
					
						
						
						
						
							
						
						
							1d6f2c9084
							
						
					 | 
					
						
						
							
							rearrange exercise order
						
						
						
						
						
						
					 | 
					
						2024-02-11 14:46:05 -06:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								rasbt
							
						 
					 | 
					
						
						
						
						
							
						
						
							fe332006de
							
						
					 | 
					
						
						
							
							ch4 exercise solutions
						
						
						
						
						
						
					 | 
					
						2024-02-11 11:51:39 -06:00 | 
					
					
						
						
							
							
							
						
					 |