Daniel Kleine
							
						 
					 | 
					
						
						
							
							
						
						
						
							
						
						
							bbb2a0c3d5
							
						
					 | 
					
						
						
							
							fixed num_workers (#229)
						
						
						
						
						
						
						
						* fixed num_workers
* ch06 & ch07: added num_workers to create_dataloader_v1 
						
						
					 | 
					
						2024-06-19 17:36:46 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Daniel Kleine
							
						 
					 | 
					
						
						
							
							
						
						
						
							
						
						
							dcbdc1d2e5
							
						
					 | 
					
						
						
							
							fixes for code (#206)
						
						
						
						
						
						
						
						* updated .gitignore
* removed unused GELU import
* fixed model_configs, fixed all tensors on same device
* removed unused tiktoken
* update
* update hparam search
* remove redundant tokenizer argument
---------
Co-authored-by: rasbt <mail@sebastianraschka.com> 
						
						
					 | 
					
						2024-06-11 20:59:48 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Sebastian Raschka
							
						 
					 | 
					
						
						
							
							
						
						
						
							
						
						
							72a073bbbf
							
						
					 | 
					
						
						
							
							Remove leftover instances of self.tokenizer (#201)
						
						
						
						
						
						
						
						* Remove leftover instances of self.tokenizer
* add endoftext token 
						
						
					 | 
					
						2024-06-08 14:57:34 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Sebastian Raschka
							
						 
					 | 
					
						
						
							
							
						
						
						
							
						
						
							97ed38116a
							
						
					 | 
					
						
						
							
							Rename drop_resid to drop_shortcut (#136)
						
						
						
						
						
						
					 | 
					
						2024-04-28 14:31:27 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Sebastian Raschka
							
						 
					 | 
					
						
						
							
							
						
						
						
							
						
						
							dd51d4ad83
							
						
					 | 
					
						
						
							
							Make datesets and loaders compatible with multiprocessing (#118)
						
						
						
						
						
						
					 | 
					
						2024-04-13 13:57:56 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								James Holcombe
							
						 
					 | 
					
						
						
							
							
						
						
						
							
						
						
							05718c6b94
							
						
					 | 
					
						
						
							
							Use instance tokenizer (#116)
						
						
						
						
						
						
						
						* Use instance tokenizer
* consistency updates
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com> 
						
						
					 | 
					
						2024-04-10 21:16:19 -04:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Sebastian Raschka
							
						 
					 | 
					
						
						
							
							
						
						
						
							
						
						
							2de60d1bfb
							
						
					 | 
					
						
						
							
							Rename variable to context_length to make it easier on readers (#106)
						
						
						
						
						
						
						
						* rename to context length
* fix spacing 
						
						
					 | 
					
						2024-04-04 07:27:41 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Sebastian Raschka
							
						 
					 | 
					
						
						
							
							
						
						
						
							
						
						
							3829ccdb34
							
						
					 | 
					
						
						
							
							Remove reundant dropout in MLP module (#105)
						
						
						
						
						
						
					 | 
					
						2024-04-03 20:19:08 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Sebastian Raschka
							
						 
					 | 
					
						
						
						
						
							
						
						
							cf39abac04
							
						
					 | 
					
						
						
							
							Add and link bonus material (#84)
						
						
						
						
						
						
					 | 
					
						2024-03-23 07:27:43 -05:00 | 
					
					
						
						
							
							
							
						
					 |