Sebastian Raschka
							
						 
					 | 
					
						
						
						
						
							
						
						
							6dd8666d9c
							
						
					 | 
					
						
						
							
							Test code in pytorch 2.4 (#285)
						
						
						
						
						
						
						
						* test code in pytorch 2.4
* update 
						
						
					 | 
					
						2024-07-24 21:53:41 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Sebastian Raschka
							
						 
					 | 
					
						
						
						
						
							
						
						
							40ba3a4068
							
						
					 | 
					
						
						
							
							Remove leftover instances of self.tokenizer (#201)
						
						
						
						
						
						
						
						* Remove leftover instances of self.tokenizer
* add endoftext token 
						
						
					 | 
					
						2024-06-08 14:57:34 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								rasbt
							
						 
					 | 
					
						
						
						
						
							
						
						
							ec6e09136a
							
						
					 | 
					
						
						
							
							update pr
						
						
						
						
						
						
					 | 
					
						2024-05-26 15:38:35 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Kostyantyn Borysenko
							
						 
					 | 
					
						
						
						
						
							
						
						
							18b5724e75
							
						
					 | 
					
						
						
							
							Fix an incorrect input dimension
						
						
						
						
						
						
					 | 
					
						2024-05-26 13:05:07 -07:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								rasbt
							
						 
					 | 
					
						
						
						
						
							
						
						
							fe8bb9291e
							
						
					 | 
					
						
						
							
							update formatting
						
						
						
						
						
						
					 | 
					
						2024-05-24 07:20:37 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								James Holcombe
							
						 
					 | 
					
						
						
						
						
							
						
						
							0b866c133f
							
						
					 | 
					
						
						
							
							Use instance tokenizer (#116)
						
						
						
						
						
						
						
						* Use instance tokenizer
* consistency updates
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com> 
						
						
					 | 
					
						2024-04-10 21:16:19 -04:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Sebastian Raschka
							
						 
					 | 
					
						
						
						
						
							
						
						
							ccd7cebbb3
							
						
					 | 
					
						
						
							
							Rename variable to context_length to make it easier on readers (#106)
						
						
						
						
						
						
						
						* rename to context length
* fix spacing 
						
						
					 | 
					
						2024-04-04 07:27:41 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								rasbt
							
						 
					 | 
					
						
						
						
						
							
						
						
							3ad442ee90
							
						
					 | 
					
						
						
							
							skip version cell
						
						
						
						
						
						
					 | 
					
						2024-03-28 08:23:33 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Sebastian Raschka
							
						 
					 | 
					
						
						
						
						
							
						
						
							a2cd8436cb
							
						
					 | 
					
						
						
							
							Ch05 supplementary code (#81)
						
						
						
						
						
						
					 | 
					
						2024-03-19 09:26:26 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Sebastian Raschka
							
						 
					 | 
					
						
						
						
						
							
						
						
							ca96abac8a
							
						
					 | 
					
						
						
							
							Set up basic test gh worklows (#79)
						
						
						
						
						
						
						
						* Set up basic test gh worklows
* update file paths
* env check
* add env check
* Update requirements.txt
* simplify
* upd 
						
						
					 | 
					
						2024-03-18 11:58:37 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								rasbt
							
						 
					 | 
					
						
						
						
						
							
						
						
							1870b4bacd
							
						
					 | 
					
						
						
							
							update stride param
						
						
						
						
						
						
					 | 
					
						2024-03-13 08:39:59 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								rasbt
							
						 
					 | 
					
						
						
						
						
							
						
						
							da33ce8054
							
						
					 | 
					
						
						
							
							remove redundant unsqueeze in mask
						
						
						
						
						
						
					 | 
					
						2024-03-09 17:42:31 -06:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								rasbt
							
						 
					 | 
					
						
						
						
						
							
						
						
							87fcfd9245
							
						
					 | 
					
						
						
							
							mha variants
						
						
						
						
						
						
					 | 
					
						2024-03-06 08:30:32 -06:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								rasbt
							
						 
					 | 
					
						
						
						
						
							
						
						
							d4754f1bdd
							
						
					 | 
					
						
						
							
							change dim=1 to dim=-1
						
						
						
						
						
						
					 | 
					
						2024-03-04 18:54:43 -06:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								rasbt
							
						 
					 | 
					
						
						
						
						
							
						
						
							b827bf4eea
							
						
					 | 
					
						
						
							
							remove redundant double-unsequeeze
						
						
						
						
						
						
					 | 
					
						2024-02-29 08:31:07 -06:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								rasbt
							
						 
					 | 
					
						
						
						
						
							
						
						
							8860e16e05
							
						
					 | 
					
						
						
							
							<|endoftext|> token in dataset v1
						
						
						
						
						
						
					 | 
					
						2024-01-21 12:03:04 -06:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								rasbt
							
						 
					 | 
					
						
						
						
						
							
						
						
							92896d817c
							
						
					 | 
					
						
						
							
							add toggle for qkv_bias
						
						
						
						
						
						
					 | 
					
						2024-01-17 07:50:57 -06:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								rasbt
							
						 
					 | 
					
						
						
						
						
							
						
						
							dfe2c3b46f
							
						
					 | 
					
						
						
							
							use blocksize in positional embedding
						
						
						
						
						
						
					 | 
					
						2024-01-15 08:15:33 -06:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								rasbt
							
						 
					 | 
					
						
						
						
						
							
						
						
							9e85f13ba9
							
						
					 | 
					
						
						
							
							readability improvements
						
						
						
						
						
						
					 | 
					
						2024-01-15 07:36:19 -06:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								rasbt
							
						 
					 | 
					
						
						
						
						
							
						
						
							a7b4880179
							
						
					 | 
					
						
						
							
							small readability updates
						
						
						
						
						
						
					 | 
					
						2024-01-14 11:58:42 -06:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								rasbt
							
						 
					 | 
					
						
						
						
						
							
						
						
							4f161bd549
							
						
					 | 
					
						
						
							
							use block size variable in positional embedding layer
						
						
						
						
						
						
					 | 
					
						2023-12-28 19:05:06 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								rasbt
							
						 
					 | 
					
						
						
						
						
							
						
						
							31980a6ef1
							
						
					 | 
					
						
						
							
							add ch03 and TOC
						
						
						
						
						
						
					 | 
					
						2023-12-09 17:13:56 -06:00 | 
					
					
						
						
							
							
							
						
					 |