Sebastian Raschka 
							
						 
					 
					
						
						
						
						
							
						
						
							5016499d1d 
							
						 
					 
					
						
						
							
							Uv workflow improvements ( #531 )  
						
						... 
						
						
						
						* Uv workflow improvements
* Uv workflow improvements
* linter improvements
* pytproject.toml fixes
* pytproject.toml fixes
* pytproject.toml fixes
* pytproject.toml fixes
* pytproject.toml fixes
* pytproject.toml fixes
* windows fixes
* windows fixes
* windows fixes
* windows fixes
* windows fixes
* windows fixes
* win32 fix
* win32 fix
* win32 fix
* win32 fix
* win32 fix
* win32 fix
* win32 fix
* win32 fix
* win32 fix
* win32 fix
* win32 fix
* win32 fix
* win32 fix
* win32 fix
* win32 fix
* win32 fix
* win32 fix
* win32 fix
* win32 fix 
						
						
					 
					
						2025-02-16 13:16:51 -06:00 
						 
				 
			
				
					
						
							
							
								Daniel Kleine 
							
						 
					 
					
						
						
						
						
							
						
						
							0ed1e0d099 
							
						 
					 
					
						
						
							
							fixed typos ( #414 )  
						
						... 
						
						
						
						* fixed typos
* fixed formatting
* Update ch03/02_bonus_efficient-multihead-attention/mha-implementations.ipynb
* del weights after load into model
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com> 
						
						
					 
					
						2024-10-24 18:23:53 -05:00 
						 
				 
			
				
					
						
							
							
								Daniel Kleine 
							
						 
					 
					
						
						
						
						
							
						
						
							cba4f89514 
							
						 
					 
					
						
						
							
							updates for PyTorch 2.5 ( #408 )  
						
						... 
						
						
						
						* updated Dockerfile
* updated MHA implementations for PT 2.5
* fixed typo
* update installation instruction
* Update setup/03_optional-docker-environment/.devcontainer/Dockerfile
---------
Co-authored-by: rasbt <mail@sebastianraschka.com> 
						
						
					 
					
						2024-10-22 20:23:31 -05:00 
						 
				 
			
				
					
						
							
							
								Daniel Kleine 
							
						 
					 
					
						
						
						
						
							
						
						
							9674e62166 
							
						 
					 
					
						
						
							
							nbviewer links / typo ( #346 )  
						
						... 
						
						
						
						* fixed typo
* removed remaining nbviewer links
* Update mha-implementations.ipynb
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com> 
						
						
					 
					
						2024-09-07 07:27:28 +02:00 
						 
				 
			
				
					
						
							
							
								Sebastian Raschka 
							
						 
					 
					
						
						
						
						
							
						
						
							9bb203b1b7 
							
						 
					 
					
						
						
							
							Einsum multi-head attention ( #345 )  
						
						... 
						
						
						
						* Einsum multi-head attention
* update diff 
						
						
					 
					
						2024-09-05 18:24:33 +02:00 
						 
				 
			
				
					
						
							
							
								Daniel Kleine 
							
						 
					 
					
						
						
						
						
							
						
						
							bf1d8687f4 
							
						 
					 
					
						
						
							
							added std error bars ( #320 )  
						
						... 
						
						
						
						* added std error bars
* fixed changes
* Update on A100
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com> 
						
						
					 
					
						2024-08-13 20:57:41 -05:00 
						 
				 
			
				
					
						
							
							
								Jeroen Van Goey 
							
						 
					 
					
						
						
						
						
							
						
						
							75abb61f13 
							
						 
					 
					
						
						
							
							Small typo fix ( #313 )  
						
						... 
						
						
						
						* typo fix
* Update ch03/02_bonus_efficient-multihead-attention/mha-implementations.ipynb
* Update ch03/02_bonus_efficient-multihead-attention/mha-implementations.ipynb
* Update ch03/02_bonus_efficient-multihead-attention/mha-implementations.ipynb
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com> 
						
						
					 
					
						2024-08-12 07:54:12 -05:00 
						 
				 
			
				
					
						
							
							
								Sebastian Raschka 
							
						 
					 
					
						
						
						
						
							
						
						
							f58c2b969b 
							
						 
					 
					
						
						
							
							update attention benchmarks ( #307 )  
						
						
						
						
					 
					
						2024-08-10 09:44:11 -05:00 
						 
				 
			
				
					
						
							
							
								Jeroen Van Goey 
							
						 
					 
					
						
						
						
						
							
						
						
							70cfced899 
							
						 
					 
					
						
						
							
							fix typos, add codespell pre-commit hook ( #264 )  
						
						... 
						
						
						
						* fix typos, add codespell pre-commit hook
* Update .pre-commit-config.yaml
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com> 
						
						
					 
					
						2024-07-16 07:07:04 -05:00 
						 
				 
			
				
					
						
							
							
								rasbt 
							
						 
					 
					
						
						
						
						
							
						
						
							c0f564ee87 
							
						 
					 
					
						
						
							
							update mha dim  
						
						
						
						
					 
					
						2024-05-27 07:46:29 -05:00 
						 
				 
			
				
					
						
							
							
								rasbt 
							
						 
					 
					
						
						
						
						
							
						
						
							ec6e09136a 
							
						 
					 
					
						
						
							
							update pr  
						
						
						
						
					 
					
						2024-05-26 15:38:35 -05:00 
						 
				 
			
				
					
						
							
							
								rasbt 
							
						 
					 
					
						
						
						
						
							
						
						
							fe8bb9291e 
							
						 
					 
					
						
						
							
							update formatting  
						
						
						
						
					 
					
						2024-05-24 07:20:37 -05:00 
						 
				 
			
				
					
						
							
							
								Sebastian Raschka 
							
						 
					 
					
						
						
						
						
							
						
						
							0528446584 
							
						 
					 
					
						
						
							
							Make code more consistent and add projection layer ( #131 )  
						
						... 
						
						
						
						* Make code more consistent and add projection
* remove redundant buffer 
						
						
					 
					
						2024-04-26 17:13:08 -05:00 
						 
				 
			
				
					
						
							
							
								Sebastian Raschka 
							
						 
					 
					
						
						
						
						
							
						
						
							ccd7cebbb3 
							
						 
					 
					
						
						
							
							Rename variable to context_length to make it easier on readers ( #106 )  
						
						... 
						
						
						
						* rename to context length
* fix spacing 
						
						
					 
					
						2024-04-04 07:27:41 -05:00 
						 
				 
			
				
					
						
							
							
								Sebastian Raschka 
							
						 
					 
					
						
						
						
						
							
						
						
							cf39abac04 
							
						 
					 
					
						
						
							
							Add and link bonus material ( #84 )  
						
						
						
						
					 
					
						2024-03-23 07:27:43 -05:00 
						 
				 
			
				
					
						
							
							
								Sebastian Raschka 
							
						 
					 
					
						
						
						
						
							
						
						
							a2cd8436cb 
							
						 
					 
					
						
						
							
							Ch05 supplementary code ( #81 )  
						
						
						
						
					 
					
						2024-03-19 09:26:26 -05:00 
						 
				 
			
				
					
						
							
							
								rasbt 
							
						 
					 
					
						
						
						
						
							
						
						
							0d517e98b9 
							
						 
					 
					
						
						
							
							update  
						
						
						
						
					 
					
						2024-03-13 08:37:54 -05:00 
						 
				 
			
				
					
						
							
							
								rasbt 
							
						 
					 
					
						
						
						
						
							
						
						
							569f6bc7f0 
							
						 
					 
					
						
						
							
							benchmark numbers  
						
						
						
						
					 
					
						2024-03-13 07:12:10 -05:00 
						 
				 
			
				
					
						
							
							
								taihaozesong 
							
						 
					 
					
						
						
						
						
							
						
						
							f1fa9df15c 
							
						 
					 
					
						
						
							
							Fix mha wrapper implementations in ch03 bonus  
						
						
						
						
					 
					
						2024-03-13 18:02:26 +08:00 
						 
				 
			
				
					
						
							
							
								rasbt 
							
						 
					 
					
						
						
						
						
							
						
						
							321f3d33f9 
							
						 
					 
					
						
						
							
							add cuda warmup  
						
						
						
						
					 
					
						2024-03-10 10:31:55 -05:00 
						 
				 
			
				
					
						
							
							
								rasbt 
							
						 
					 
					
						
						
						
						
							
						
						
							da33ce8054 
							
						 
					 
					
						
						
							
							remove redundant unsqueeze in mask  
						
						
						
						
					 
					
						2024-03-09 17:42:31 -06:00 
						 
				 
			
				
					
						
							
							
								rasbt 
							
						 
					 
					
						
						
						
						
							
						
						
							6ba97adaee 
							
						 
					 
					
						
						
							
							add PyTorch version  
						
						
						
						
					 
					
						2024-03-09 17:42:30 -06:00 
						 
				 
			
				
					
						
							
							
								rasbt 
							
						 
					 
					
						
						
						
						
							
						
						
							5ca60321c4 
							
						 
					 
					
						
						
							
							add a100 numbers  
						
						
						
						
					 
					
						2024-03-09 10:20:08 -06:00 
						 
				 
			
				
					
						
							
							
								rasbt 
							
						 
					 
					
						
						
						
						
							
						
						
							29ca41799a 
							
						 
					 
					
						
						
							
							use need_weights=False  
						
						
						
						
					 
					
						2024-03-09 10:09:17 -06:00 
						 
				 
			
				
					
						
							
							
								rasbt 
							
						 
					 
					
						
						
						
						
							
						
						
							5643c88db9 
							
						 
					 
					
						
						
							
							add pytorch mha  
						
						
						
						
					 
					
						2024-03-08 09:30:55 -06:00 
						 
				 
			
				
					
						
							
							
								rasbt 
							
						 
					 
					
						
						
						
						
							
						
						
							404f48aa74 
							
						 
					 
					
						
						
							
							automatically run on gpu or cpu  
						
						
						
						
					 
					
						2024-03-07 20:14:03 -06:00 
						 
				 
			
				
					
						
							
							
								rasbt 
							
						 
					 
					
						
						
						
						
							
						
						
							99a5e28def 
							
						 
					 
					
						
						
							
							rename q,k,v for consistency with chapter 3  
						
						
						
						
					 
					
						2024-03-07 06:30:40 -06:00 
						 
				 
			
				
					
						
							
							
								Rayed Bin Wahed 
							
						 
					 
					
						
						
						
						
							
						
						
							496079c61e 
							
						 
					 
					
						
						
							
							Update mha-implementations.ipynb  
						
						... 
						
						
						
						Fix variable spelling in comments to keep consistent with code 
						
						
					 
					
						2024-03-06 23:03:57 +08:00 
						 
				 
			
				
					
						
							
							
								rasbt 
							
						 
					 
					
						
						
						
						
							
						
						
							b6fe1a37b3 
							
						 
					 
					
						
						
							
							also add simple wrapper  
						
						
						
						
					 
					
						2024-03-06 08:38:53 -06:00 
						 
				 
			
				
					
						
							
							
								rasbt 
							
						 
					 
					
						
						
						
						
							
						
						
							571377a2d6 
							
						 
					 
					
						
						
							
							update title  
						
						
						
						
					 
					
						2024-03-06 08:34:04 -06:00 
						 
				 
			
				
					
						
							
							
								rasbt 
							
						 
					 
					
						
						
						
						
							
						
						
							87fcfd9245 
							
						 
					 
					
						
						
							
							mha variants  
						
						
						
						
					 
					
						2024-03-06 08:30:32 -06:00