mirror of
				https://github.com/Unstructured-IO/unstructured.git
				synced 2025-10-30 17:38:13 +00:00 
			
		
		
		
	 79f734d3f9
			
		
	
	
		79f734d3f9
		
			
		
	
	
	
	
		
			
			auto strategy was choosing the fast strategy in cases where the pdf contents were just a flat image, resulting in no output. This PR changes the behavior of auto so that elements that can be extracted by fast are extracted, a cursory examination of the elements is made to see if there are elements with text present, and if so then these elements are used as the output. Otherwise fallback strategies come into play.
		
			
				
	
	
	
		
			496 KiB
		
	
	
	
	
	
	
	
			
		
		
	
	
			496 KiB