mirror of
				https://github.com/Unstructured-IO/unstructured.git
				synced 2025-10-25 15:03:54 +00:00 
			
		
		
		
	 2d1923ac7e
			
		
	
	
		2d1923ac7e
		
			
		
	
	
	
	
		
			
			Part two of: https://github.com/Unstructured-IO/unstructured/pull/2842 Main changes compared to part one: * hash computation includes element's sequence number on page, page number, document filename and its text * there are more test for deterministic behavior of IDs returned by partitioning functions + their uniqueness (guaranteed at the document level, and high probability across multiple documents) This PR addresses the following issue: https://github.com/Unstructured-IO/unstructured/issues/2461