Mirror of https://github.com/Unstructured-IO/unstructured.git (synced 2025-10-26 15:42:15 +00:00)
Commit bc791d53f4

Adds OpenSearch as a source and destination. Since OpenSearch is a fork of Elasticsearch, these connectors rely heavily on inheriting from the Elasticsearch connectors wherever possible.

- Adds an OpenSearch source connector to ingest documents from OpenSearch.
- Adds an OpenSearch destination connector to ingest documents from any supported source, embed them, and write the embeddings and documents into OpenSearch.
- Defines an example unstructured elements schema so users can set up their unstructured OpenSearch indexes easily.

Co-authored-by: potter-potter <david.potter@gmail.com>
		
			
				
	
	
		
11 lines, 300 B, Bash

#!/usr/bin/env bash

# Ingest the selected fields from the "movies" index of a local OpenSearch
# instance and write the results to opensearch-ingest-output.
unstructured-ingest \
  opensearch \
  --metadata-exclude filename,file_directory,metadata.data_source.date_processed \
  --url http://localhost:9200 \
  --index-name movies \
  --fields 'ethnicity, director, plot' \
  --output-dir opensearch-ingest-output \
  --num-processes 2
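
After a run of the script, a quick check is to count the files that landed in the output directory. The helper below is a minimal sketch, not part of the repository: the function name `count_json_outputs` is hypothetical, and it assumes the run writes `*.json` files somewhere under the directory passed to `--output-dir`.

```shell
#!/usr/bin/env bash

# Hypothetical helper: count JSON files under an ingest output directory.
# Assumption: the ingest run writes *.json files below this directory.
count_json_outputs() {
  local dir="${1:-opensearch-ingest-output}"

  # Fail early if the directory does not exist.
  if [ ! -d "$dir" ]; then
    echo "output directory not found: $dir" >&2
    return 1
  fi

  # Search recursively; strip the padding some wc implementations add.
  find "$dir" -type f -name '*.json' | wc -l | tr -d ' '
}
```

Calling `count_json_outputs opensearch-ingest-output` prints the number of JSON files found. A count of zero after a run may point to an empty index or a mismatch in `--index-name`.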