ISW Summarization Example
This directory shows an example of how to use unstructured, argilla, and transformers
to train a custom summarization model on Institute for the Study of War (ISW) reports
about the war in Ukraine. This example shows how, by combining these three libraries, you can
complete a data science project in hours that previously would have taken weeks.
To get started, use the following steps:
- Ensure you have Python 3.8 or higher installed on your system
- Create a new Python virtual environment
- Run `pip install -r requirements.txt` to install the dependencies
- Run `PYTHONPATH=. jupyter notebook` from this directory to launch the notebook
At this point, you'll be able to run the model training notebook.
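If you are unsure whether your interpreter meets the Python 3.8 requirement from the first step, a small preflight check along these lines may help. This is only a sketch: the names `MIN_PYTHON`, `python_is_supported`, and `describe` are made up for illustration and are not part of unstructured, argilla, or transformers.

```python
import sys

# Minimum version stated in the setup steps (assumed to still be current).
MIN_PYTHON = (3, 8)


def python_is_supported(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version is at least `minimum`."""
    return tuple(version_info[:2]) >= tuple(minimum)


def describe(version_info=sys.version_info):
    """Build a one-line, human-readable status message for a version tuple."""
    major, minor = version_info[0], version_info[1]
    status = "OK" if python_is_supported(version_info) else "too old"
    return f"Python {major}.{minor}: {status} (need {MIN_PYTHON[0]}.{MIN_PYTHON[1]}+)"


# Report on the interpreter that is running this script.
print(describe())
```

Run it with the same interpreter you plan to use for the virtual environment, since the check only reflects the Python that executes it.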