This guide demonstrates how to process multiple files using the Unstructured API and S3 Connector and implement context-aware chunking. The process involves installing dependencies, configuring settings, and utilizing Python scripts to manage and chunk data effectively.
Prerequisites
=============
Ensure you have Unstructured API key and access to an S3 bucket containing the target files.
Step 1: Install Unstructured and S3 Dependency
==============================================
Install the `unstructured` package with S3 support.
Convert the combined JSON data into Unstructured Elements and apply chunking by title.
..code-block:: python
elements = dict_to_elements(combined_json_data)
chunks = chunk_by_title(elements)
Conclusion
==========
Following these steps allows for efficient processing of multiple files using the Unstructured S3 Connector. The context-aware chunking helps in organizing and analyzing the data effectively.