Unstructured Core Library
=========================

The ``unstructured`` library is designed to help preprocess and structure unstructured text documents
for use in downstream machine learning tasks. Examples of documents that can be processed
using the ``unstructured`` library include PDFs, XML, and HTML documents.

Library Documentation
---------------------

:doc:`installing`
  Instructions on how to install the ``unstructured`` library on your system.

:doc:`getting_started`
  Check out this section to learn about basic workflows in ``unstructured``.

:doc:`api`
  Access all the power of ``unstructured`` through the ``unstructured-api`` or learn to host it locally.

:doc:`bricks`
  Learn more about partitioning, cleaning, and staging bricks, including advanced usage patterns.

:doc:`connectors`
  Connect to your favorite data storage platforms for effortless batch processing of your files.

:doc:`metadata`
  Learn more about how metadata is tracked in the ``unstructured`` library.

:doc:`examples`
  Examples of other types of workflows within the ``unstructured`` package.

:doc:`integrations`
  Connect the output of ``unstructured`` to other popular ML services.

.. Hidden TOCs

.. toctree::
   :caption: Documentation
   :maxdepth: 2
   :hidden:

   installing
   getting_started
   api
   bricks
   connectors
   metadata
   examples
   integrations