Document Parsing
================

The ``unstructured`` library is designed to help preprocess and structure unstructured text documents
for use in downstream machine learning tasks. Examples of documents that can be processed
using the ``unstructured`` library include PDFs, XML, and HTML documents.

Library Documentation
---------------------

:doc:`installing`
  Instructions on how to install the ``unstructured`` library on your system.

:doc:`getting_started`
  Check out this section to learn about basic workflows in ``unstructured``.

:doc:`bricks`
  Learn more about partitioning, cleaning, and staging bricks, including advanced usage patterns.

:doc:`examples`
  Examples of other types of workflows within the ``unstructured`` package.

:doc:`integrations`
  We make it easy for you to connect your output with other popular ML services.

.. Hidden TOCs

.. toctree::
   :caption: Library Documentation
   :maxdepth: 2
   :hidden:

   installing
   getting_started
   bricks
   examples
   integrations