35 lines
897 B
ReStructuredText
Raw Normal View History

2022-06-29 14:35:19 -04:00
Document Parsing
================
The ``unstructured`` library is designed to help preprocess structure unstructured text documents
for use in downstream machine learning tasks. Examples of documents that can be processes
using the ``unstructured`` library include PDFs, XML and HTML documents.
Library Documentation
---------------------
:doc:`installing`
Instructions on how to install the ``unstructured`` library on your system.
:doc:`getting_started`
Check out this section to learn about basic workflows in ``unstructured``.
:doc:`bricks`
Learning more about partitioning, cleaning, and staging bricks, included advanced usage patterns.
2022-06-29 14:35:19 -04:00
:doc:`examples`
Examples of other types of workflows within the ``unstructured`` package.
2022-06-29 14:35:19 -04:00
.. Hidden TOCs
.. toctree::
:caption: Library Documentation
:maxdepth: 2
:hidden:
installing
getting_started
2022-06-29 14:35:19 -04:00
bricks
examples