Document Parsing
================

The ``unstructured`` library is designed to help preprocess and structure
unstructured text documents for use in downstream machine learning tasks.
Examples of documents that can be processed using the ``unstructured`` library
include PDFs, XML and HTML documents.

Library Documentation
---------------------

:doc:`installing`
  Instructions on how to install the ``unstructured`` library on your system.

:doc:`getting_started`
  Check out this section to learn about basic workflows in ``unstructured``.

:doc:`bricks`
  Learn more about partitioning, cleaning, and staging bricks, including
  advanced usage patterns.

:doc:`examples`
  Examples of other types of workflows within the ``unstructured`` package.

.. Hidden TOCs

.. toctree::
  :caption: Library Documentation
  :maxdepth: 2
  :hidden:

  installing
  getting_started
  bricks
  examples