Document Parsing ================ The ``unstructured`` library is designed to help preprocess structure unstructured text documents for use in downstream machine learning tasks. Examples of documents that can be processes using the ``unstructured`` library include PDFs, XML and HTML documents. Library Documentation --------------------- :doc:`installing` How to install the ``unstructured`` library :doc:`examples` Examples of how to use the library to parse different document types .. Hidden TOCs .. toctree:: :caption: Library Documentation :maxdepth: 2 :hidden: installing elements bricks examples