2022-09-26 14:55:20 -07:00

29 lines
637 B
ReStructuredText

Document Parsing
================
The ``unstructured`` library is designed to help preprocess structure unstructured text documents
for use in downstream machine learning tasks. Examples of documents that can be processes
using the ``unstructured`` library include PDFs, XML and HTML documents.
Library Documentation
---------------------
:doc:`installing`
How to install the ``unstructured`` library
:doc:`examples`
Examples of how to use the library to parse different document types
.. Hidden TOCs
.. toctree::
:caption: Library Documentation
:maxdepth: 2
:hidden:
installing
elements
bricks
examples