mirror of
https://github.com/Unstructured-IO/unstructured.git
synced 2025-07-06 16:42:42 +00:00
Document Parsing
================

The ``unstructured`` library is designed to help preprocess and structure unstructured text documents
for use in downstream machine learning tasks. Examples of documents that can be processed
using the ``unstructured`` library include PDFs, XML and HTML documents.

Library Documentation
---------------------

:doc:`installing`
  How to install the ``unstructured`` library

:doc:`examples`
  Examples of how to use the library to parse different document types


.. Hidden TOCs

.. toctree::
   :caption: Library Documentation
   :maxdepth: 2
   :hidden:

   installing
   elements
   bricks
   examples