mirror of
https://github.com/Unstructured-IO/unstructured.git
synced 2025-07-12 03:25:54 +00:00

This PR adds documentation of models supported by the `Unstructured` tool. The changes reflect the tool's capabilities, usage examples, and the process for integrating custom models. Sections: - Detailed the basic usage of the `Unstructured` partition with the model name. - Provided a list of available models in the `Unstructured` partition. - Added instructions on using non-default models via three distinct methods. - Explained leveraging models from the LayoutParser's model zoo with `UnstructuredDetectronModel`. - Guided users in integrating their custom object detection models using the `UnstructuredObjectDetectionModel` class. Tested the docs build with: > cd docs > pip install -r requirements.txt > make html
17 lines
512 B
ReStructuredText
17 lines
512 B
ReStructuredText
Best Practices
|
|
==============
|
|
|
|
Unstructured offers a few strategies and models to extract document information. These best practices are intended to provide guidelines to configure the ``strategy`` and ``model`` configurations to optimize document information extraction.
|
|
|
|
High-level overview of available strategies and models in ``Unstructured`` library:
|
|
|
|
.. image:: imgs/strategy.png
|
|
:width: 1000
|
|
:alt: Alternative text
|
|
|
|
.. toctree::
|
|
:maxdepth: 1
|
|
|
|
best_practices/strategies
|
|
best_practices/models
|