Bricks
======
Bricks are functions that live in ``unstructured`` and are the primary public API for the library.
There are several types of bricks in ``unstructured``, corresponding to the different stages of document pre-processing: partitioning, cleaning, extracting, and staging.
After reading this section, you should understand the following:

* How to partition a document into JSON or CSV.
* How to remove unwanted content from document elements using cleaning bricks.
* How to extract content from a document using the extraction bricks.
* How to prepare data for downstream use cases using staging bricks.

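
The stages listed above can be pictured as a small pipeline of plain functions. The sketch below is only a conceptual illustration written with the Python standard library. Every function name in it (``toy_partition``, ``toy_clean``, ``toy_extract_emails``, ``toy_stage_json``) is a hypothetical stand-in and is not part of ``unstructured``; the real bricks are documented on the pages linked from this section.

```python
import json
import re

# NOTE: all names below are hypothetical stand-ins for illustration only.
# They are NOT functions from the ``unstructured`` library.


def toy_partition(text):
    """Partitioning: split raw text into a list of element dicts."""
    blocks = [block for block in text.split("\n\n") if block.strip()]
    return [{"type": "Text", "text": block} for block in blocks]


def toy_clean(element):
    """Cleaning: collapse runs of whitespace inside one element."""
    cleaned = re.sub(r"\s+", " ", element["text"]).strip()
    return {**element, "text": cleaned}


def toy_extract_emails(element):
    """Extracting: pull email-like strings out of one element."""
    return re.findall(r"[\w.+-]+@[\w-]+\.[\w.-]+", element["text"])


def toy_stage_json(elements):
    """Staging: serialize the elements for a downstream consumer."""
    return json.dumps(elements, indent=2)


raw = "Quarterly   report\n\nContact:  jane@example.com   for details."

# Partition first, then clean each element individually.
elements = [toy_clean(el) for el in toy_partition(raw)]

# Extraction reads from the elements without modifying them.
emails = [addr for el in elements for addr in toy_extract_emails(el)]

# Staging produces the final serialized output.
staged = toy_stage_json(elements)
```

Each stage here takes the previous stage's output and returns new data, which is why the stages can be mixed and matched: cleaning and extraction work on a single element, while partitioning and staging work on a whole document or a list of elements.
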

.. toctree::
   :maxdepth: 1

   bricks/partition
   bricks/cleaning
   bricks/extracting
   bricks/staging