mirror of
https://github.com/Unstructured-IO/unstructured.git
synced 2025-06-27 02:30:08 +00:00

* add env var for cap threshold; raise default threshold * update docs and tests * added check for ending in a comma * update docs * no caps check for all upper text * capture Text in html and text * check category in Text equality check * lower case all caps before checking for verbs * added check for us city/state/zip * added address type * add address to html * add address to text * fix for text tests; escape for large text segments * refactor regex for readability * update comment * additional test for text with linebreaks * update docs * update changelog * update elements docs * remove old comment * case -> cast * type fix
10 lines
169 B
Plaintext
10 lines
169 B
Plaintext
This is a test document to use for unit tests.
|
|
|
|
Doylestown, PA 18901
|
|
|
|
Important points:
|
|
|
|
- Hamburgers are delicious
|
|
- Dogs are the best
|
|
- I love fuzzy blankets
|