yujunjun/unstructured
mirror of https://github.com/Unstructured-IO/unstructured.git synced 2025-10-22 21:44:28 +00:00
unstructured/test_unstructured
John 6e5d27c6c3
fix pdf partition of list items being detected as titles in OCR only mode (#1119)
Closes GitHub issue #1010

adds group_bullet_paragraph func to handle grouping of bullet items that are split across multiple lines
2023-08-15 09:35:54 -07:00
Name | Last commit | Date
cleaners | fix pdf partition of list items being detected as titles in OCR only mode (#1119) | 2023-08-15 09:35:54 -07:00
documents | feat: unique_element_ids kwarg for UUID elements (#1085) | 2023-08-11 11:02:37 +00:00
file_utils | doc: update API doc to sync with new parameter in prod API (#1049) | 2023-08-09 11:09:37 -04:00
nlp | fix: correct nltk download arg order (#991) | 2023-07-28 11:29:59 -04:00
partition | Handle inline and lacking filename (#1109) | 2023-08-14 18:38:53 +00:00
staging | feat: add filter element types as post processing function (#1014) | 2023-08-03 10:50:35 -04:00
vcr_fixtures/cassettes | chore: Re-enable test_upload_label_studio_data_with_sdk (#674) | 2023-06-02 23:38:43 +00:00
__init__.py | chore: Reorganize partition bricks under partition directory (#76) | 2022-11-21 22:27:23 +00:00
test_utils.py | feat: add requires_dependencies decorator (#302) | 2023-02-28 14:50:39 +00:00