yujunjun/unstructured
mirror of https://github.com/Unstructured-IO/unstructured.git synced 2025-09-02 13:24:03 +00:00
unstructured/unstructured
Matt Robinson f7cde5539a
fix: page_number should not always be 1 in the metadata (#657)
* fix page number issue

* add tests

* changelog and version

* update changelog
2023-05-30 15:10:14 -04:00
cleaners
Issue/encoding error eml (#639)
2023-05-30 10:24:02 -07:00
documents
feat: add page_name to metadata for Excel documents (#609)
2023-05-18 13:53:23 +00:00
file_utils
fix: add .log to list of TXT filetypes
2023-05-30 14:13:58 -04:00
ingest
Fix(ingest): Deprecate --s3-url in favor of --remote-url (#616)
2023-05-19 12:11:40 -04:00
models
chore: Remove PDF parsing code and dependencies (#75)
2022-11-21 11:47:29 -06:00
nlp
fix: add handling for non-standard rfc-2822 formats (#564)
2023-05-11 14:36:25 +00:00
partition
fix: page_number should not always be 1 in the metadata (#657)
2023-05-30 15:10:14 -04:00
staging
fix: include all metadata fields when converting to dataframe or CSV (#568)
2023-05-10 13:03:33 -04:00
__init__.py
Initial Release
2022-09-26 14:55:20 -07:00
__version__.py
Issue/encoding error eml (#639)
2023-05-30 10:24:02 -07:00
logger.py
Chore: Add a trace logger for NLP output (#561)
2023-05-10 16:16:15 +00:00
utils.py
Slack connector (#462)
2023-04-16 19:34:43 +00:00