mirror of
https://github.com/Unstructured-IO/unstructured.git
synced 2025-11-05 20:37:36 +00:00
we are seeing some .eml files come through the VLM partitioner. Which then downgrades to hi-res i believe. For some reason they have a date format that is not standard email format. But it is still legitimate. This uses a more robust date package to parse the date. This package is already installed. --------- Co-authored-by: ryannikolaidis <1208590+ryannikolaidis@users.noreply.github.com> Co-authored-by: potter-potter <potter-potter@users.noreply.github.com>
6 lines
161 B
Plaintext
6 lines
161 B
Plaintext
Date: INVALID-DATE-FORMAT
|
|
From: sender@example.com
|
|
To: recipient@example.com
|
|
Subject: Test invalid date format
|
|
|
|
This is a test-email with an invalid date format. |