mirror of
https://github.com/Unstructured-IO/unstructured.git
synced 2025-06-27 02:30:08 +00:00

Update partition_eml and partition_msg to capture cc, bcc, and message id fields. Docs PR: https://github.com/Unstructured-IO/docs/pull/135/files Testing ``` from unstructured.partition.email import partition_email from test_unstructured.unit_utils import example_doc_path elements = partition_email(filename=example_doc_path("eml/fake-email-header.eml"), include_headers=True) print(elements) elements[0].metadata.to_dict() ``` Note to reviewers: Tests in `test_unstructured/partition/test_email.py` were refactored and rearranged to group similar tests together, so it will be easiest to review those changes commit by commit. --------- Co-authored-by: ryannikolaidis <1208590+ryannikolaidis@users.noreply.github.com> Co-authored-by: Coniferish <Coniferish@users.noreply.github.com>
30 lines
1.2 KiB
Plaintext
30 lines
1.2 KiB
Plaintext
Received: from ABCDEFG-000.ABC.guide (00.0.0.00) by ABCDEFG-000.ABC.guide
|
|
([ba23::58b5:2236:45g2:88h2]) with Unstructured TTTT Server (version=ABC0_0,
|
|
cipher=ABC_ABCDE_ABC_NOPE_ABC_000_ABC_ABC000) id 00.0.000.0 via Techbox
|
|
Transport; Wed, 20 Feb 2023 10:03:18 +1200
|
|
MIME-Version: 1.0
|
|
Date: Fri, 16 Dec 2022 17:04:16 -0500
|
|
Bcc: Hello <hello@unstructured.io>
|
|
Message-ID: <CADc-_xaLB2FeVQ7mNsoX+NJb_7hAJhBKa_zet-rtgPGenj0uVw@mail.gmail.com>
|
|
Subject: Test Email
|
|
From: Matthew Robinson <mrobinson@unstructured.io>
|
|
To: Matthew Robinson <mrobinson@unstructured.io>
|
|
Cc: Fake Email <fake-email@unstructured.io>, test@unstructured.io
|
|
Content-Type: multipart/alternative; boundary="00000000000095c9b205eff92630"
|
|
|
|
--00000000000095c9b205eff92630
|
|
Content-Type: text/plain; charset="UTF-8"
|
|
|
|
This is a test email to use for unit tests.
|
|
|
|
Important points:
|
|
|
|
- Roses are red
|
|
- Violets are blue
|
|
|
|
--00000000000095c9b205eff92630
|
|
Content-Type: text/html; charset="UTF-8"
|
|
|
|
<div dir="ltr"><div>This is a test email to use for unit tests.</div><div><br></div><div>Important points:</div><div><ul><li>Roses are red</li><li>Violets are blue</li></ul></div></div>
|
|
|
|
--00000000000095c9b205eff92630-- |