mirror of
https://github.com/Unstructured-IO/unstructured.git
synced 2025-12-25 06:04:53 +00:00
build(deps): bump unstructured-inference==0.5.13 (#1141)
Bump to unstructured-inference==0.5.13, which includes: Fix extracted image elements being included in layout merge, addresses the issue where an entire-page image in a PDF was not passed to the layout model when using hi_res.
This commit is contained in:
parent
9f7bd6127b
commit
dd0f582585
11
CHANGELOG.md
11
CHANGELOG.md
@ -1,3 +1,14 @@
|
||||
## 0.10.2
|
||||
|
||||
### Enhancements
|
||||
* Bump unstructured-inference==0.5.13:
|
||||
- Fix extracted image elements being included in layout merge, addresses the issue
|
||||
where an entire-page image in a PDF was not passed to the layout model when using hi_res.
|
||||
|
||||
### Features
|
||||
|
||||
### Fixes
|
||||
|
||||
## 0.10.1
|
||||
|
||||
### Enhancements
|
||||
|
||||
@ -26,4 +26,4 @@ Pillow<10.0.0
|
||||
# AttributeError: 'ResourcePath' object has no attribute 'collection'
|
||||
Office365-REST-Python-Client<2.4.3
|
||||
# NOTE(christine) Pinned to set the `unstructured-inference` version
|
||||
unstructured-inference==0.5.12
|
||||
unstructured-inference==0.5.13
|
||||
@ -205,7 +205,7 @@ typing-extensions==4.7.1
|
||||
# torch
|
||||
tzdata==2023.3
|
||||
# via pandas
|
||||
unstructured-inference==0.5.12
|
||||
unstructured-inference==0.5.13
|
||||
# via
|
||||
# -c requirements/constraints.in
|
||||
# -r requirements/extra-pdf-image.in
|
||||
|
||||
@ -4,7 +4,7 @@
|
||||
#
|
||||
# pip-compile requirements/ingest-confluence.in
|
||||
#
|
||||
atlassian-python-api==3.40.1
|
||||
atlassian-python-api==3.41.0
|
||||
# via -r requirements/ingest-confluence.in
|
||||
certifi==2023.7.22
|
||||
# via
|
||||
|
||||
File diff suppressed because one or more lines are too long
@ -9,16 +9,6 @@
|
||||
},
|
||||
"text": "Data in Brief 22 (2019) 451–457"
|
||||
},
|
||||
{
|
||||
"type": "Image",
|
||||
"element_id": "70d50409ea726a2789ebbd004bec31f4",
|
||||
"metadata": {
|
||||
"data_source": {},
|
||||
"filetype": "application/pdf",
|
||||
"page_number": 1
|
||||
},
|
||||
"text": "Contents lists available at ScienceDirect Data in Brief journal homepage: www.elsevier.com/locate/dib"
|
||||
},
|
||||
{
|
||||
"type": "UncategorizedText",
|
||||
"element_id": "869adddb184177031536477262e0dde0",
|
||||
@ -39,6 +29,16 @@
|
||||
},
|
||||
"text": "Data in Brief"
|
||||
},
|
||||
{
|
||||
"type": "NarrativeText",
|
||||
"element_id": "9234133787d0a6b3976b16569c0b5cf3",
|
||||
"metadata": {
|
||||
"data_source": {},
|
||||
"filetype": "application/pdf",
|
||||
"page_number": 1
|
||||
},
|
||||
"text": "journal homepage: www.elsevier.com/locate/dib"
|
||||
},
|
||||
{
|
||||
"type": "UncategorizedText",
|
||||
"element_id": "ac01687ab870e4bb6e7313db4654928a",
|
||||
|
||||
@ -9,16 +9,6 @@
|
||||
},
|
||||
"text": "Data in Brief 22 (2019) 484–487"
|
||||
},
|
||||
{
|
||||
"type": "Image",
|
||||
"element_id": "70d50409ea726a2789ebbd004bec31f4",
|
||||
"metadata": {
|
||||
"data_source": {},
|
||||
"filetype": "application/pdf",
|
||||
"page_number": 1
|
||||
},
|
||||
"text": "Contents lists available at ScienceDirect Data in Brief journal homepage: www.elsevier.com/locate/dib"
|
||||
},
|
||||
{
|
||||
"type": "UncategorizedText",
|
||||
"element_id": "869adddb184177031536477262e0dde0",
|
||||
@ -39,6 +29,16 @@
|
||||
},
|
||||
"text": "Data in Brief"
|
||||
},
|
||||
{
|
||||
"type": "NarrativeText",
|
||||
"element_id": "9234133787d0a6b3976b16569c0b5cf3",
|
||||
"metadata": {
|
||||
"data_source": {},
|
||||
"filetype": "application/pdf",
|
||||
"page_number": 1
|
||||
},
|
||||
"text": "journal homepage: www.elsevier.com/locate/dib"
|
||||
},
|
||||
{
|
||||
"type": "UncategorizedText",
|
||||
"element_id": "ac01687ab870e4bb6e7313db4654928a",
|
||||
|
||||
@ -852,7 +852,7 @@
|
||||
},
|
||||
{
|
||||
"type": "FigureCaption",
|
||||
"element_id": "185e67615d123b35d38ea72e0cdb6d99",
|
||||
"element_id": "d21661161ae2c8dc39e96ee5c660704b",
|
||||
"metadata": {
|
||||
"data_source": {},
|
||||
"filetype": "application/pdf",
|
||||
@ -960,16 +960,6 @@
|
||||
},
|
||||
"text": "LayoutParser provides a unified interface for existing OCR tools. Though there are many OCR tools available, they are usually configured differently with distinct APIs or protocols for using them. It can be inefficient to add new OCR tools into an existing pipeline, and difficult to make direct comparisons among the available tools to find the best option for a particular project. To this end, LayoutParser builds a series of wrappers among existing OCR engines, and provides nearly the same syntax for using them. It supports a plug-and-play style of using OCR engines, making it effortless to switch, evaluate, and compare different OCR modules:"
|
||||
},
|
||||
{
|
||||
"type": "Image",
|
||||
"element_id": "65ac0f9ae348b12ed9484b8af7296617",
|
||||
"metadata": {
|
||||
"data_source": {},
|
||||
"filetype": "application/pdf",
|
||||
"page_number": 7
|
||||
},
|
||||
"text": "ocr_agent = lp.TesseractAgent ()pOi"
|
||||
},
|
||||
{
|
||||
"type": "ListItem",
|
||||
"element_id": "bebbb4e94f1f97edeb5b96e252720a93",
|
||||
@ -1351,6 +1341,26 @@
|
||||
},
|
||||
"text": "x09 Burpunog uayor Aeydsiq 1 vondo 10g Guypunog usyoy apir:z uondo Mode I: Showing Layout on the Original Image Mode Il: Drawing OCR'd Text at the Correspoding Position"
|
||||
},
|
||||
{
|
||||
"type": "NarrativeText",
|
||||
"element_id": "aed1b21a388cefaa841f20f48d19ca98",
|
||||
"metadata": {
|
||||
"data_source": {},
|
||||
"filetype": "application/pdf",
|
||||
"page_number": 9
|
||||
},
|
||||
"text": "Mode I: Showing Layout on the Original Image"
|
||||
},
|
||||
{
|
||||
"type": "NarrativeText",
|
||||
"element_id": "915bc5f1403e01b56e77300d9354fded",
|
||||
"metadata": {
|
||||
"data_source": {},
|
||||
"filetype": "application/pdf",
|
||||
"page_number": 9
|
||||
},
|
||||
"text": "Mode Il: Drawing OCR'd Text at the Correspoding Position"
|
||||
},
|
||||
{
|
||||
"type": "NarrativeText",
|
||||
"element_id": "cc8ad6e0f933633a37b82200e6724f9e",
|
||||
|
||||
File diff suppressed because one or more lines are too long
@ -659,6 +659,16 @@
|
||||
},
|
||||
"text": "Modern society is dependent on the steady supply of electricity, every day of the year – regardless of weather, season or time of day – and nuclear energy is particularly well-suited to providing this service. Given that the majority of baseload supply is fossil-based, an increase in the use of nuclear energy would result in a rapid decarbonization of the electricity system. The International Energy Agency’s (IEA) recent report III on nuclear energy highlighted the importance of dependable baseload electricity generators and the need to properly value and compensate them for the electricity security and reliability services they provide."
|
||||
},
|
||||
{
|
||||
"type": "FigureCaption",
|
||||
"element_id": "87289b7325e8d8052da6f0182799e2a3",
|
||||
"metadata": {
|
||||
"data_source": {},
|
||||
"filetype": "application/pdf",
|
||||
"page_number": 5
|
||||
},
|
||||
"text": "ee Nie"
|
||||
},
|
||||
{
|
||||
"type": "UncategorizedText",
|
||||
"element_id": "1121cfccd5913f0a63fec40a6ffd44ea",
|
||||
@ -1700,8 +1710,8 @@
|
||||
"text": "The incredible energy density of uranium means that just a few kilos is all that is required to provide one person with enough power for a lifetime. Uranium is abundant and can be found in many parts of the world, as well as in seawater. Furthermore, spent nuclear fuel is well managed and can in most cases be recycled to produce even more power. By using nuclear energy, countries are able to take charge of their own destinies by decreasing their reliance on imported energy – enhanced independence and security in uncertain times."
|
||||
},
|
||||
{
|
||||
"type": "FigureCaption",
|
||||
"element_id": "eeda9f9210dfe4be7e82b4385290d3ca",
|
||||
"type": "NarrativeText",
|
||||
"element_id": "de916089e4fe96d481dbdbb9499694e8",
|
||||
"metadata": {
|
||||
"data_source": {},
|
||||
"filetype": "application/pdf",
|
||||
@ -1919,16 +1929,6 @@
|
||||
},
|
||||
"text": "8"
|
||||
},
|
||||
{
|
||||
"type": "Image",
|
||||
"element_id": "6d647fc38c561c01f7859e019345d367",
|
||||
"metadata": {
|
||||
"data_source": {},
|
||||
"filetype": "application/pdf",
|
||||
"page_number": 12
|
||||
},
|
||||
"text": "+44 (0)20 7451 1520www.world-nuclear.orginfo@world-nuclear.org World Nuclear Association is the international organization that represents the global nuclear industry. Its mission is to promote a wider understanding of nuclear energy among key international influencers by producing authoritative information, developing common industry positions, and contributing to the energy debate."
|
||||
},
|
||||
{
|
||||
"type": "UncategorizedText",
|
||||
"element_id": "821daa4396c0087d9d5ee9240bc5c85c",
|
||||
|
||||
@ -979,26 +979,6 @@
|
||||
},
|
||||
"text": "If the potential of nuclear energy is to be fully realized, public health and safety approaches must be recalibrated to consider a wider range of factors when considering radiation, adopting an “all-hazards” approach. Such an approach must ensure that risks are placed within a proper perspective and context, rather than looking at them in isolation. We therefore must not look at the costs – be they economic, environmental, or public health – associated with an individual power plant in isolation, but rather the costs associated with it (and its alternatives) at a societal level (Figure 4). This would entail looking at the potential risks arising from the use of nuclear power and comparing these with the risks associated with not adopting nuclear power."
|
||||
},
|
||||
{
|
||||
"type": "Image",
|
||||
"element_id": "c9889d326ca46635644c051ced3cdde5",
|
||||
"metadata": {
|
||||
"data_source": {},
|
||||
"filetype": "application/pdf",
|
||||
"page_number": 7
|
||||
},
|
||||
"text": "Plant-levelproduction costsat market prices Grid-level costsof the electricitysystem ber Jest—"
|
||||
},
|
||||
{
|
||||
"type": "Image",
|
||||
"element_id": "2550e9a8245a64cdb4de02c91133865a",
|
||||
"metadata": {
|
||||
"data_source": {},
|
||||
"filetype": "application/pdf",
|
||||
"page_number": 7
|
||||
},
|
||||
"text": "Plant-levelproduction costsat market prices"
|
||||
},
|
||||
{
|
||||
"type": "UncategorizedText",
|
||||
"element_id": "c679f20b20827cc120836b9b22b53492",
|
||||
@ -1149,6 +1129,16 @@
|
||||
},
|
||||
"text": "Therefore, World Nuclear Association calls upon policymakers and regulators to adopt an all-hazards approach, where different risks associated with energy producing technologies are placed in perspective and the appropriate context, and examined in line with the latest scientific evidence. Policymakers and regulators must ensure that their decisions regarding radiation protection do not create greater risks elsewhere. This include the recalibration of existing regulations regarding nuclear power and radiation, weighing the cost of regulatory measures against the societal benefits provided by nuclear energy."
|
||||
},
|
||||
{
|
||||
"type": "FigureCaption",
|
||||
"element_id": "59a569516614e7fbdefa6a2aef6f50a7",
|
||||
"metadata": {
|
||||
"data_source": {},
|
||||
"filetype": "application/pdf",
|
||||
"page_number": 9
|
||||
},
|
||||
"text": "yeee PALEESO OOcrane a 72."
|
||||
},
|
||||
{
|
||||
"type": "UncategorizedText",
|
||||
"element_id": "10159baf262b43a92d95db59dae1f72c",
|
||||
@ -1419,16 +1409,6 @@
|
||||
},
|
||||
"text": "8"
|
||||
},
|
||||
{
|
||||
"type": "Image",
|
||||
"element_id": "6d647fc38c561c01f7859e019345d367",
|
||||
"metadata": {
|
||||
"data_source": {},
|
||||
"filetype": "application/pdf",
|
||||
"page_number": 12
|
||||
},
|
||||
"text": "+44 (0)20 7451 1520www.world-nuclear.orginfo@world-nuclear.org World Nuclear Association is the international organization that represents the global nuclear industry. Its mission is to promote a wider understanding of nuclear energy among key international influencers by producing authoritative information, developing common industry positions, and contributing to the energy debate."
|
||||
},
|
||||
{
|
||||
"type": "UncategorizedText",
|
||||
"element_id": "821daa4396c0087d9d5ee9240bc5c85c",
|
||||
|
||||
@ -1 +1 @@
|
||||
__version__ = "0.10.1" # pragma: no cover
|
||||
__version__ = "0.10.2" # pragma: no cover
|
||||
|
||||
Loading…
x
Reference in New Issue
Block a user