mirror of
https://github.com/Unstructured-IO/unstructured.git
synced 2025-12-25 06:04:53 +00:00
fix: fix ingest paddle hanging issue (#1441)
## Summary Ingest tests are having paddle OOM issue which cause the tests to hang forever. The fix here is to remove paddle from ci and set both OCR env `TABLE_OCR` and `ENTIRE_PAGE_OCR` to `tesseract`. (will have follow up PR to investigate why this is failing) ## Test please check ingest tests in CI
This commit is contained in:
parent
eb8ce89137
commit
f962a1e57d
2
.github/workflows/ci.yml
vendored
2
.github/workflows/ci.yml
vendored
@ -289,6 +289,8 @@ jobs:
|
||||
NOTION_API_KEY: ${{ secrets.NOTION_API_KEY }}
|
||||
AWS_SECRET_ACCESS_KEY: ${{ secrets.AWS_SECRET_ACCESS_KEY }}
|
||||
AWS_ACCESS_KEY_ID: ${{ secrets.AWS_ACCESS_KEY_ID }}
|
||||
TABLE_OCR: "tesseract"
|
||||
ENTIRE_PAGE_OCR: "tesseract"
|
||||
run: |
|
||||
source .venv/bin/activate
|
||||
sudo apt-get update
|
||||
|
||||
@ -86,6 +86,8 @@ jobs:
|
||||
NOTION_API_KEY: ${{ secrets.NOTION_API_KEY }}
|
||||
AWS_SECRET_ACCESS_KEY: ${{ secrets.AWS_SECRET_ACCESS_KEY }}
|
||||
AWS_ACCESS_KEY_ID: ${{ secrets.AWS_ACCESS_KEY_ID }}
|
||||
TABLE_OCR: "tesseract"
|
||||
ENTIRE_PAGE_OCR: "tesseract"
|
||||
OVERWRITE_FIXTURES: "true"
|
||||
run: |
|
||||
source .venv/bin/activate
|
||||
|
||||
2
Makefile
2
Makefile
@ -21,7 +21,7 @@ install-base: install-base-pip-packages install-nltk-models
|
||||
install: install-base-pip-packages install-dev install-nltk-models install-test install-huggingface install-all-docs
|
||||
|
||||
.PHONY: install-ci
|
||||
install-ci: install-base-pip-packages install-nltk-models install-huggingface install-paddleocr install-all-docs install-test
|
||||
install-ci: install-base-pip-packages install-nltk-models install-huggingface install-all-docs install-test
|
||||
|
||||
.PHONY: install-base-ci
|
||||
install-base-ci: install-base-pip-packages install-nltk-models install-test
|
||||
|
||||
Loading…
x
Reference in New Issue
Block a user