Update README.md

Jake Poznanski 2025-02-14 11:07:42 -08:00 committed by GitHub
parent 6583fb641a
commit 0dcdbcc61a
GPG Key ID: B5690EEEBB952194

@@ -58,13 +58,14 @@ to make sure you have access to https://huggingface.co/allenai/olmOCR-7B-0225-pr
 For quick testing, try the [web demo](https://olmocr.allen.ai/). To run locally, a GPU is required, as inference is powered by [sglang](https://github.com/sgl-project/sglang) under the hood.
 Convert a Single PDF:
 ```bash
-python -m olmocr.pipeline ./localworkspace --pdfs tests/gnarly_pdfs/horribleocr.pdf # will convert one PDF into a directory called `localworkspace`
+python -m olmocr.pipeline ./localworkspace --pdfs tests/gnarly_pdfs/horribleocr.pdf
 ```
 Convert Multiple PDFs:
 ```bash
 python -m olmocr.pipeline ./localworkspace --pdfs tests/gnarly_pdfs/*.pdf
 ```
+Results will be stored as JSON in `./localworkspace`.
 #### Viewing Results