From 53be91a0ed52f8bfa30cd778ee256cb9fed080ce Mon Sep 17 00:00:00 2001
From: Jake Poznanski
Date: Thu, 23 Oct 2025 18:26:39 +0000
Subject: [PATCH] Add citations and arxiv paper links

---
 README.md | 20 +++++++++++++++++---
 1 file changed, 17 insertions(+), 3 deletions(-)

diff --git a/README.md b/README.md
index 9f9f7ef..6ce6eb4 100644
--- a/README.md
+++ b/README.md
@@ -9,10 +9,10 @@
 GitHub release
-
+
 Tech Report v1
-
+
 Tech Report v2
@@ -411,8 +411,9 @@ A full copy of the license can be found [on GitHub](https://github.com/allenai/o
 
 ## Citing
 
+For olmOCR v1 and OlmOCR-bench:
 ```bibtex
-@misc{olmocr,
+@misc{olmocrbench,
   title={{olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models}},
   author={Jake Poznanski and Jon Borchardt and Jason Dunkelberger and Regan Huff and Daniel Lin and Aman Rangapur and Christopher Wilhelm and Kyle Lo and Luca Soldaini},
   year={2025},
@@ -422,3 +423,16 @@ A full copy of the license can be found [on GitHub](https://github.com/allenai/o
   url={https://arxiv.org/abs/2502.18443},
 }
 ```
+
+For olmOCR v2 Unit Testing Rewards with RL:
+```bibtex
+@misc{olmocr2,
+  title={olmOCR 2: Unit Test Rewards for Document OCR},
+  author={Jake Poznanski and Luca Soldaini and Kyle Lo},
+  year={2025},
+  eprint={2510.19817},
+  archivePrefix={arXiv},
+  primaryClass={cs.CV},
+  url={https://arxiv.org/abs/2510.19817},
+}
+```
\ No newline at end of file