567 Commits

Author SHA1 Message Date
Jake Poznanski
d36e556f19 Hopefully fixes build 2025-01-30 13:11:37 -08:00
Jake Poznanski
c69e0d6762 More cleanup, removing dead adv anchor code 2025-01-30 12:58:11 -08:00
Jake Poznanski
d4d711d12a Nicer glob handling for pipeline.py 2025-01-30 12:48:10 -08:00
Jake Poznanski
84477b50f4 More formatting 2025-01-30 10:54:21 -08:00
Jake Poznanski
e3d04ee79f Merge branch 'main' of https://github.com/allenai/olmocr into main 2025-01-30 10:53:40 -08:00
Jake Poznanski
c37e545d25 running isort again 2025-01-30 10:53:35 -08:00
Jake Poznanski
358a24f6cb Update README.md 2025-01-30 10:33:54 -08:00
Jake Poznanski
c58e13392b Update README.md 2025-01-30 10:28:57 -08:00
Jake Poznanski
2c2953329e Fixing most ruff errors 2025-01-29 15:57:26 -08:00
Jake Poznanski
56903774b7 Ruff 2025-01-29 15:47:57 -08:00
Jake Poznanski
fb402297ce Isort and black update 2025-01-29 15:42:34 -08:00
Jake Poznanski
cdb10a951b Python 3.11 2025-01-29 15:33:11 -08:00
Jake Poznanski
dcaca8aa90 Black formatting 2025-01-29 15:30:39 -08:00
Jake Poznanski
4a1762d455 isort 2025-01-29 15:25:10 -08:00
Jake Poznanski
0628d3161f Some unit test cleanup 2025-01-29 15:15:10 -08:00
Jake Poznanski
7d2403da52 More infos 2025-01-29 14:25:15 -08:00
Jake Poznanski
8dd006d806 Merge branch 'main' of https://github.com/allenai/olmocr into main 2025-01-29 14:12:41 -08:00
Jake Poznanski
04615d7f0a More logging on sglang server 2025-01-29 14:12:39 -08:00
Jake Poznanski
d9f5b7245f Merge branch 'main' of https://github.com/allenai/olmocr 2025-01-29 22:03:39 +00:00
Jake Poznanski
962126e987 Typo 2025-01-29 22:03:37 +00:00
Jake Poznanski
c7e56e7bff Update README.md 2025-01-29 14:01:18 -08:00
Jake Poznanski
6369b1f10c Merge branch 'main' of https://github.com/allenai/olmocr 2025-01-29 21:49:19 +00:00
Jake Poznanski
17a5dfe0d0 Add gpu message 2025-01-29 21:48:56 +00:00
Jake Poznanski
7e4fb68869 Update README.md 2025-01-29 13:37:13 -08:00
Jake Poznanski
0ccb99c9dd readme 2025-01-29 13:36:04 -08:00
Jake Poznanski
2e4ef9522b Readme 2025-01-29 13:35:05 -08:00
Jake Poznanski
21925050c2 Update README.md 2025-01-29 11:53:40 -08:00
Jake Poznanski
9a1be7e6c2 Readme 2025-01-29 11:52:24 -08:00
Jake Poznanski
496e162712 Update README.md 2025-01-29 11:51:30 -08:00
Jake Poznanski
b574766977 Viewer and gitignore 2025-01-29 11:46:46 -08:00
Jake Poznanski
86267d865f Viewer cleanup 2025-01-29 11:38:53 -08:00
Jake Poznanski
a243c8923d Update README.md 2025-01-29 11:28:38 -08:00
Jake Poznanski
dbf647790a viewer fix 2025-01-29 11:27:55 -08:00
Jake Poznanski
4c35105bd4 More readme improvements 2025-01-29 11:23:04 -08:00
Jake Poznanski
f16acec296 Readme improvements 2025-01-29 11:13:06 -08:00
Jake Poznanski
dee494ad7b Local file stuff 2025-01-28 15:12:28 -08:00
Jake Poznanski
7882944e88 Local pdf support 2025-01-28 15:03:31 -08:00
Jake Poznanski
dbe54871db Support stats feature later 2025-01-28 14:29:46 -08:00
Jake Poznanski
48447b616c Can use remote s3 files, and local workspace now 2025-01-28 14:28:19 -08:00
Jake Poznanski
50f9a6adb5 Name refactor 2025-01-28 14:16:53 -08:00
Jake Poznanski
e0afb935fa Better check for separate sglang installation step 2025-01-28 13:56:00 -08:00
Jake Poznanski
00e3aac058 Inference test for qwen2 and 2.5, work queue fixes, build current still broken 2025-01-27 15:58:48 -08:00
Jake Poznanski
4d0d9246b4 Merge branch 'main' of https://github.com/allenai/olmocr 2025-01-27 21:11:43 +00:00
Jake Poznanski
b28aad61bb More test docs 2025-01-27 21:11:23 +00:00
Jake Poznanski
43c3caa559 Merge branch 'main' of https://github.com/allenai/olmocr into main 2025-01-27 12:59:51 -08:00
Jake Poznanski
4baba92c29 Remove some todos 2025-01-27 12:59:43 -08:00
Jake Poznanski
96ae2dd49b Refactoring 2025-01-27 20:45:28 +00:00
Jake Poznanski
c6062677aa Cleaning up some unused code 2025-01-27 18:48:15 +00:00
Jake Poznanski
d8c13d05f6 Readmes and version updates 2025-01-27 18:41:13 +00:00
Jake Poznanski
b2894d0280 Massive refactor from pdelfin to olmocr 2025-01-27 18:30:41 +00:00