1490 Commits

Author SHA1 Message Date
Jake Poznanski
6be12c2e06 Baseline tests for blanks 2025-08-25 22:01:24 +00:00
Jake Poznanski
ad33672781 fix 2025-08-25 21:04:53 +00:00
Jake Poznanski
c7aa217281 Scripts to run benchmarks better 2025-08-25 20:12:10 +00:00
Jake Poznanski
59321af018
Merge pull request #319 from haydn-jones/main
External vLLM instance
2025-08-25 08:43:25 -07:00
Haydn Jones
2c63836648 Black and mock 2025-08-23 20:07:05 -04:00
Haydn Jones
261c722f56 Update README + arg name 2025-08-21 17:49:07 -04:00
Haydn Jones
b34c3611e1 oopsy woopsy 2025-08-20 19:22:48 -04:00
Haydn Jones
b8a2b92174 External vLLM 2025-08-20 19:21:38 -04:00
Jake Poznanski
4dbf951f45
Merge pull request #313 from tongliang11/patch-1
Fix typo in README.md
2025-08-19 10:40:40 -07:00
Tong Liang
cb4f23dc0c
Fix typo in README.md 2025-08-16 21:48:07 -04:00
Jake Poznanski
c492615355 Bump version to v0.3.3 for release v0.3.3 2025-08-15 19:45:17 +00:00
Jake Poznanski
cee12ccc9f New version 2025-08-15 19:45:07 +00:00
Jake Poznanski
76405b53db Lints 2025-08-15 19:44:47 +00:00
Jake Poznanski
69c33abfcc Trying to keep queue loaded more 2025-08-15 18:44:45 +00:00
Jake Poznanski
7c98673972 Pipeline fixes for OMP_NUM_THREADS 2025-08-15 18:30:00 +00:00
Jake Poznanski
b9238b8638 Fix for floaty amount 2025-08-14 22:27:26 +00:00
Jake Poznanski
618777c17e Bump version to v0.3.2 for release v0.3.2 2025-08-14 20:58:11 +00:00
Jake Poznanski
5532493ec8 Pipeline should be improved to limit CPU usage on page renders 2025-08-14 20:57:57 +00:00
Jake Poznanski
3a36ee239d Cleanup 2025-08-14 20:13:52 +00:00
Jake Poznanski
a863d04e6e Cleanup page rendering cpu limits 2025-08-14 20:11:26 +00:00
Jake Poznanski
0dd4fe83f4 Bump version to v0.3.1 for release v0.3.1 2025-08-14 16:52:35 +00:00
Jake Poznanski
7e8f9e43d8 New version 2025-08-14 16:50:49 +00:00
Jake Poznanski
0a8cd93c0a Better queue managmenet again 2025-08-14 16:37:11 +00:00
Jake Poznanski
38679243d7 Removing extra files 2025-08-14 16:17:59 +00:00
Jake Poznanski
dc5c45e144 Deps 2025-08-14 16:10:29 +00:00
Jake Poznanski
7b3b93589d VLLM bump 2025-08-14 16:08:45 +00:00
Jake Poznanski
4431b4886f Better tracking of semaphore release on bigger jobs 2025-08-14 16:05:21 +00:00
Jake Poznanski
4efd3f5d9e AI2 Internal budgeting 2025-08-13 22:16:18 +00:00
Jake Poznanski
9f8df232b6 Readme updates 2025-08-13 22:03:03 +00:00
Jake Poznanski
36ca700669 Bump version to v0.3.0 for release v0.3.0 2025-08-13 21:41:30 +00:00
Jake Poznanski
3e5351c028 version bump 2025-08-13 21:41:22 +00:00
Jake Poznanski
894c617ea4
Merge pull request #303 from allenai/jakep/olmocr_v03
olmOCR v.0.3.0
2025-08-13 14:39:54 -07:00
Jake Poznanski
463cef7ea2 New default model 2025-08-13 20:57:15 +00:00
Jake Poznanski
e86267a01c Making local results directory properly 2025-08-13 20:40:04 +00:00
Jake Poznanski
11302feb8c Move open cv2 import only into experimental data loader class 2025-08-13 20:28:31 +00:00
Jake Poznanski
93411a80a0 Lint fixes 2025-08-13 20:21:04 +00:00
Jake Poznanski
05330150ad New work queue code is cleaner 2025-08-13 20:20:27 +00:00
Jake Poznanski
9a8fa335ae One more scheme to try 2025-08-13 18:21:58 +00:00
Jake Poznanski
ffb0c6abc5 Adding some more quant schemes 2025-08-13 18:00:38 +00:00
Jake Poznanski
b921922f25 Cleaning up some pipeline logs 2025-08-13 17:39:02 +00:00
Jake Poznanski
332a818614 useless config 2025-08-12 17:31:19 +00:00
Jake Poznanski
b873d66dae resumable 2025-08-12 16:35:21 +00:00
Jake Poznanski
98d457c502 2epoch config fix 2025-08-11 22:21:55 +00:00
Jake Poznanski
387e7947c4 Another 2 epoch run 2025-08-06 22:39:09 +00:00
Jake Poznanski
2a3c534a84 2 epoch resumable config 2025-08-06 22:38:38 +00:00
Jake Poznanski
c7a533c945 Sorting data loader samples to maintain consistency between runs 2025-08-06 21:46:13 +00:00
Jake Poznanski
2fca448105 Using new budget code 2025-08-06 16:31:08 +00:00
Jake Poznanski
e664dc5f36 typo 2025-08-05 19:43:11 +00:00
Jake Poznanski
8b8c6bb837 Cleaning up some training requirements installation steps 2025-08-05 19:42:46 +00:00
Jake Poznanski
c9b8088bc6 Adding some preempt flags 2025-08-05 18:00:46 +00:00