1175 Commits

Author SHA1 Message Date
Jake Poznanski
3c1cd6a504 Bump version to v0.1.73 for release v0.1.73 2025-06-17 17:07:02 +00:00
Jake Poznanski
dd1d0b561b More debug logs 2025-06-17 17:06:45 +00:00
Jake Poznanski
08ca1544bf Beaker push fix 2025-06-17 16:59:22 +00:00
Jake Poznanski
5e5c31b93e Bump version to v0.1.72 for release v0.1.72 2025-06-17 16:21:43 +00:00
Jake Poznanski
715b841596 0.1.72 2025-06-17 16:21:36 +00:00
Jake Poznanski
b03feb3356 Fixed 2025-06-17 16:10:38 +00:00
Jake Poznanski
b588ae27d2 Removing sglang tests, switch to vllm 2025-06-17 16:07:16 +00:00
Jake Poznanski
6e3fba3c59 Lints 2025-06-17 16:06:40 +00:00
Jake Poznanski
e489b28421 Lints 2025-06-17 15:58:16 +00:00
Jake Poznanski
6fcd26d66a Updating readme 2025-06-17 15:48:25 +00:00
Jake Poznanski
8c62072832 Merge remote-tracking branch 'origin/main' into jakep/vllm_perf 2025-06-17 15:25:32 +00:00
Jake Poznanski
1295e171bb Merge branch 'main' of https://github.com/allenai/olmocr 2025-06-12 22:35:09 +00:00
Jake Poznanski
37090e2801 Go back to workers 1 in marker test script 2025-06-12 22:35:08 +00:00
Jake Poznanski
f273de6e6e Update README.md: Updating to v.1.7.5 marker that I ran locally with base only for now 2025-06-12 15:32:09 -07:00
Jake Poznanski
af02f15f24 Merge pull request #236 from VikParuchuri/main: Fix marker benchmarks 2025-06-12 15:24:17 -07:00
Jake Poznanski
3da6e2d587 Pareto plot update, keep cost the same for now 2025-06-12 22:23:41 +00:00
Jake Poznanski
fcd8bbec92 Install aws cli 2025-06-12 21:38:28 +00:00
Jake Poznanski
fc06797bec aws cli 2025-06-12 21:29:39 +00:00
Jake Poznanski
59e0a1ccb0 Marker wants newer torchvision 2025-06-12 21:23:53 +00:00
Jake Poznanski
0f3b45c1a3 Add time 2025-06-12 21:19:17 +00:00
Jake Poznanski
4bfcfce767 Actually install the right thing 2025-06-12 21:18:58 +00:00
Jake Poznanski
548187902b Ignore 2025-06-12 21:14:00 +00:00
Jake Poznanski
f8dfd85765 Script 2025-06-12 21:13:31 +00:00
Jake Poznanski
044874a634 Adding marker benchmark 2025-06-12 21:12:58 +00:00
Jake Poznanski
9787d007b9 Pulling in bigger benchmark script from vllm branch to main 2025-06-12 21:02:46 +00:00
Jake Poznanski
af7aaef605 Run marker script 2025-06-12 20:07:17 +00:00
Jake Poznanski
cbc4580b72 Fixing #240 2025-06-12 17:21:21 +00:00
aman-17
3eda2c04c1 updated vllm to 0.9.1 2025-06-10 16:14:57 -07:00
Jake Poznanski
a83a0da65f Cleanup of vllm perf branch with @amanr 2025-06-10 21:56:05 +00:00
aman-17
316d0af1cd added dtype functionality 2025-06-06 16:19:40 -07:00
aman-17
c8a5361d1b fixing packages of 22.04 2025-06-06 13:50:12 -07:00
aman-17
c5d075c63a fixed apt_pkg module 2025-06-06 13:48:48 -07:00
aman-17
08fd82f323 made changes wrt ubuntu 22.04 2025-06-06 13:41:10 -07:00
aman-17
6507a657be updated ubuntu to 22.04 for glibc 2.32 2025-06-06 13:29:51 -07:00
Jake Poznanski
25dfe0b831 Weird glibc error 2025-06-06 18:53:52 +00:00
Jake Poznanski
0257444720 Ok, cleaner retry pattern for model downloading 2025-06-06 18:52:01 +00:00
Vik Paruchuri
267f52bd79 Update marker cost 2025-06-06 13:53:50 -04:00
Jake Poznanski
9539eab840 AWS creds fix 2025-06-06 17:45:17 +00:00
Jake Poznanski
e0fda1a77d Passing aws creds to benchmark so we can run custom models stored in s3 2025-06-05 17:40:14 +00:00
Jake Poznanski
ecf0d48a28 Don't allow uncommitted changes 2025-06-05 17:22:12 +00:00
Jake Poznanski
134bba9fcd Run benchmark adjustments 2025-06-05 17:21:06 +00:00
Jake Poznanski
7009a7a9d9 Trying out FP8 compression 2025-06-05 17:18:20 +00:00
Jake Poznanski
9ffbe8df46 Adding quick stats percentage done check 2025-06-05 15:58:19 +00:00
Vik Paruchuri
f21ff08c2f Fix marker benchmarks 2025-06-04 23:10:14 -07:00
Jake Poznanski
aad8428dc3 Reverting custom pipeline image 2025-06-02 23:05:48 +00:00
Jake Poznanski
5c52e016e6 Include cuda 12.8 2025-06-02 22:52:28 +00:00
Jake Poznanski
5c524b53ac Cleaning up stats reporting 2025-06-02 21:40:14 +00:00
Jake Poznanski
916f0cb919 Trying with flash infer installed 2025-06-02 21:23:04 +00:00
Jake Poznanski
2ccef7d760 Ugh, this code is bad 2025-06-02 21:22:25 +00:00
Jake Poznanski
2f1957b401 Performance fixes with vllm backend 2025-06-02 21:10:30 +00:00