23 Commits

Author SHA1 Message Date
Jake Poznanski
ad33672781 fix 2025-08-25 21:04:53 +00:00
Jake Poznanski
c7aa217281 Scripts to run benchmarks better 2025-08-25 20:12:10 +00:00
Jake Poznanski
2fca448105 Using new budget code 2025-08-06 16:31:08 +00:00
Jake Poznanski
43ae28dde4 Prepare checkpoint works for older models too 2025-07-14 21:30:32 +00:00
Jake Poznanski
43c94fea58 Benchmark update 2025-06-12 20:47:58 +00:00
Jake Poznanski
b1e064f8a6 Run benchmark script will also start a job to convert 10k docs from olmocr-mix to check performance 2025-06-12 20:27:50 +00:00
Jake Poznanski
25dfe0b831 Weird glibc error 2025-06-06 18:53:52 +00:00
Jake Poznanski
9539eab840 AWS creds fix 2025-06-06 17:45:17 +00:00
Jake Poznanski
e0fda1a77d Passing aws creds to benchmark so we can run custom models stored in s3 2025-06-05 17:40:14 +00:00
Jake Poznanski
ecf0d48a28 Don't allow uncommitted changes 2025-06-05 17:22:12 +00:00
Jake Poznanski
134bba9fcd Run benchmark adjustments 2025-06-05 17:21:06 +00:00
Jake Poznanski
7009a7a9d9 Trying out FP8 compression 2025-06-05 17:18:20 +00:00
Jake Poznanski
8d92620d3c Merge remote-tracking branch 'origin/main' into retry_improvements 2025-05-29 20:33:45 +00:00
Jake Poznanski
cd5b524d20 Some benchmark cleanup 2025-05-29 20:32:25 +00:00
Jake Poznanski
022be37723 Some better info strings in benchmark runner 2025-05-29 18:43:27 +00:00
Jake Poznanski
01c4a561d3 Script fixes 2025-05-29 17:58:11 +00:00
Jake Poznanski
129412cdb0 Git lfs for more reliable downloads 2025-05-29 17:38:00 +00:00
Jake Poznanski
45e0ae59dc omg 2025-05-29 17:21:58 +00:00
Jake Poznanski
15e0064212 More fixes 2025-05-29 17:20:32 +00:00
Jake Poznanski
e8e6b6cb17 More fixes 2025-05-29 17:19:36 +00:00
Jake Poznanski
06988ac533 Image fixes 2025-05-29 17:18:12 +00:00
Jake Poznanski
ff31faebe4 Runner improvements 2025-05-29 17:12:41 +00:00
Jake Poznanski
475cc1c3a4 Working on runner script 2025-05-29 17:08:05 +00:00