1735 Commits

Author SHA1 Message Date
Jake Poznanski
a0bc5a4690 Deepinfra readme 2025-09-29 17:29:28 +00:00
Jake Poznanski
0c6d889863 Adding retry code on 429 errors from exteranl providers 2025-09-29 17:26:22 +00:00
Jake Poznanski
9c750903ce Ignore files 2025-09-29 17:06:14 +00:00
Jake Poznanski
f0caa188ab
Merge pull request #344 from allenai/amanr/deepinfra
DeepInfra Support
2025-09-29 10:03:29 -07:00
Jake Poznanski
7f4b728dcd Skip docker checks if using beaker image 2025-09-26 23:27:04 +00:00
aman-17
f3c4073395 added Api_key argument to pipeline pytests 2025-09-26 14:25:25 -07:00
aman-17
359abef654 updated pytests 2025-09-26 14:19:22 -07:00
Jake Poznanski
1099820bf6 Hmm, going to have to train a filtered version 2025-09-26 21:15:15 +00:00
aman-17
556ff26d58 fixed lint, style, ruff 2025-09-26 14:08:40 -07:00
aman-17
e7ae5e6240 fixed style 2025-09-26 13:58:34 -07:00
aman-17
90589e16de Added deepinfra usage to readme 2025-09-26 13:56:34 -07:00
aman-17
2a5792e5ed add if else for vllm local usage bug for api argument 2025-09-26 13:29:48 -07:00
Jake Poznanski
bb06829840 SOuping in fp32 2025-09-26 20:03:29 +00:00
aman-17
7fe3f65de7 added support for deepinfra 2025-09-26 11:06:51 -07:00
Charitarth Chugh
fe425fde20
Add chunked prefill and limit mm per prompt options 2025-09-25 14:29:49 -04:00
Jake Poznanski
3d6e6a6a01 Setting defaults, adding in seed 2025-09-24 17:37:12 +00:00
Jake Poznanski
1654c760d9 Fixing grpo eos reward 2025-09-23 20:27:07 +00:00
Jake Poznanski
b4b121b118 Testing reward eos 2025-09-23 20:10:11 +00:00
Jake Poznanski
01bc1ff7b6 Allow passing in beaker image to run benchmark 2025-09-23 19:48:42 +00:00
Jake Poznanski
a00d9d172e Adding stricter math and table tests when in synthetic mode 2025-09-23 18:37:50 +00:00
Jake Poznanski
1197c35808 Mix contamination checker script 2025-09-23 18:17:13 +00:00
Jake Poznanski
9818797fbc One last try to disable timeouts in katex thing 2025-09-23 03:26:45 +00:00
Jake Poznanski
022d87fb7a Another attempt at playwright rendering fix for long RL jobs 2025-09-22 20:47:58 +00:00
Jake Poznanski
97db9cdfb1 Hmm 2025-09-22 18:46:55 +00:00
Jake Poznanski
4e77ecac3f Ok, should have a cleaner playwright story on multithreaded environments 2025-09-22 18:14:30 +00:00
Jake Poznanski
1a64728420 Checking to see if specifying number of dataloader workers helps thread issue 2025-09-22 17:08:06 +00:00
Jake Poznanski
3b20322eb0 Adding some more debug stuff to try to figure out playwright issue 2025-09-22 16:03:49 +00:00
Jake Poznanski
780bc7d934 Better context manager and cleanup of old browser instances 2025-09-21 23:38:09 +00:00
Jake Poznanski
7e786c79c5 Save total limit 2025-09-20 19:52:34 +00:00
Jake Poznanski
83d965c768 Adding contam check for olmocr-bench when making synth data 2025-09-20 03:42:42 +00:00
Jake Poznanski
a8ad6c12b5 Convert fix 2025-09-19 17:36:34 +00:00
Jake Poznanski
b1242db8e2 last threading fix 2025-09-19 17:23:12 +00:00
Jake Poznanski
0a74746da9 Ugh 2025-09-19 17:20:28 +00:00
Jake Poznanski
f114a7fbec Crahses 2025-09-19 17:19:12 +00:00
Jake Poznanski
b52ac23073 Fixing non async threading 2025-09-19 17:12:22 +00:00
Jake Poznanski
e607b53748 Keeps getting killed 2025-09-19 17:02:02 +00:00
Jake Poznanski
cedd4a80cf Fixing paddle ocr to run fast 2025-09-19 16:57:05 +00:00
Jake Poznanski
8f75ea062e testing paddle 2025-09-19 16:55:01 +00:00
Jake Poznanski
e9ab2fd1bb Adding paddlepaddle v5 runner for benchmarking 2025-09-19 16:45:53 +00:00
Jake Poznanski
1c703917df Synthmix ignore 2025-09-19 15:58:58 +00:00
Jake Poznanski
4907b1c700 Bumping rotation augment a tad 2025-09-18 20:38:02 +00:00
Jake Poznanski
a42d8199cd Adding 1025 mix dataset, should be ready for final run 2025-09-18 19:50:40 +00:00
Jake Poznanski
1ac72ad169 Adding some scripts to clean data 2025-09-18 19:44:30 +00:00
Jake Poznanski
30750f77c1 Ok, rendering smaller version of the page, since this is the max suppored by claude and it would get rescaled anyways 2025-09-17 19:51:03 +00:00
Jake Poznanski
a60c84ed14 Maybe better scaling with no losing of text 2025-09-16 22:01:49 +00:00
Jake Poznanski
52df81873a Glob path fixes 2025-09-16 21:42:46 +00:00
Jake Poznanski
3b729d6770 Oops fixing random gen things 2025-09-16 21:17:12 +00:00
Jake Poznanski
2fa67a980e More reliable test gen 2025-09-16 19:57:00 +00:00
Jake Poznanski
e3e09c04db Synth data fixups 2025-09-16 18:47:54 +00:00
Jake Poznanski
2400744673 More generic model name loading 2025-09-11 17:44:24 +00:00