Jake Poznanski
|
a0bc5a4690
|
Deepinfra readme
|
2025-09-29 17:29:28 +00:00 |
|
Jake Poznanski
|
0c6d889863
|
Adding retry code on 429 errors from exteranl providers
|
2025-09-29 17:26:22 +00:00 |
|
Jake Poznanski
|
9c750903ce
|
Ignore files
|
2025-09-29 17:06:14 +00:00 |
|
Jake Poznanski
|
f0caa188ab
|
Merge pull request #344 from allenai/amanr/deepinfra
DeepInfra Support
|
2025-09-29 10:03:29 -07:00 |
|
Jake Poznanski
|
7f4b728dcd
|
Skip docker checks if using beaker image
|
2025-09-26 23:27:04 +00:00 |
|
aman-17
|
f3c4073395
|
added Api_key argument to pipeline pytests
|
2025-09-26 14:25:25 -07:00 |
|
aman-17
|
359abef654
|
updated pytests
|
2025-09-26 14:19:22 -07:00 |
|
Jake Poznanski
|
1099820bf6
|
Hmm, going to have to train a filtered version
|
2025-09-26 21:15:15 +00:00 |
|
aman-17
|
556ff26d58
|
fixed lint, style, ruff
|
2025-09-26 14:08:40 -07:00 |
|
aman-17
|
e7ae5e6240
|
fixed style
|
2025-09-26 13:58:34 -07:00 |
|
aman-17
|
90589e16de
|
Added deepinfra usage to readme
|
2025-09-26 13:56:34 -07:00 |
|
aman-17
|
2a5792e5ed
|
add if else for vllm local usage bug for api argument
|
2025-09-26 13:29:48 -07:00 |
|
Jake Poznanski
|
bb06829840
|
SOuping in fp32
|
2025-09-26 20:03:29 +00:00 |
|
aman-17
|
7fe3f65de7
|
added support for deepinfra
|
2025-09-26 11:06:51 -07:00 |
|
Charitarth Chugh
|
fe425fde20
|
Add chunked prefill and limit mm per prompt options
|
2025-09-25 14:29:49 -04:00 |
|
Jake Poznanski
|
3d6e6a6a01
|
Setting defaults, adding in seed
|
2025-09-24 17:37:12 +00:00 |
|
Jake Poznanski
|
1654c760d9
|
Fixing grpo eos reward
|
2025-09-23 20:27:07 +00:00 |
|
Jake Poznanski
|
b4b121b118
|
Testing reward eos
|
2025-09-23 20:10:11 +00:00 |
|
Jake Poznanski
|
01bc1ff7b6
|
Allow passing in beaker image to run benchmark
|
2025-09-23 19:48:42 +00:00 |
|
Jake Poznanski
|
a00d9d172e
|
Adding stricter math and table tests when in synthetic mode
|
2025-09-23 18:37:50 +00:00 |
|
Jake Poznanski
|
1197c35808
|
Mix contamination checker script
|
2025-09-23 18:17:13 +00:00 |
|
Jake Poznanski
|
9818797fbc
|
One last try to disable timeouts in katex thing
|
2025-09-23 03:26:45 +00:00 |
|
Jake Poznanski
|
022d87fb7a
|
Another attempt at playwright rendering fix for long RL jobs
|
2025-09-22 20:47:58 +00:00 |
|
Jake Poznanski
|
97db9cdfb1
|
Hmm
|
2025-09-22 18:46:55 +00:00 |
|
Jake Poznanski
|
4e77ecac3f
|
Ok, should have a cleaner playwright story on multithreaded environments
|
2025-09-22 18:14:30 +00:00 |
|
Jake Poznanski
|
1a64728420
|
Checking to see if specifying number of dataloader workers helps thread issue
|
2025-09-22 17:08:06 +00:00 |
|
Jake Poznanski
|
3b20322eb0
|
Adding some more debug stuff to try to figure out playwright issue
|
2025-09-22 16:03:49 +00:00 |
|
Jake Poznanski
|
780bc7d934
|
Better context manager and cleanup of old browser instances
|
2025-09-21 23:38:09 +00:00 |
|
Jake Poznanski
|
7e786c79c5
|
Save total limit
|
2025-09-20 19:52:34 +00:00 |
|
Jake Poznanski
|
83d965c768
|
Adding contam check for olmocr-bench when making synth data
|
2025-09-20 03:42:42 +00:00 |
|
Jake Poznanski
|
a8ad6c12b5
|
Convert fix
|
2025-09-19 17:36:34 +00:00 |
|
Jake Poznanski
|
b1242db8e2
|
last threading fix
|
2025-09-19 17:23:12 +00:00 |
|
Jake Poznanski
|
0a74746da9
|
Ugh
|
2025-09-19 17:20:28 +00:00 |
|
Jake Poznanski
|
f114a7fbec
|
Crahses
|
2025-09-19 17:19:12 +00:00 |
|
Jake Poznanski
|
b52ac23073
|
Fixing non async threading
|
2025-09-19 17:12:22 +00:00 |
|
Jake Poznanski
|
e607b53748
|
Keeps getting killed
|
2025-09-19 17:02:02 +00:00 |
|
Jake Poznanski
|
cedd4a80cf
|
Fixing paddle ocr to run fast
|
2025-09-19 16:57:05 +00:00 |
|
Jake Poznanski
|
8f75ea062e
|
testing paddle
|
2025-09-19 16:55:01 +00:00 |
|
Jake Poznanski
|
e9ab2fd1bb
|
Adding paddlepaddle v5 runner for benchmarking
|
2025-09-19 16:45:53 +00:00 |
|
Jake Poznanski
|
1c703917df
|
Synthmix ignore
|
2025-09-19 15:58:58 +00:00 |
|
Jake Poznanski
|
4907b1c700
|
Bumping rotation augment a tad
|
2025-09-18 20:38:02 +00:00 |
|
Jake Poznanski
|
a42d8199cd
|
Adding 1025 mix dataset, should be ready for final run
|
2025-09-18 19:50:40 +00:00 |
|
Jake Poznanski
|
1ac72ad169
|
Adding some scripts to clean data
|
2025-09-18 19:44:30 +00:00 |
|
Jake Poznanski
|
30750f77c1
|
Ok, rendering smaller version of the page, since this is the max suppored by claude and it would get rescaled anyways
|
2025-09-17 19:51:03 +00:00 |
|
Jake Poznanski
|
a60c84ed14
|
Maybe better scaling with no losing of text
|
2025-09-16 22:01:49 +00:00 |
|
Jake Poznanski
|
52df81873a
|
Glob path fixes
|
2025-09-16 21:42:46 +00:00 |
|
Jake Poznanski
|
3b729d6770
|
Oops fixing random gen things
|
2025-09-16 21:17:12 +00:00 |
|
Jake Poznanski
|
2fa67a980e
|
More reliable test gen
|
2025-09-16 19:57:00 +00:00 |
|
Jake Poznanski
|
e3e09c04db
|
Synth data fixups
|
2025-09-16 18:47:54 +00:00 |
|
Jake Poznanski
|
2400744673
|
More generic model name loading
|
2025-09-11 17:44:24 +00:00 |
|