1735 Commits

Author SHA1 Message Date
Jake Poznanski
aa239eb34c Lints 2025-10-13 21:15:19 +00:00
Jake Poznanski
369fd4d23a Adjusting some things 2025-10-13 21:14:53 +00:00
Jake Poznanski
9480508642 Mineru 2025-10-13 20:47:52 +00:00
Jake Poznanski
417fbed4ad Fix 2025-10-13 19:46:27 +00:00
Jake Poznanski
7d6db61446 Mineru runner 2025-10-13 19:43:39 +00:00
Jake Poznanski
7487e3673a More graceful tar extraction 2025-10-13 17:27:45 +00:00
Jake Poznanski
5b81bc61c6 Filtering downloads 2025-10-13 17:22:57 +00:00
Jake Poznanski
b86e3071da More bench results 2025-10-13 16:37:08 +00:00
Jake Poznanski
62faa003d3 Fix for some corrupted data 2025-10-10 22:34:32 +00:00
Jake Poznanski
fc4934c9b4 URL packaging 2025-10-10 16:52:42 +00:00
Jake Poznanski
87a2b8a9a3 More lint fixes 2025-10-09 22:16:46 +00:00
Jake Poznanski
875337f962 Lints 2025-10-09 22:12:19 +00:00
Jake Poznanski
702c42f8e7 Packaging working better now 2025-10-09 22:12:02 +00:00
Jake Poznanski
557bb9a5e9 Repackager is still not working right 2025-10-09 22:01:01 +00:00
Jake Poznanski
4c21e15d0e Packaging and repackaging test works 2025-10-09 21:52:05 +00:00
Jake Poznanski
9f4a2d4177 Tests 2025-10-09 21:42:32 +00:00
Jake Poznanski
35fc9ca025 Testing the packager 2025-10-09 21:30:38 +00:00
Jake Poznanski
74eb910b95 Now you can just run pytest . cleanly 2025-10-09 20:31:28 +00:00
Jake Poznanski
f01f7183e4 Test fixes 2025-10-09 20:28:29 +00:00
Jake Poznanski
bc8c044dd4 Preparing olmocr mix packaging scripts 2025-10-09 20:14:43 +00:00
Jake Poznanski
743e48361c New claude sonnet, going to add multilingual tests to olmocr bench 1025 internal version 2025-10-09 19:43:22 +00:00
Jake Poznanski
da4ada33a0 Adding miner for multilingual documents 2025-10-09 18:26:40 +00:00
Jake Poznanski
95dd21b66c GRPO Documentation 2025-10-07 20:40:10 +00:00
Jake Poznanski
1f791c4a19 Changes 2025-10-07 18:29:08 +00:00
Jake Poznanski
727b345715 Merge fix 2025-10-07 18:16:31 +00:00
Jake Poznanski
8ef68fde88 Merge branch 'main' into jakep/new_data 2025-10-07 17:44:54 +00:00
Jake Poznanski
e15615aadb Model defaults 2025-10-07 17:10:45 +00:00
Jake Poznanski
b81e40602d Readme score fixes 2025-10-06 22:59:00 +00:00
Jake Poznanski
2e3d1a0317 Committing test script to be used in model cards for individual one-off inference 2025-10-06 22:47:06 +00:00
Jake Poznanski
c89787183a Bump version to v0.3.8 for release v0.3.8 2025-10-06 21:46:18 +00:00
Jake Poznanski
e12941a608 Version bump 2025-10-06 21:46:10 +00:00
Jake Poznanski
7fe756fe63 Formatting 2025-10-06 21:10:32 +00:00
Jake Poznanski
9c7c670f1f Bump version to v0.3.7 for release v0.3.7 2025-10-06 21:10:07 +00:00
Jake Poznanski
1951a849ec Version bump with new vllm 2025-10-06 21:10:00 +00:00
Jake Poznanski
c75f5b98a1 Cleaning up PR 341 arguments to match vllm 0.11, which only has the V1 engine and thus always does chunked prefill; also fixes arg syntax 2025-10-06 20:26:41 +00:00
Jake Poznanski
e202c22822 Merge branch 'vllm_0_11' 2025-10-06 20:24:26 +00:00
Jake Poznanski
2b70b50312 Merge pull request #341 from charitarthchugh/charitarthchugh/vllm-defaults-speedup: Add chunked prefill and limit mm per prompt options 2025-10-06 13:23:47 -07:00
Jake Poznanski
81be6f5c1f Transformers version 2025-10-06 19:52:55 +00:00
Jake Poznanski
9b517a02be Git lfs in docker image 2025-10-06 19:47:19 +00:00
Jake Poznanski
9feb41af82 New docker file approach for vllm 0.11 2025-10-06 18:57:16 +00:00
Jake Poznanski
59266ed419 More readmes 2025-10-01 22:27:37 +00:00
Jake Poznanski
476ba212dc Bolds 2025-10-01 22:05:58 +00:00
Jake Poznanski
bb7790a138 Bolds 2025-10-01 22:05:30 +00:00
Jake Poznanski
4e68b174bf New bench scores added 2025-10-01 22:04:40 +00:00
Jake Poznanski
8ef7f8085a isort and black 2025-09-30 17:37:10 +00:00
Jake Poznanski
b5b1de98dd Allowing more max tokens in pipeline for new models 2025-09-29 22:12:27 +00:00
Jake Poznanski
f4356de091 deepinfra readme improved 2025-09-29 17:56:03 +00:00
Jake Poznanski
8982bae756 Bump version to v0.3.6 for release v0.3.6 2025-09-29 17:37:25 +00:00
Jake Poznanski
fb1ef9e38a Release script fix 2025-09-29 17:37:14 +00:00
Jake Poznanski
c587eb9050 Ugh, release script adds all files by default 2025-09-29 17:36:41 +00:00