1501 Commits

Author SHA1 Message Date
Jake Poznanski
656dbef833 Frontier configs 2025-06-30 17:43:30 +00:00
Jake Poznanski
e2f2d36e4f More typos 2025-06-30 17:41:19 +00:00
Jake Poznanski
ea72ea2645 Ugh stupid fix 2025-06-30 17:40:19 +00:00
Jake Poznanski
55a737ca6b script 2025-06-30 17:32:01 +00:00
Jake Poznanski
ba49fd53d9 frontier train script let's see what happens 2025-06-30 17:30:17 +00:00
Jake Poznanski
bde6f2955e Bf16 only 2025-06-30 17:25:53 +00:00
Jake Poznanski
44dd966850 Wandb fixes 2025-06-30 17:23:47 +00:00
Jake Poznanski
f8071c7457 Loss config 2025-06-30 17:17:48 +00:00
Jake Poznanski
a3997419b3 Naming config entries better 2025-06-30 17:15:58 +00:00
Jake Poznanski
44cba7911b Bench katex files distributed in installation package now 2025-06-30 17:07:38 +00:00
Jake Poznanski
f09b9ff142
Merge pull request #264 from boyuan99/fix-installation-command-typo
Fix typo in pip install command for GPU setup
2025-06-30 09:52:27 -07:00
Jake Poznanski
8e5e18f54c Checking that anchor text works for each pdf page when initializing dataloader 2025-06-30 16:29:33 +00:00
Bo Yuan
333f029ffb Fix typo in pip install command for GPU setup
Remove incorrect period before [gpu] in pip install command.
The correct syntax is 'olmocr[gpu]' not 'olmocr.[gpu]'.
2025-06-30 01:06:21 -05:00
Jake Poznanski
dc7fff5bf7 Collator fix 2025-06-29 19:52:53 +00:00
Jake Poznanski
12b5cc3101 Lowering size of default data load for testing 2025-06-28 23:09:44 +00:00
Jake Poznanski
c36b5df2af Cleanup collator 2025-06-28 22:46:12 +00:00
Jake Poznanski
887190e961 Cleanup 2025-06-27 21:54:31 +00:00
Jake Poznanski
330f465d5d Small fixes 2025-06-27 21:53:06 +00:00
Jake Poznanski
214c44df36 Reporting to wandb, better eval dataset loading 2025-06-27 21:16:22 +00:00
Jake Poznanski
600d967fe6 Config changes 2025-06-27 19:55:04 +00:00
Jake Poznanski
850b598db1 Sdpa 2025-06-27 16:59:33 +00:00
Jake Poznanski
573219d246 Lint 2025-06-27 16:43:50 +00:00
Jake Poznanski
eab7492e60 Forwarding -tp and -dp options 2025-06-27 16:40:47 +00:00
Jake Poznanski
14b9b2dc8f Fix for rocm vllm 2025-06-27 16:23:31 +00:00
Jake Poznanski
b96454b786 Merge branch 'main' into jakep/new_trainer 2025-06-27 16:21:58 +00:00
Jake Poznanski
58e4fadfc0 torchvision requirement 2025-06-27 16:16:19 +00:00
Jake Poznanski
1451dd1395 weka 2025-06-27 02:57:26 +00:00
Jake Poznanski
680377c93f Example config 2025-06-26 23:32:50 +00:00
Jake Poznanski
dee3730231 Gantry stuff 2025-06-26 18:34:53 +00:00
Jake Poznanski
0d7836b111 Basic attempt to run trainer script 2025-06-25 23:22:59 +00:00
Jake Poznanski
d7e5037192 New trainer launch script cleanups 2025-06-25 23:05:32 +00:00
Jake Poznanski
91e7b5ce3f Claude generated train script 2025-06-24 22:56:35 +00:00
Jake Poznanski
0ebc35cf1f Basic train config loader for datasets 2025-06-24 22:48:36 +00:00
Jake Poznanski
b93c262dca Prepping new config stuff 2025-06-24 22:40:50 +00:00
Jake Poznanski
633b03d1da Merge branch 'main' of https://github.com/allenai/olmocr 2025-06-24 22:06:02 +00:00
Jake Poznanski
67e9ec873f Removing unused file 2025-06-24 22:06:01 +00:00
Aman Rangapur
1df93d0ddf
Merge pull request #257 from allenai/amanr/nanonets_ocr
Added Nanonets OCR bench results
2025-06-23 16:35:57 -07:00
aman-17
202e22932e addressed Jake's comment for pagenumbers with \d+ 2025-06-23 23:29:10 +00:00
aman-17
9d04b30ea4 added nanonets 2025-06-23 22:04:47 +00:00
Jake Poznanski
24a2f9b0a4 Bump version to v0.1.76 for release v0.1.76 2025-06-23 21:54:15 +00:00
Jake Poznanski
cd93ca5927 Version bump 2025-06-23 21:54:06 +00:00
Aman Rangapur
ecce181ab9
Merge pull request #256 from allenai/jakep/dockerfix
Cleanup for docker file
2025-06-23 14:47:36 -07:00
Jake Poznanski
e45e871dc9 Cleanup for docker file 2025-06-23 20:05:33 +00:00
Jake Poznanski
0c6d1990dc Update README.md 2025-06-17 14:45:11 -07:00
Jake Poznanski
ec5c5b6444 Updating pareto plots 2025-06-17 21:41:23 +00:00
Jake Poznanski
6c51829ae6 Some helper scripts 2025-06-17 21:21:50 +00:00
Jake Poznanski
626952a786 Adding news 2025-06-17 20:06:33 +00:00
Jake Poznanski
9d260791a0 README updates 2025-06-17 19:58:06 +00:00
Jake Poznanski
69524cb305 Updating bench readme 2025-06-17 19:55:17 +00:00
Jake Poznanski
069a99ea5f Bump version to v0.1.75 for release v0.1.75 2025-06-17 19:07:48 +00:00