94 Commits

Author SHA1 Message Date
Jake Poznanski
b626b4a1e1 Adjusting labeling task 2025-04-07 20:27:32 +00:00
Jake Poznanski
3d1925067b Removing progress bar in annotation UI 2025-04-04 21:41:36 +00:00
Jake Poznanski
caf21b9664 Lints 2025-04-04 19:45:38 +00:00
Jake Poznanski
f1188dc85d Merge branch 'main' of https://github.com/allenai/olmocr 2025-04-04 19:44:55 +00:00
Jake Poznanski
a0f8b028f8 Reporting results 2025-04-04 19:44:54 +00:00
Jake Poznanski
cc7b1131c6 Editing 2025-04-04 19:38:59 +00:00
Jake Poznanski
9338f5359f Saving pdf paths 2025-04-04 19:36:10 +00:00
Jake Poznanski
c8cc61b95f Merge pull request #163 from franzbischoff/main: Add script to convert JSONL files to Markdown format 2025-04-04 12:30:54 -07:00
Jake Poznanski
61624a37ff Fixed 2025-04-04 17:53:26 +00:00
Jake Poznanski
d299119c65 Links updated 2025-04-04 17:18:41 +00:00
Jake Poznanski
a113fd3015 Review app 2025-04-04 17:18:19 +00:00
Jake Poznanski
e8c14fc496 Saving prolific codes 2025-04-04 17:12:46 +00:00
Jake Poznanski
cd9e370c92 Tinyhosting automatically 2025-04-04 16:29:58 +00:00
Jake Poznanski
02cd002488 Step by step annotation 2025-04-04 16:19:04 +00:00
Jake Poznanski
6a0dbfc925 Adjusting buttons 2025-04-04 16:05:04 +00:00
Francisco Bischoff
c2193ddc93 Remove first line 2025-04-04 16:44:21 +01:00
Francisco Bischoff
c96143c3b1 Add script to convert JSONL files to Markdown format 2025-04-04 12:52:58 +01:00
Jake Poznanski
83ae61014c Scan dolma docs improvements for PII review 2025-04-01 20:03:15 +00:00
Jake Poznanski
bc78e0d8a0 Adding feedback 2025-04-01 18:35:04 +00:00
Jake Poznanski
213252f048 A few improvements to the dolma doc viewer script 2025-04-01 18:25:40 +00:00
Jake Poznanski
d45c0323a4 Better equation rendering checker with more tests. 2025-03-26 18:49:48 +00:00
Jake Poznanski
b8e3034847 Trying a change to the render script 2025-03-26 18:26:06 +00:00
Jake Poznanski
f5d92bdb14 Trying to get new CI to work 2025-03-14 02:43:55 +00:00
Chris Wilhelm
9b958e65f1 moves what happens where around a bit and updates readme 2025-03-13 15:31:55 -07:00
Chris Wilhelm
098b01c006 wire it up into a gh action 2025-03-13 15:31:55 -07:00
Chris Wilhelm
7e8492059c wip 2025-03-13 15:31:55 -07:00
Chris Wilhelm
29b9054749 basic docker image and test 2025-03-13 15:31:55 -07:00
Jake Poznanski
abeaf028fd Docker file builds faster now 2025-03-05 19:37:09 +00:00
Jake Poznanski
dc7cb5c8b5 Ruff fixes to CI 2025-03-03 15:56:39 -08:00
kyleclo
25df26fefd readme 2025-02-28 10:12:07 -08:00
kyleclo
7e434d8466 Merge branch 'main' into kylel/elo 2025-02-28 10:06:40 -08:00
aman-17
0130a970c2 fixed style 2025-02-25 08:57:02 -08:00
Jake Poznanski
e4f9b1962f Infinigram counting script for paper 2025-02-18 19:01:17 +00:00
Jake Poznanski
602012267e Match script 2025-02-18 17:53:46 +00:00
Jake Poznanski
b871e4b425 Small helper to measure overlap 2025-02-18 17:14:56 +00:00
Jake Poznanski
58db354532 Fixing release script 2025-02-14 22:57:43 +00:00
kyleclo
86b17d0ea3 add boxplot drawing 2025-02-13 19:38:09 -08:00
kyleclo
a790ba73ee update args; include output 2025-02-13 17:06:36 -08:00
kyleclo
88c18b3afa human eval data; elo ratings script; dependencies 2025-02-13 16:59:09 -08:00
Jake Poznanski
04844b3f87 More beaker and docker fixes 2025-01-30 22:14:57 +00:00
Jake Poznanski
c69e0d6762 More cleanup, removing dead adv anchor code 2025-01-30 12:58:11 -08:00
Jake Poznanski
dcaca8aa90 Black formatting 2025-01-29 15:30:39 -08:00
Jake Poznanski
4a1762d455 isort 2025-01-29 15:25:10 -08:00
Jake Poznanski
b2894d0280 Massive refactor from pdelfin to olmocr 2025-01-27 18:30:41 +00:00
Jake Poznanski
5b429ad100 Higher lr for molmo, fixed evals 2025-01-24 23:15:35 +00:00
Jake Poznanski
d0eea81c00 Dealing with issue with molmo unused params 2025-01-24 16:27:42 +00:00
Jake Poznanski
ef4167dc45 Test set script 2025-01-14 19:36:18 +00:00
Jake Poznanski
cff97990bf Moving to official sglang release 2024-11-22 19:37:31 +00:00
Jake Poznanski
9e2e09bd06 More fixes 2024-11-18 15:04:50 -08:00
Jake Poznanski
8e16780b82 Beaker stuff 2024-11-14 08:49:12 -08:00