This website requires JavaScript.
Explore
Help
Register
Sign In
yujunjun
/
olmocr
Watch
1
Star
0
Fork
0
You've already forked olmocr
mirror of
https://github.com/allenai/olmocr.git
synced
2025-10-13 01:02:26 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
olmocr
/
scripts
History
Jake Poznanski
a7e2f719bf
Start a preemptible one at least once
2025-07-02 19:26:30 +00:00
..
beaker
New trainer launch script cleanups
2025-06-25 23:05:32 +00:00
data
Rendering the pdfs in the dataloader
2025-06-11 18:11:42 +00:00
elo
…
eval
First attempt at new trainer code
2025-06-11 16:56:16 +00:00
pareto
Updating pareto plots
2025-06-17 21:41:23 +00:00
train
Start a preemptible one at least once
2025-07-02 19:26:30 +00:00
autoscan_dolmadocs.py
…
benchmark_throughput.py
…
chatgpt_tag_dolmadocs_v1.py
…
chatgpt_tag_dolmadocs_v2.py
…
check_qual.sh
…
infinigram_count.py
…
jsonl_to_markdown.py
…
molmo-7b-lora-gantry.sh
…
movedolmadocs_to_md.py
…
parse_with_pdfminer.py
…
pii_rule_comparison.py
…
prepare_changelog.py
…
release_notes.py
…
release.sh
…
rich_tagging_pipeline.py
…
run_benchmark.sh
Pulling in bigger benchmark script from vllm branch to main
2025-06-12 21:02:46 +00:00
run_integration_test.sh
…
run_marker_benchmark.sh
Go back to workers 1 in marker test script
2025-06-12 22:35:08 +00:00
run_tagging_pipeline.sh
…
s2orc_extractor.sh
…
scan_dolmadocs.py
…
sync_beaker_image.sh
Some helper scripts
2025-06-17 21:21:50 +00:00
tagging_pipeline_v2.py
…
tagging_pipeline.py
…