yujunjun/olmocr
mirror of https://github.com/allenai/olmocr.git synced 2025-10-13 01:02:26 +00:00
olmocr/scripts
Jake Poznanski a7e2f719bf Start a preemptible one at least once
2025-07-02 19:26:30 +00:00
beaker
New trainer launch script cleanups
2025-06-25 23:05:32 +00:00
data
Rendering the pdfs in the dataloader
2025-06-11 18:11:42 +00:00
elo
…
eval
First attempt at new trainer code
2025-06-11 16:56:16 +00:00
pareto
Updating pareto plots
2025-06-17 21:41:23 +00:00
train
Start a preemptible one at least once
2025-07-02 19:26:30 +00:00
autoscan_dolmadocs.py
…
benchmark_throughput.py
…
chatgpt_tag_dolmadocs_v1.py
…
chatgpt_tag_dolmadocs_v2.py
…
check_qual.sh
…
infinigram_count.py
…
jsonl_to_markdown.py
…
molmo-7b-lora-gantry.sh
…
movedolmadocs_to_md.py
…
parse_with_pdfminer.py
…
pii_rule_comparison.py
…
prepare_changelog.py
…
release_notes.py
…
release.sh
…
rich_tagging_pipeline.py
…
run_benchmark.sh
Pulling in bigger benchmark script from vllm branch to main
2025-06-12 21:02:46 +00:00
run_integration_test.sh
…
run_marker_benchmark.sh
Go back to workers 1 in marker test script
2025-06-12 22:35:08 +00:00
run_tagging_pipeline.sh
…
s2orc_extractor.sh
…
scan_dolmadocs.py
…
sync_beaker_image.sh
Some helper scripts
2025-06-17 21:21:50 +00:00
tagging_pipeline_v2.py
…
tagging_pipeline.py
…