Logo
Explore Help
Register Sign In
yujunjun/olmocr
1
0
Fork 0
You've already forked olmocr
mirror of https://github.com/allenai/olmocr.git synced 2025-10-29 09:01:35 +00:00
Code Issues Packages Projects Releases Wiki Activity
olmocr/scripts
History
Jake Poznanski 54cd5a3438 Going to train on the new transcripts data
2025-09-08 22:30:40 +00:00
..
beaker
New trainer launch script cleanups
2025-06-25 23:05:32 +00:00
data
Rendering the pdfs in the dataloader
2025-06-11 18:11:42 +00:00
elo
…
eval
First attempt at new trainer code
2025-06-11 16:56:16 +00:00
pareto
Updating pareto plots
2025-06-17 21:41:23 +00:00
pii
Cleaning up scripts, multi gpu trainer more flexible
2025-09-03 18:25:10 +00:00
train
Going to train on the new transcripts data
2025-09-08 22:30:40 +00:00
clean_olmocrmix.py
Cleaning script
2025-09-03 21:31:21 +00:00
compare_vllm.sh
Using new budget code
2025-08-06 16:31:08 +00:00
compress_model.sh
Using new budget code
2025-08-06 16:31:08 +00:00
infinigram_count.py
…
jsonl_to_markdown.py
Code cleanup, version bump, remove unused permutation test
2025-05-16 21:25:32 +00:00
movedolmadocs_to_md.py
…
parse_with_pdfminer.py
Repo cleanup
2025-05-28 17:08:25 +00:00
prepare_changelog.py
…
release_notes.py
…
release.sh
…
run_benchmark_guided_decoding.sh
Using new budget code
2025-08-06 16:31:08 +00:00
run_benchmark.sh
Fixing run_benchmark
2025-08-25 20:28:40 +00:00
run_integration_test.sh
…
run_marker_benchmark.sh
Using new budget code
2025-08-06 16:31:08 +00:00
run_transformers_benchmark.sh
Trying fix for transformers benchmark
2025-08-04 19:50:05 +00:00
s2orc_extractor.sh
…
scan_dolmadocs.py
…
sync_beaker_image.sh
Some helper scripts
2025-06-17 21:21:50 +00:00
Powered by Gitea Version: 1.23.5 Page: 2150ms Template: 234ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API