olmocr

mirror of https://github.com/allenai/olmocr.git synced 2025-07-23 09:02:16 +00:00

Author	SHA1	Message	Date
Jake Poznanski	b2894d0280	Massive refactor from pdelfin to olmocr	2025-01-27 18:30:41 +00:00
Jake Poznanski	3ecbeae6dc	Trying save to s3 but with threaded saver	2024-10-17 21:39:01 +00:00
Jake Poznanski	529d51d57d	Put LR back, need to save larger checkpoints to weka to prevent timeouts	2024-10-17 19:46:25 +00:00
Jake Poznanski	063be21287	New image	2024-10-16 21:46:28 +00:00
Jake Poznanski	a8b50ae8fa	Preloading the datasets directly	2024-10-10 19:57:51 +00:00
Jake Poznanski	adc702c918	Fixing wandb key	2024-10-08 18:16:39 +00:00
Jake Poznanski	4fb7e9b184	Updated eval script	2024-10-08 16:09:25 +00:00
Jake Poznanski	fb4e585e9f	Trying out non-lora training	2024-10-08 15:20:37 +00:00
Jake Poznanski	44bcdc771b	Hopefully can use weka for the train datasets now	2024-10-07 16:14:28 +00:00
Jake Poznanski	0ddaf9023d	Getting ready to launch a new training run	2024-10-02 23:04:56 +00:00
Jake Poznanski	decfd7fbc1	Fixing the refiner input prompt to something simpler that doesn't depend on the training data. Fixing beaker job workspace and bumping priority to high.	2024-09-27 22:54:07 +00:00
Jake Poznanski	a0bec4ee41	7b scripto	2024-09-25 22:08:36 +00:00