1749 Commits

Author SHA1 Message Date
Jake Poznanski
cedd4a80cf Fixing paddle ocr to run fast 2025-09-19 16:57:05 +00:00
Jake Poznanski
8f75ea062e testing paddle 2025-09-19 16:55:01 +00:00
Jake Poznanski
e9ab2fd1bb Adding paddlepaddle v5 runner for benchmarking 2025-09-19 16:45:53 +00:00
Jake Poznanski
1c703917df Synthmix ignore 2025-09-19 15:58:58 +00:00
Jake Poznanski
4907b1c700 Bumping rotation augment a tad 2025-09-18 20:38:02 +00:00
Jake Poznanski
a42d8199cd Adding 1025 mix dataset, should be ready for final run 2025-09-18 19:50:40 +00:00
Jake Poznanski
1ac72ad169 Adding some scripts to clean data 2025-09-18 19:44:30 +00:00
Jake Poznanski
30750f77c1 Ok, rendering smaller version of the page, since this is the max suppored by claude and it would get rescaled anyways 2025-09-17 19:51:03 +00:00
Jake Poznanski
a60c84ed14 Maybe better scaling with no losing of text 2025-09-16 22:01:49 +00:00
Jake Poznanski
52df81873a Glob path fixes 2025-09-16 21:42:46 +00:00
Jake Poznanski
3b729d6770 Oops fixing random gen things 2025-09-16 21:17:12 +00:00
Jake Poznanski
2fa67a980e More reliable test gen 2025-09-16 19:57:00 +00:00
Jake Poznanski
e3e09c04db Synth data fixups 2025-09-16 18:47:54 +00:00
Jake Poznanski
2400744673 More generic model name loading 2025-09-11 17:44:24 +00:00
Jake Poznanski
3ae0f30f98 Adjusted the dolma viewer so I can more easily vibe check some new model outputs 2025-09-11 17:32:20 +00:00
Jake Poznanski
0516ff035f More viewer updates 2025-09-11 17:00:49 +00:00
Jake Poznanski
0ffcdc0272 Cleaner viewer 2025-09-11 16:49:10 +00:00
Jake Poznanski
3865e92e7e more rotation new mix without filtering 2025-09-10 21:34:47 +00:00
Jake Poznanski
1ee3dce948 Oops 2025-09-10 17:01:15 +00:00
Jake Poznanski
43ccd82609 One more train config with rotation augments 2025-09-10 16:06:04 +00:00
Jake Poznanski
3eaa584ed5 Fixing data config 2025-09-09 15:22:04 +00:00
Jake Poznanski
077e3eea7f Preemptible mix 2025-09-09 14:55:01 +00:00
Jake Poznanski
54cd5a3438 Going to train on the new transcripts data 2025-09-08 22:30:40 +00:00
Jake Poznanski
592a669e1f Arxiv downloader 2025-09-08 20:44:54 +00:00
Jake Poznanski
607e251530 Mine tables stuff 2025-09-08 20:22:18 +00:00
Jake Poznanski
3e9477db98 Aiming to get some more table data 2025-09-08 20:05:09 +00:00
Jake Poznanski
ba8b3824bf Adding some rotation augmentation to the post training step 2025-09-08 18:54:53 +00:00
Jake Poznanski
a957ab2aaf Adding an adjustment to how blank pages test is run, skipping image tags 2025-09-08 17:18:51 +00:00
Jake Poznanski
c0a1e70440 Adding filter 2025-09-08 17:02:09 +00:00
Jake Poznanski
aad8729ad7 prepare checkpint from main 2025-09-08 17:01:58 +00:00
Jake Poznanski
8f88a98e5d prepare checkpoint script fixes 2025-09-04 22:15:55 +00:00
Jake Poznanski
0f46fa0988 Adding num_generations 2025-09-04 18:44:23 +00:00
Jake Poznanski
ef09c73bf2 Fixing up some rewards stuff 2025-09-04 17:34:53 +00:00
Jake Poznanski
ede0dc51b1 Adding drop last to prevent any weirdnesses 2025-09-04 16:50:08 +00:00
Jake Poznanski
14a882db9a Fixing to new version, adjusting scale rewards stuff 2025-09-03 22:43:35 +00:00
Jake Poznanski
2fd4ae8489 Adding some more options to play with 2025-09-03 22:29:23 +00:00
Jake Poznanski
755c221024 Trying some more things 2025-09-03 22:11:16 +00:00
Jake Poznanski
0a9c8f3e96 Adding warmup steps param 2025-09-03 21:33:18 +00:00
Jake Poznanski
a41d04660a Cleaning script 2025-09-03 21:31:21 +00:00
Jake Poznanski
e6cff25b6b Cleanup stuff 2025-09-03 20:34:12 +00:00
Jake Poznanski
bade86fe91 Cleaned up things 2025-09-03 20:23:01 +00:00
Jake Poznanski
b689a8e5f8 Giving more memory buffer 2025-09-03 19:56:53 +00:00
Jake Poznanski
7346d12322 Better cleaning, augusta version 2025-09-03 18:47:02 +00:00
Jake Poznanski
f20f1a0b54 Doing some cleaning 2025-09-03 18:41:36 +00:00
Jake Poznanski
94d19c51c6 Cleaning up scripts, multi gpu trainer more flexible 2025-09-03 18:25:10 +00:00
Jake Poznanski
c612293a59 Remove device map auto 2025-09-03 18:04:42 +00:00
Jake Poznanski
1fb49cefc1 Working on multi gpu trainer 2025-09-03 17:25:14 +00:00
Jake Poznanski
00f51fb2c7 Fixing bug with multi epoch training 2025-09-02 21:03:00 +00:00
Jake Poznanski
c720c02d83 Cleaning up repo a bit 2025-09-02 06:45:24 +00:00
Jake Poznanski
56b08d5aa4 Bump version to v0.3.4 for release v0.3.4 2025-08-31 03:12:39 +00:00