Jake Poznanski
|
eb8dce01f4
|
Hmm, things work better but still have issues importing images
|
2025-09-17 20:12:05 +00:00 |
|
Jake Poznanski
|
0f04cc5c67
|
Merge branch 'jakep/new_data' into jakep/new_data_image_boxes
|
2025-09-17 19:51:56 +00:00 |
|
Jake Poznanski
|
30750f77c1
|
Ok, rendering smaller version of the page, since this is the max suppored by claude and it would get rescaled anyways
|
2025-09-17 19:51:03 +00:00 |
|
Jake Poznanski
|
54cda1662b
|
Brings in images from original documents, but it seems worse quality
|
2025-09-17 18:58:04 +00:00 |
|
Jake Poznanski
|
a60c84ed14
|
Maybe better scaling with no losing of text
|
2025-09-16 22:01:49 +00:00 |
|
Jake Poznanski
|
52df81873a
|
Glob path fixes
|
2025-09-16 21:42:46 +00:00 |
|
Jake Poznanski
|
3b729d6770
|
Oops fixing random gen things
|
2025-09-16 21:17:12 +00:00 |
|
Jake Poznanski
|
2fa67a980e
|
More reliable test gen
|
2025-09-16 19:57:00 +00:00 |
|
Jake Poznanski
|
e3e09c04db
|
Synth data fixups
|
2025-09-16 18:47:54 +00:00 |
|
Jake Poznanski
|
2400744673
|
More generic model name loading
|
2025-09-11 17:44:24 +00:00 |
|
Jake Poznanski
|
3ae0f30f98
|
Adjusted the dolma viewer so I can more easily vibe check some new model outputs
|
2025-09-11 17:32:20 +00:00 |
|
Jake Poznanski
|
0516ff035f
|
More viewer updates
|
2025-09-11 17:00:49 +00:00 |
|
Jake Poznanski
|
0ffcdc0272
|
Cleaner viewer
|
2025-09-11 16:49:10 +00:00 |
|
Jake Poznanski
|
3865e92e7e
|
more rotation new mix without filtering
|
2025-09-10 21:34:47 +00:00 |
|
Jake Poznanski
|
1ee3dce948
|
Oops
|
2025-09-10 17:01:15 +00:00 |
|
Jake Poznanski
|
43ccd82609
|
One more train config with rotation augments
|
2025-09-10 16:06:04 +00:00 |
|
Jake Poznanski
|
3eaa584ed5
|
Fixing data config
|
2025-09-09 15:22:04 +00:00 |
|
Jake Poznanski
|
077e3eea7f
|
Preemptible mix
|
2025-09-09 14:55:01 +00:00 |
|
Jake Poznanski
|
54cd5a3438
|
Going to train on the new transcripts data
|
2025-09-08 22:30:40 +00:00 |
|
Jake Poznanski
|
592a669e1f
|
Arxiv downloader
|
2025-09-08 20:44:54 +00:00 |
|
Jake Poznanski
|
607e251530
|
Mine tables stuff
|
2025-09-08 20:22:18 +00:00 |
|
Jake Poznanski
|
3e9477db98
|
Aiming to get some more table data
|
2025-09-08 20:05:09 +00:00 |
|
Jake Poznanski
|
ba8b3824bf
|
Adding some rotation augmentation to the post training step
|
2025-09-08 18:54:53 +00:00 |
|
Jake Poznanski
|
a957ab2aaf
|
Adding an adjustment to how blank pages test is run, skipping image tags
|
2025-09-08 17:18:51 +00:00 |
|
Jake Poznanski
|
c0a1e70440
|
Adding filter
|
2025-09-08 17:02:09 +00:00 |
|
Jake Poznanski
|
aad8729ad7
|
prepare checkpint from main
|
2025-09-08 17:01:58 +00:00 |
|
Jake Poznanski
|
0f46fa0988
|
Adding num_generations
|
2025-09-04 18:44:23 +00:00 |
|
Jake Poznanski
|
ef09c73bf2
|
Fixing up some rewards stuff
|
2025-09-04 17:34:53 +00:00 |
|
Jake Poznanski
|
ede0dc51b1
|
Adding drop last to prevent any weirdnesses
|
2025-09-04 16:50:08 +00:00 |
|
Jake Poznanski
|
14a882db9a
|
Fixing to new version, adjusting scale rewards stuff
|
2025-09-03 22:43:35 +00:00 |
|
Jake Poznanski
|
2fd4ae8489
|
Adding some more options to play with
|
2025-09-03 22:29:23 +00:00 |
|
Jake Poznanski
|
755c221024
|
Trying some more things
|
2025-09-03 22:11:16 +00:00 |
|
Jake Poznanski
|
0a9c8f3e96
|
Adding warmup steps param
|
2025-09-03 21:33:18 +00:00 |
|
Jake Poznanski
|
a41d04660a
|
Cleaning script
|
2025-09-03 21:31:21 +00:00 |
|
Jake Poznanski
|
e6cff25b6b
|
Cleanup stuff
|
2025-09-03 20:34:12 +00:00 |
|
Jake Poznanski
|
bade86fe91
|
Cleaned up things
|
2025-09-03 20:23:01 +00:00 |
|
Jake Poznanski
|
b689a8e5f8
|
Giving more memory buffer
|
2025-09-03 19:56:53 +00:00 |
|
Jake Poznanski
|
7346d12322
|
Better cleaning, augusta version
|
2025-09-03 18:47:02 +00:00 |
|
Jake Poznanski
|
f20f1a0b54
|
Doing some cleaning
|
2025-09-03 18:41:36 +00:00 |
|
Jake Poznanski
|
94d19c51c6
|
Cleaning up scripts, multi gpu trainer more flexible
|
2025-09-03 18:25:10 +00:00 |
|
Jake Poznanski
|
c612293a59
|
Remove device map auto
|
2025-09-03 18:04:42 +00:00 |
|
Jake Poznanski
|
1fb49cefc1
|
Working on multi gpu trainer
|
2025-09-03 17:25:14 +00:00 |
|
Jake Poznanski
|
00f51fb2c7
|
Fixing bug with multi epoch training
|
2025-09-02 21:03:00 +00:00 |
|
Jake Poznanski
|
72fcfafde7
|
Fixed up national archives script
|
2025-08-29 16:36:18 +00:00 |
|
Jake Poznanski
|
7e09a02f7c
|
Better NA extractor
|
2025-08-28 20:48:51 +00:00 |
|
Jake Poznanski
|
22626a512c
|
Describing national archives project
|
2025-08-28 18:44:24 +00:00 |
|
Jake Poznanski
|
f426826850
|
Clean downloads
|
2025-08-28 18:03:52 +00:00 |
|
Jake Poznanski
|
3d3d184f25
|
Fixes
|
2025-08-28 17:56:27 +00:00 |
|
Jake Poznanski
|
ed3820c0c7
|
LOC downloader
|
2025-08-28 17:38:20 +00:00 |
|
Jake Poznanski
|
6123d4452b
|
Reorganizing files
|
2025-08-28 17:04:26 +00:00 |
|