Jake Poznanski
|
2e3d1a0317
|
Comitting test script to be used in model cards for individual one-off inference
|
2025-10-06 22:47:06 +00:00 |
|
Jake Poznanski
|
8ef7f8085a
|
isort and black
|
2025-09-30 17:37:10 +00:00 |
|
Jake Poznanski
|
7f4b728dcd
|
Skip docker checks if using beaker image
|
2025-09-26 23:27:04 +00:00 |
|
Jake Poznanski
|
bb06829840
|
SOuping in fp32
|
2025-09-26 20:03:29 +00:00 |
|
Jake Poznanski
|
01bc1ff7b6
|
Allow passing in beaker image to run benchmark
|
2025-09-23 19:48:42 +00:00 |
|
Jake Poznanski
|
a00d9d172e
|
Adding stricter math and table tests when in synthetic mode
|
2025-09-23 18:37:50 +00:00 |
|
Jake Poznanski
|
1197c35808
|
Mix contamination checker script
|
2025-09-23 18:17:13 +00:00 |
|
Jake Poznanski
|
83d965c768
|
Adding contam check for olmocr-bench when making synth data
|
2025-09-20 03:42:42 +00:00 |
|
Jake Poznanski
|
1ac72ad169
|
Adding some scripts to clean data
|
2025-09-18 19:44:30 +00:00 |
|
Jake Poznanski
|
54cd5a3438
|
Going to train on the new transcripts data
|
2025-09-08 22:30:40 +00:00 |
|
Jake Poznanski
|
ef09c73bf2
|
Fixing up some rewards stuff
|
2025-09-04 17:34:53 +00:00 |
|
Jake Poznanski
|
ede0dc51b1
|
Adding drop last to prevent any weirdnesses
|
2025-09-04 16:50:08 +00:00 |
|
Jake Poznanski
|
14a882db9a
|
Fixing to new version, adjusting scale rewards stuff
|
2025-09-03 22:43:35 +00:00 |
|
Jake Poznanski
|
755c221024
|
Trying some more things
|
2025-09-03 22:11:16 +00:00 |
|
Jake Poznanski
|
a41d04660a
|
Cleaning script
|
2025-09-03 21:31:21 +00:00 |
|
Jake Poznanski
|
e6cff25b6b
|
Cleanup stuff
|
2025-09-03 20:34:12 +00:00 |
|
Jake Poznanski
|
bade86fe91
|
Cleaned up things
|
2025-09-03 20:23:01 +00:00 |
|
Jake Poznanski
|
b689a8e5f8
|
Giving more memory buffer
|
2025-09-03 19:56:53 +00:00 |
|
Jake Poznanski
|
7346d12322
|
Better cleaning, augusta version
|
2025-09-03 18:47:02 +00:00 |
|
Jake Poznanski
|
f20f1a0b54
|
Doing some cleaning
|
2025-09-03 18:41:36 +00:00 |
|
Jake Poznanski
|
94d19c51c6
|
Cleaning up scripts, multi gpu trainer more flexible
|
2025-09-03 18:25:10 +00:00 |
|
Jake Poznanski
|
c612293a59
|
Remove device map auto
|
2025-09-03 18:04:42 +00:00 |
|
Jake Poznanski
|
1fb49cefc1
|
Working on multi gpu trainer
|
2025-09-03 17:25:14 +00:00 |
|
Jake Poznanski
|
3be381b375
|
Adding some params
|
2025-08-26 20:46:06 +00:00 |
|
Jake Poznanski
|
82fd50263f
|
Launcher for grpo training
|
2025-08-26 16:28:38 +00:00 |
|
Jake Poznanski
|
ed6f483074
|
Fixing run_benchmark
|
2025-08-25 20:28:40 +00:00 |
|
Jake Poznanski
|
d84eb95ba2
|
Saving some extra data mixes
|
2025-08-25 20:26:29 +00:00 |
|
Jake Poznanski
|
b16e4051f6
|
Saving bench results to s3
|
2025-08-25 19:53:55 +00:00 |
|
Jake Poznanski
|
d9b6978499
|
Some scripts
|
2025-08-25 18:44:18 +00:00 |
|
Jake Poznanski
|
55b7101d7e
|
Add some new rotation tests to a branch of the bench
|
2025-08-25 16:25:00 +00:00 |
|
Jake Poznanski
|
c0aee06c8f
|
grpo startup script works
|
2025-08-21 22:15:21 +00:00 |
|
Jake Poznanski
|
1dd6ff9b03
|
Olmocr bench grpo stuff
|
2025-08-21 18:17:07 +00:00 |
|
Jake Poznanski
|
6184c94c3c
|
Vllm enable
|
2025-08-21 17:33:56 +00:00 |
|
Jake Poznanski
|
1dbb4332c0
|
FIxing up
|
2025-08-21 16:50:56 +00:00 |
|
Jake Poznanski
|
7c446e1679
|
Trying to fix script
|
2025-08-20 22:44:10 +00:00 |
|
Jake Poznanski
|
a2ee4d46c0
|
gpro trainer test 1
|
2025-08-20 22:35:19 +00:00 |
|
Jake Poznanski
|
c075f3071f
|
New configs for new data
|
2025-08-16 17:31:42 +00:00 |
|
Jake Poznanski
|
2fca448105
|
Using new budget code
|
2025-08-06 16:31:08 +00:00 |
|
Jake Poznanski
|
8b8c6bb837
|
Cleaning up some training requirements installation steps
|
2025-08-05 19:42:46 +00:00 |
|
Jake Poznanski
|
c9b8088bc6
|
Adding some preempt flags
|
2025-08-05 18:00:46 +00:00 |
|
Jake Poznanski
|
55f8ba0ac0
|
Fixing configs
|
2025-08-04 22:54:39 +00:00 |
|
Jake Poznanski
|
12f8a90f1b
|
Copying preprocessed files to local ssd in trainer script
|
2025-08-04 22:18:38 +00:00 |
|
Jake Poznanski
|
7c098955a9
|
Trying fix for transformers benchmark
|
2025-08-04 19:50:05 +00:00 |
|
Jake Poznanski
|
df52cb0e0e
|
Small fixes for transformers test runner
|
2025-07-25 03:18:24 +00:00 |
|
Jake Poznanski
|
cf1912dec4
|
Some transformer bench ideas
|
2025-07-24 21:20:15 +00:00 |
|
Jake Poznanski
|
16145a4b32
|
Need accelerate
|
2025-07-16 18:51:37 +00:00 |
|
Jake Poznanski
|
31c834dcdd
|
Constants
|
2025-07-16 02:15:17 +00:00 |
|
Jake Poznanski
|
5ea4e8a6e2
|
Compare vllm script
|
2025-07-15 22:55:49 +00:00 |
|
Jake Poznanski
|
24608956a0
|
Working on comparing to vllm
|
2025-07-15 22:21:54 +00:00 |
|
Jake Poznanski
|
e6c98236b6
|
Adding more pipeline retry stats, compress code fixed
|
2025-07-15 21:41:10 +00:00 |
|