245 Commits

Author SHA1 Message Date
Jake Poznanski
c612293a59 Remove device map auto 2025-09-03 18:04:42 +00:00
Jake Poznanski
1fb49cefc1 Working on multi gpu trainer 2025-09-03 17:25:14 +00:00
Jake Poznanski
3be381b375 Adding some params 2025-08-26 20:46:06 +00:00
Jake Poznanski
82fd50263f Launcher for grpo training 2025-08-26 16:28:38 +00:00
Jake Poznanski
ed6f483074 Fixing run_benchmark 2025-08-25 20:28:40 +00:00
Jake Poznanski
d84eb95ba2 Saving some extra data mixes 2025-08-25 20:26:29 +00:00
Jake Poznanski
b16e4051f6 Saving bench results to s3 2025-08-25 19:53:55 +00:00
Jake Poznanski
d9b6978499 Some scripts 2025-08-25 18:44:18 +00:00
Jake Poznanski
55b7101d7e Add some new rotation tests to a branch of the bench 2025-08-25 16:25:00 +00:00
Jake Poznanski
c0aee06c8f grpo startup script works 2025-08-21 22:15:21 +00:00
Jake Poznanski
1dd6ff9b03 Olmocr bench grpo stuff 2025-08-21 18:17:07 +00:00
Jake Poznanski
6184c94c3c Vllm enable 2025-08-21 17:33:56 +00:00
Jake Poznanski
1dbb4332c0 FIxing up 2025-08-21 16:50:56 +00:00
Jake Poznanski
7c446e1679 Trying to fix script 2025-08-20 22:44:10 +00:00
Jake Poznanski
a2ee4d46c0 gpro trainer test 1 2025-08-20 22:35:19 +00:00
Jake Poznanski
c075f3071f New configs for new data 2025-08-16 17:31:42 +00:00
Jake Poznanski
2fca448105 Using new budget code 2025-08-06 16:31:08 +00:00
Jake Poznanski
8b8c6bb837 Cleaning up some training requirements installation steps 2025-08-05 19:42:46 +00:00
Jake Poznanski
c9b8088bc6 Adding some preempt flags 2025-08-05 18:00:46 +00:00
Jake Poznanski
55f8ba0ac0 Fixing configs 2025-08-04 22:54:39 +00:00
Jake Poznanski
12f8a90f1b Copying preprocessed files to local ssd in trainer script 2025-08-04 22:18:38 +00:00
Jake Poznanski
7c098955a9 Trying fix for transformers benchmark 2025-08-04 19:50:05 +00:00
Jake Poznanski
df52cb0e0e Small fixes for transformers test runner 2025-07-25 03:18:24 +00:00
Jake Poznanski
cf1912dec4 Some transformer bench ideas 2025-07-24 21:20:15 +00:00
Jake Poznanski
16145a4b32 Need accelerate 2025-07-16 18:51:37 +00:00
Jake Poznanski
31c834dcdd Constants 2025-07-16 02:15:17 +00:00
Jake Poznanski
5ea4e8a6e2 Compare vllm script 2025-07-15 22:55:49 +00:00
Jake Poznanski
24608956a0 Working on comparing to vllm 2025-07-15 22:21:54 +00:00
Jake Poznanski
e6c98236b6 Adding more pipeline retry stats, compress code fixed 2025-07-15 21:41:10 +00:00
Jake Poznanski
4dbbf91e1c Compression script 2025-07-15 21:26:15 +00:00
Jake Poznanski
1092213c5f Merge branch 'jakep/new_traininer_nojson_newprompt' into jakep/new_trainer 2025-07-15 17:44:55 +00:00
Jake Poznanski
43ae28dde4 Prepare checkpoint works for older models too 2025-07-14 21:30:32 +00:00
Jake Poznanski
f014c2aaf9 Need to reserve all 8 gpus for reliable performance benchmark, even if you only use 1 2025-07-14 21:02:14 +00:00
Jake Poznanski
65d0edcaae Adding guided decoding option 2025-07-10 15:13:26 +00:00
Jake Poznanski
a7e2f719bf Start a preemptible one at least once 2025-07-02 19:26:30 +00:00
Jake Poznanski
79a7818517 New trainer launch script for beaker 2025-07-01 01:43:38 +00:00
Jake Poznanski
dcf026a63c Better script 2025-07-01 01:40:55 +00:00
Jake Poznanski
9f0f912101 Ugh 2025-06-30 23:37:35 +00:00
Jake Poznanski
1d007d1bf2 Perhaps fixing default config 2025-06-30 22:58:21 +00:00
Jake Poznanski
e7020c7f50 More configs 2025-06-30 22:49:42 +00:00
Jake Poznanski
5c2d69a3d7 Some cleanup stuff 2025-06-30 21:24:35 +00:00
Jake Poznanski
656dbef833 Frontier configs 2025-06-30 17:43:30 +00:00
Jake Poznanski
e2f2d36e4f More typos 2025-06-30 17:41:19 +00:00
Jake Poznanski
55a737ca6b script 2025-06-30 17:32:01 +00:00
Jake Poznanski
ba49fd53d9 frontier train script let's see what happens 2025-06-30 17:30:17 +00:00
Jake Poznanski
b96454b786 Merge branch 'main' into jakep/new_trainer 2025-06-27 16:21:58 +00:00
Jake Poznanski
1451dd1395 weka 2025-06-27 02:57:26 +00:00
Jake Poznanski
dee3730231 Gantry stuff 2025-06-26 18:34:53 +00:00
Jake Poznanski
0d7836b111 Basic atttempt to run trainer script 2025-06-25 23:22:59 +00:00
Jake Poznanski
d7e5037192 New trainer launch script cleanups 2025-06-25 23:05:32 +00:00