10 Commits

Author | SHA1 | Message | Date
Jake Poznanski | 3be381b375 | Adding some params | 2025-08-26 20:46:06 +00:00
Jake Poznanski | 82fd50263f | Launcher for grpo training | 2025-08-26 16:28:38 +00:00
Jake Poznanski | d84eb95ba2 | Saving some extra data mixes | 2025-08-25 20:26:29 +00:00
Jake Poznanski | d9b6978499 | Some scripts | 2025-08-25 18:44:18 +00:00
Jake Poznanski | c0aee06c8f | grpo startup script works | 2025-08-21 22:15:21 +00:00
Jake Poznanski | 1dd6ff9b03 | Olmocr bench grpo stuff | 2025-08-21 18:17:07 +00:00
Jake Poznanski | 6184c94c3c | Vllm enable | 2025-08-21 17:33:56 +00:00
Jake Poznanski | 1dbb4332c0 | FIxing up | 2025-08-21 16:50:56 +00:00
Jake Poznanski | 7c446e1679 | Trying to fix script | 2025-08-20 22:44:10 +00:00
Jake Poznanski | a2ee4d46c0 | gpro trainer test 1 | 2025-08-20 22:35:19 +00:00