Jake Poznanski
|
04844b3f87
|
More beaker and docker fixes
|
2025-01-30 22:14:57 +00:00 |
|
Jake Poznanski
|
c69e0d6762
|
More cleanup, removing dead adv anchor code
|
2025-01-30 12:58:11 -08:00 |
|
Jake Poznanski
|
dcaca8aa90
|
Black formatting
|
2025-01-29 15:30:39 -08:00 |
|
Jake Poznanski
|
4a1762d455
|
isort
|
2025-01-29 15:25:10 -08:00 |
|
Jake Poznanski
|
b2894d0280
|
Massive refactor from pdelfin to olmocr
|
2025-01-27 18:30:41 +00:00 |
|
Jake Poznanski
|
5b429ad100
|
Higher lr for molmo, fixed evals
|
2025-01-24 23:15:35 +00:00 |
|
Jake Poznanski
|
d0eea81c00
|
Dealing with issue with molmo unused params
|
2025-01-24 16:27:42 +00:00 |
|
Jake Poznanski
|
ef4167dc45
|
Test set script
|
2025-01-14 19:36:18 +00:00 |
|
Jake Poznanski
|
cff97990bf
|
Moving to official sglang release
|
2024-11-22 19:37:31 +00:00 |
|
Jake Poznanski
|
9e2e09bd06
|
More fixes
|
2024-11-18 15:04:50 -08:00 |
|
Jake Poznanski
|
8e16780b82
|
Beaker stuff
|
2024-11-14 08:49:12 -08:00 |
|
Jake Poznanski
|
4c3bf7045d
|
Beaker fixes
|
2024-11-13 14:24:23 -08:00 |
|
Jake Poznanski
|
83bb1dcd3b
|
Dockerfile fixes
|
2024-11-13 12:59:52 -08:00 |
|
Jake Poznanski
|
6c9c785130
|
Using version strings
|
2024-11-13 12:35:40 -08:00 |
|
Jake Poznanski
|
39256c19bb
|
Beaker running
|
2024-11-13 10:25:35 -08:00 |
|
Jake Poznanski
|
867e2c9a36
|
Docker builds
|
2024-11-13 09:46:08 -08:00 |
|
Jake Poznanski
|
a091412079
|
Starting to play with docker too
|
2024-11-13 09:35:34 -08:00 |
|
Jake Poznanski
|
93d70683d4
|
More docs
|
2024-11-04 17:28:09 +00:00 |
|
Jake Poznanski
|
cda0ad7984
|
Config typo
|
2024-10-30 21:18:48 +00:00 |
|
Jake Poznanski
|
cf3b377bb9
|
train script
|
2024-10-30 14:05:02 -07:00 |
|
Jake Poznanski
|
a1a4798ce7
|
Some crazy idea I had to simplify futures and memory limits
|
2024-10-23 21:51:37 +00:00 |
|
Jake Poznanski
|
f6ac591fe9
|
vllm benchmarker
|
2024-10-23 18:14:50 +00:00 |
|
Jake Poznanski
|
d99096e9a2
|
Adding vllm profile script for reference
|
2024-10-22 20:00:34 +00:00 |
|
Jake Poznanski
|
31becaf7e4
|
S2orc dataset extractor
|
2024-10-21 21:28:44 +00:00 |
|
Jake Poznanski
|
3ecbeae6dc
|
Trying save to s3 but with threaded saver
|
2024-10-17 21:39:01 +00:00 |
|
Jake Poznanski
|
529d51d57d
|
Put LR back, need to save larger checkpoints to weka to prevent timeouts
|
2024-10-17 19:46:25 +00:00 |
|
Jake Poznanski
|
063be21287
|
New image
|
2024-10-16 14:46:28 -07:00 |
|
Jake Poznanski
|
90cb80fd65
|
Docker update
|
2024-10-16 21:40:39 +00:00 |
|
Jake Poznanski
|
a8b50ae8fa
|
Preloading the datasets directly
|
2024-10-10 19:57:51 +00:00 |
|
Jake Poznanski
|
230c8a9f9a
|
Trying new run that will rewrite the prompts as it goes
|
2024-10-08 22:10:18 +00:00 |
|
Jake Poznanski
|
adc702c918
|
FIxing wandb key
|
2024-10-08 18:16:39 +00:00 |
|
Jake Poznanski
|
4fb7e9b184
|
Updated eval script
|
2024-10-08 16:09:25 +00:00 |
|
Jake Poznanski
|
fb4e585e9f
|
Trying out non-lora training
|
2024-10-08 15:20:37 +00:00 |
|
Jake Poznanski
|
44bcdc771b
|
Hopefully can use weka for the train datasets now
|
2024-10-07 16:14:28 +00:00 |
|
Jake Poznanski
|
78e3a94173
|
Adding pluto ib
|
2024-10-03 15:33:17 +00:00 |
|
Jake Poznanski
|
0ddaf9023d
|
Getting ready to launch a new training run
|
2024-10-02 23:04:56 +00:00 |
|
Jake Poznanski
|
decfd7fbc1
|
Fixing the refiner input prompt to something simpler that doesn't depend on the training data. Fixing beaker job workspace and bumping priority to high.
|
2024-09-27 22:54:07 +00:00 |
|
Jake Poznanski
|
a0bec4ee41
|
7b scripto
|
2024-09-25 22:08:36 +00:00 |
|
Jake Poznanski
|
3a5b438a6f
|
Lora misconfiguration
|
2024-09-25 10:48:39 -07:00 |
|
Jake Poznanski
|
f6905c39ea
|
Hopefully the last changes
|
2024-09-24 15:52:34 -07:00 |
|
Jake Poznanski
|
0442a33209
|
New images work much better now, and device map fix
|
2024-09-24 12:58:18 -07:00 |
|
Jake Poznanski
|
0d9917367b
|
Flash attention as part of the image
|
2024-09-24 11:57:56 -07:00 |
|
Jake Poznanski
|
3c8e05362f
|
New image, dont need to install
|
2024-09-24 11:30:19 -07:00 |
|
Jake Poznanski
|
66c29dd44f
|
Moving to making a new dockerfile
|
2024-09-24 11:24:14 -07:00 |
|
Jake Poznanski
|
b0777dcb87
|
missing libaio
|
2024-09-24 15:32:31 +00:00 |
|
Jake Poznanski
|
5287ba50b9
|
Back to pip... sigh
|
2024-09-24 14:45:44 +00:00 |
|
Jake Poznanski
|
1cf3cd8caa
|
Had to swtich to conda env override for gantry due to cu118 compat
|
2024-09-23 22:35:42 +00:00 |
|
Jake Poznanski
|
cb0b97a16a
|
Gantry requirements
|
2024-09-23 15:08:39 -07:00 |
|
Jake Poznanski
|
15793975dd
|
Merge branch 'main' of https://github.com/allenai/pdelfin
|
2024-09-23 21:42:27 +00:00 |
|
Jake Poznanski
|
0691e1a77f
|
chmodding
|
2024-09-23 21:42:26 +00:00 |
|