27 Commits

Author SHA1 Message Date
Jake Poznanski
a8b50ae8fa Preloading the datasets directly 2024-10-10 19:57:51 +00:00
Jake Poznanski
230c8a9f9a Trying new run that will rewrite the prompts as it goes 2024-10-08 22:10:18 +00:00
Jake Poznanski
adc702c918 Fixing wandb key 2024-10-08 18:16:39 +00:00
Jake Poznanski
4fb7e9b184 Updated eval script 2024-10-08 16:09:25 +00:00
Jake Poznanski
fb4e585e9f Trying out non-lora training 2024-10-08 15:20:37 +00:00
Jake Poznanski
44bcdc771b Hopefully can use weka for the train datasets now 2024-10-07 16:14:28 +00:00
Jake Poznanski
78e3a94173 Adding pluto ib 2024-10-03 15:33:17 +00:00
Jake Poznanski
0ddaf9023d Getting ready to launch a new training run 2024-10-02 23:04:56 +00:00
Jake Poznanski
decfd7fbc1 Fixing the refiner input prompt to something simpler that doesn't depend on the training data. Fixing beaker job workspace and bumping priority to high. 2024-09-27 22:54:07 +00:00
Jake Poznanski
a0bec4ee41 7b scripto 2024-09-25 22:08:36 +00:00
Jake Poznanski
3a5b438a6f Lora misconfiguration 2024-09-25 10:48:39 -07:00
Jake Poznanski
f6905c39ea Hopefully the last changes 2024-09-24 15:52:34 -07:00
Jake Poznanski
0442a33209 New images work much better now, and device map fix 2024-09-24 12:58:18 -07:00
Jake Poznanski
0d9917367b Flash attention as part of the image 2024-09-24 11:57:56 -07:00
Jake Poznanski
3c8e05362f New image, don't need to install 2024-09-24 11:30:19 -07:00
Jake Poznanski
66c29dd44f Moving to making a new dockerfile 2024-09-24 11:24:14 -07:00
Jake Poznanski
b0777dcb87 missing libaio 2024-09-24 15:32:31 +00:00
Jake Poznanski
5287ba50b9 Back to pip... sigh 2024-09-24 14:45:44 +00:00
Jake Poznanski
1cf3cd8caa Had to switch to conda env override for gantry due to cu118 compat 2024-09-23 22:35:42 +00:00
Jake Poznanski
cb0b97a16a Gantry requirements 2024-09-23 15:08:39 -07:00
Jake Poznanski
15793975dd Merge branch 'main' of https://github.com/allenai/pdelfin 2024-09-23 21:42:27 +00:00
Jake Poznanski
0691e1a77f chmodding 2024-09-23 21:42:26 +00:00
Jake Poznanski
a30ca16e1f Script adjustment 2024-09-23 14:41:35 -07:00
Jake Poznanski
a3feca01fc Setting up for a real train run 2024-09-23 14:32:10 -07:00
Jake Poznanski
0812b0dd77 Prepping for gantry 2024-09-23 14:04:22 -07:00
Jake Poznanski
9662718bfd Running personalize script on template 2024-09-17 15:06:59 +00:00
Jake Poznanski
68b2c0e8d6 Initial commit 2024-09-17 07:53:43 -07:00