582 Commits

Author SHA1 Message Date
Jake Poznanski
b15bff64ff Work queue coallescing 2024-11-07 13:26:42 -08:00
Jake Poznanski
57186c7737 Doing some more stuff 2024-11-07 21:08:46 +00:00
Jake Poznanski
923231e680 exit handlers 2024-11-07 21:00:51 +00:00
Jake Poznanski
051a7b4f6b Prepping work script 2024-11-07 20:16:23 +00:00
Jake Poznanski
a65e12bea0 Model download stuff 2024-11-07 19:01:45 +00:00
Jake Poznanski
12a91ffa96 Starting on a new approach 2024-11-07 18:21:23 +00:00
Jake Poznanski
faf8659028 Putting aside redis 2024-11-07 16:49:13 +00:00
Jake Poznanski
3d6be3c97b Work queue sharing thing 2024-11-07 00:11:02 +00:00
Jake Poznanski
75d4a0ec6e Experimental beaker pipeline self organizing redis idea 2024-11-07 00:03:30 +00:00
Jake Poznanski
a14febc79d sglang support for runeval 2024-11-06 23:19:11 +00:00
Jake Poznanski
592cc50067 More docs 2024-11-04 17:58:46 +00:00
Jake Poznanski
03f5b25d49 Docs good now 2024-11-04 17:37:24 +00:00
Jake Poznanski
d89ea6b9fb docs 2024-11-04 17:36:37 +00:00
Jake Poznanski
0362ce687f docs 2024-11-04 17:36:20 +00:00
Jake Poznanski
b2b3f06d8d docs 2024-11-04 17:35:28 +00:00
Jake Poznanski
46ccab38b2 More docs 2024-11-04 17:34:13 +00:00
Jake Poznanski
93d70683d4 More docs 2024-11-04 17:28:09 +00:00
Jake Poznanski
73bd961135 Logger fix 2024-11-04 17:10:22 +00:00
Jake Poznanski
37782283e0 More docs 2024-11-04 17:08:29 +00:00
Jake Poznanski
ef2e4d6e42 Adding more docs 2024-11-04 16:20:36 +00:00
Jake Poznanski
5ebc8cdd88 Checkfix 2024-11-01 17:13:11 +00:00
Jake Poznanski
9f010e6ab0 Add check for poppler installation 2024-11-01 16:57:19 +00:00
Jake Poznanski
be8fb28799
Update README.md 2024-11-01 09:49:41 -07:00
Jake Poznanski
426fda1f24 Removing some logs 2024-11-01 15:00:49 +00:00
Jake Poznanski
500bd2de5b flash attn 2024-10-30 22:33:10 +00:00
Jake Poznanski
d45b34fdd5 Trust remote code 2024-10-30 21:22:39 +00:00
Jake Poznanski
cda0ad7984 Config typo 2024-10-30 21:18:48 +00:00
Jake Poznanski
cf3b377bb9 train script 2024-10-30 14:05:02 -07:00
Jake Poznanski
8f001bf74c Config updates 2024-10-30 14:02:57 -07:00
Jake Poznanski
6a4a55f9e0 Hopefully working molmo HF trainer config 2024-10-30 14:00:27 -07:00
Jake Poznanski
bede854cd5 Startng to write molmo formatters 2024-10-30 13:24:11 -07:00
Jake Poznanski
e65747e591 Some better logging 2024-10-30 11:22:52 -07:00
Jake Poznanski
a0e0917102 Merge branch 'main' of https://github.com/allenai/pdelfin into main 2024-10-30 10:42:56 -07:00
Jake Poznanski
43aa4f2508 Proper selection of LORA weights 2024-10-30 10:42:53 -07:00
Jake Poznanski
c652c7e396 Merge branch 'main' of https://github.com/allenai/pdelfin 2024-10-30 16:26:03 +00:00
Jake Poznanski
85e0e2a61b Fixing issues with pdf parsing 2024-10-30 16:26:02 +00:00
Jake Poznanski
bcb47946e5 Starting on molmo changes 2024-10-30 08:39:48 -07:00
Jake Poznanski
232c445a23 Pipeline stability fixes hopefully and logging 2024-10-29 20:15:34 +00:00
Jake Poznanski
ce2e4baa87 Applying rotation corrections 2024-10-28 20:32:23 +00:00
Jake Poznanski
08d51b7183 Adding some rotation retry contrl 2024-10-28 20:16:06 +00:00
Jake Poznanski
7678f31aa9 Fixing some reliability issues with the pipeline script 2024-10-28 16:49:00 +00:00
Jake Poznanski
45269fa6a5 Switching to logging vs prints 2024-10-28 15:29:46 +00:00
Jake Poznanski
a3e7654190 Update all docs at once 2024-10-28 15:06:29 +00:00
Jake Poznanski
062abff25c Adding some skip logic 2024-10-27 21:17:48 +00:00
Jake Poznanski
8e6d0c65d6 swtichin to orjson, some better json error handling 2024-10-25 22:10:54 +00:00
Jake Poznanski
48a3affec3 Reindexing 2024-10-25 20:32:51 +00:00
Jake Poznanski
f13d0a5741 List configs to list 2024-10-24 03:07:32 +00:00
Jake Poznanski
ffe470bf0e Fix 2024-10-23 22:55:50 +00:00
Jake Poznanski
180dde03c5 dataprep sampling tests 2024-10-23 22:53:05 +00:00
Jake Poznanski
64041bd6d7 Allow sampling different anchor text lens 2024-10-23 15:37:23 -07:00