582 Commits

Author SHA1 Message Date
Jake Poznanski
827b77e8df Working on task groups 2024-11-14 12:06:13 -08:00
Jake Poznanski
a58efea133 better logging 2024-11-14 09:55:37 -08:00
Jake Poznanski
a9cf2e0272 Allow setting beaker priority 2024-11-14 09:10:28 -08:00
Jake Poznanski
41c8d552a0 exponential backoff 2024-11-14 09:02:49 -08:00
Jake Poznanski
4dcf9ed5d4 more fixes 2024-11-14 08:55:20 -08:00
Jake Poznanski
06331d740a Fix timeout 2024-11-14 08:50:33 -08:00
Jake Poznanski
8e16780b82 Beaker stuff 2024-11-14 08:49:12 -08:00
Jake Poznanski
4c3bf7045d Beaker fixes 2024-11-13 14:24:23 -08:00
Jake Poznanski
3172a1c16a Shuffling 2024-11-13 13:23:29 -08:00
Jake Poznanski
fe3c9a2709 Creds and other things 2024-11-13 13:14:33 -08:00
Jake Poznanski
a3b6962d21 fix 2024-11-13 13:05:57 -08:00
Jake Poznanski
83bb1dcd3b Dockerfile fixes 2024-11-13 12:59:52 -08:00
Jake Poznanski
6c9c785130 Using version strings 2024-11-13 12:35:40 -08:00
Jake Poznanski
9610eac4f0 Secrets management 2024-11-13 11:26:46 -08:00
Jake Poznanski
39256c19bb Beaker running 2024-11-13 10:25:35 -08:00
Jake Poznanski
867e2c9a36 Docker builds 2024-11-13 09:46:08 -08:00
Jake Poznanski
a091412079 Starting to play with docker too 2024-11-13 09:35:34 -08:00
Jake Poznanski
bce85e669a pipeline 2024-11-13 08:00:14 -08:00
Jake Poznanski
a085e8c7b3 Beaker test 2024-11-12 15:56:51 -08:00
Jake Poznanski
910c2ebcfc Downloads from s3 based on hash 2024-11-12 15:18:04 -08:00
Jake Poznanski
6598e2dc45 Control http session at the worker level 2024-11-12 13:54:45 -08:00
Jake Poznanski
fbacdd06e0 Stuff 2024-11-12 13:44:20 -08:00
Jake Poznanski
ae9b1c405d Better stats 2024-11-12 13:28:39 -08:00
Jake Poznanski
9ce28c0504 Measuring metrics better now 2024-11-12 12:56:35 -08:00
Jake Poznanski
193e5214d1 Semaphore timeout 2024-11-12 11:53:29 -08:00
Jake Poznanski
102c0e4cfc new version of sglang, server restarts, semaphore timeouts 2024-11-12 10:49:13 -08:00
Jake Poznanski
918e2f3542 Pipeline stuff 2024-11-12 09:33:53 -08:00
Jake Poznanski
691cc5a13c A few items 2024-11-12 08:34:25 -08:00
Jake Poznanski
4f2f4fda7d Quicker results by limiting workers via semaphore while still utilizing GPU 2024-11-12 08:18:22 -08:00
Jake Poznanski
615409568d Logging and perf stuff 2024-11-11 15:35:18 -08:00
Jake Poznanski
ade3580eaf Fixes 2024-11-11 14:38:26 -08:00
Jake Poznanski
732300ab4d Some errors dealt with 2024-11-11 14:26:15 -08:00
Jake Poznanski
24a9d23b00 Trying to get reliability up 2024-11-11 13:54:04 -08:00
Jake Poznanski
fedda40466 Small fixes 2024-11-11 13:31:14 -08:00
Jake Poznanski
a9a94f2950 Code to get stats 2024-11-11 13:09:09 -08:00
Jake Poznanski
6b625b2a7f Bugfixes 2024-11-11 11:58:45 -08:00
Jake Poznanski
9fb464c654 Refactoring to assemble docs 2024-11-11 11:46:49 -08:00
Jake Poznanski
da1b23fc47 Minor fixes 2024-11-11 10:24:47 -08:00
Jake Poznanski
9ff107b7b5 Merge branch 'main' of https://github.com/allenai/pdelfin into main 2024-11-08 15:25:18 -08:00
Jake Poznanski
60563d6e9a Merge branch 'main' of https://github.com/allenai/pdelfin 2024-11-08 23:15:04 +00:00
Jake Poznanski
71252a87ec Debug statements for pipeline 2024-11-08 23:14:44 +00:00
Jake Poznanski
299819e313 Reqs 2024-11-08 15:02:40 -08:00
Jake Poznanski
9d5193538c some cleanups 2024-11-08 11:38:56 -08:00
Jake Poznanski
65901644e1 Starting to work 2024-11-08 11:04:58 -08:00
Jake Poznanski
82ec24910f Progress 2024-11-08 10:36:09 -08:00
Jake Poznanski
37dc412335 Working on script 2024-11-08 10:19:00 -08:00
Jake Poznanski
e5fb7c0020 Organization 2024-11-08 09:59:27 -08:00
Jake Poznanski
ee72b3601e Starting up server and workers async now 2024-11-08 09:14:00 -08:00
Jake Poznanski
a39350e074 Reworking to be async 2024-11-08 08:14:20 -08:00
Jake Poznanski
a103ce730f Some small things 2024-11-07 23:24:01 +00:00