98 Commits

Author SHA1 Message Date
Jake Poznanski
af8ce518ac Merge branch 'main' of https://github.com/allenai/pdelfin into main 2024-11-21 08:45:19 -08:00
Jake Poznanski
9112d81bd1 No keep alive connection to try to resolve sglang livelock 2024-11-21 08:45:17 -08:00
Jake Poznanski
2443c22fde Projected output tokens 2024-11-20 23:57:10 +00:00
Jake Poznanski
67d11ec0e6 TODOs and client fix 2024-11-20 14:45:12 -08:00
Jake Poznanski
9b8d58b59e Better stats and metadata 2024-11-20 10:42:26 -08:00
Jake Poznanski
273a8b0d0a Logging fallback pages 2024-11-19 15:11:02 -08:00
Jake Poznanski
b0acfa870e Adding support for fallback pages 2024-11-19 14:59:20 -08:00
Jake Poznanski
204a4a8e5b Better stats 2024-11-19 13:41:32 -08:00
Jake Poznanski
3ef4609bdd Fixing args 2024-11-19 11:48:45 -08:00
Jake Poznanski
27d23525b7 Claude recommends httpx instead of aiohttp, seeing if that will help with straggler timeouts 2024-11-19 10:41:58 -08:00
Jake Poznanski
4469f4b2ce Version patch 2024-11-18 19:55:26 -08:00
Jake Poznanski
9e2e09bd06 More fixes 2024-11-18 15:04:50 -08:00
Jake Poznanski
8793fc7d99 Adding more retries, and it was able to process more complicated books 2024-11-18 14:25:32 -08:00
Jake Poznanski
2f55a3ddb7 fix 2024-11-18 13:58:25 -08:00
Jake Poznanski
d4d47369cb more gcs 2024-11-18 13:20:28 -08:00
Jake Poznanski
e48d4bef00 Fix 2024-11-18 13:16:19 -08:00
Jake Poznanski
8c3b5753c9 Gcs support better 2024-11-18 13:07:27 -08:00
Jake Poznanski
9381bf862a docs 2024-11-18 12:44:34 -08:00
Jake Poznanski
f287f2451c Fixing a few stats things 2024-11-18 11:50:22 -08:00
Jake Poznanski
e499413089 Better work queue 2024-11-18 11:04:51 -08:00
Jake Poznanski
995b1d15fc Fixes, mocking out queue into separate file 2024-11-18 09:55:45 -08:00
Jake Poznanski
fcabb8e55a Handling more error cases 2024-11-18 09:12:04 -08:00
Jake Poznanski
96984fcd77 Fix a reliability issue 2024-11-18 09:03:24 -08:00
Jake Poznanski
0af29f1f44 Adding page rotation 2024-11-18 08:29:32 -08:00
Jake Poznanski
e2303f28af Running on l40s, fixing queue 2024-11-18 08:25:36 -08:00
Jake Poznanski
68543d40ce Adding stats 2024-11-18 07:57:39 -08:00
Jake Poznanski
b4ca5636bc Decent set of todos for monday 2024-11-18 04:54:12 +00:00
Jake Poznanski
2f1664f3d7 Stop everything on a Nan 2024-11-16 08:16:11 -08:00
Jake Poznanski
eefb045859 Single cluster fix 2024-11-15 13:30:27 -08:00
Jake Poznanski
2e1d0b6a90 Fix 2024-11-15 13:21:55 -08:00
Jake Poznanski
748b095e0f Fix 2024-11-15 13:19:23 -08:00
Jake Poznanski
80ba562bb2 Fixing timeout situation 2024-11-15 13:18:13 -08:00
Jake Poznanski
2c52664301 Cleaner exit 2024-11-15 12:54:45 -08:00
Jake Poznanski
77c82fdb3a New version with aiohttp fixes 2024-11-15 12:48:36 -08:00
Jake Poznanski
ae1e4bc07e More realistic results 2024-11-15 11:35:10 -08:00
Jake Poznanski
770da2b7ae Docker 2024-11-15 11:25:24 -08:00
Jake Poznanski
bfe4211dcc Debugging timeout errors and other things 2024-11-15 11:23:38 -08:00
Jake Poznanski
278422b8ff Fixing one max context issue 2024-11-15 10:03:26 -08:00
Jake Poznanski
9a1e82f9d9 Logging 2024-11-14 14:47:19 -08:00
Jake Poznanski
fe0574c725 Cleanup code, s3 retries 2024-11-14 14:13:04 -08:00
Jake Poznanski
2c7686f8ff I think I have error handling better now 2024-11-14 13:44:54 -08:00
Jake Poznanski
8217e49153 Page calc 2024-11-14 13:38:58 -08:00
Jake Poznanski
4eab90f69b Fixing bugs 2024-11-14 13:13:27 -08:00
Jake Poznanski
b67d8e7555 Fixing work queue population 2024-11-14 12:48:46 -08:00
Jake Poznanski
827b77e8df Working on task groups 2024-11-14 12:06:13 -08:00
Jake Poznanski
a58efea133 better logging 2024-11-14 09:55:37 -08:00
Jake Poznanski
a9cf2e0272 Allow setting beaker priority 2024-11-14 09:10:28 -08:00
Jake Poznanski
41c8d552a0 exponential backoff 2024-11-14 09:02:49 -08:00
Jake Poznanski
4dcf9ed5d4 more fixes 2024-11-14 08:55:20 -08:00
Jake Poznanski
06331d740a Fix timeout 2024-11-14 08:50:33 -08:00