1524 Commits

Author SHA1 Message Date
Jake Poznanski
e12941a608 Version bump 2025-10-06 21:46:10 +00:00
Jake Poznanski
7fe756fe63 Formatting 2025-10-06 21:10:32 +00:00
Jake Poznanski
9c7c670f1f Bump version to v0.3.7 for release v0.3.7 2025-10-06 21:10:07 +00:00
Jake Poznanski
1951a849ec Version bump with new vllm 2025-10-06 21:10:00 +00:00
Jake Poznanski
c75f5b98a1 Cleaning up pr 341 arguments to match with vllm 0.11, which only has V1 engine and thus always does chunked prefill. And fixes arg syntax 2025-10-06 20:26:41 +00:00
Jake Poznanski
e202c22822 Merge branch 'vllm_0_11' 2025-10-06 20:24:26 +00:00
Jake Poznanski
2b70b50312
Merge pull request #341 from charitarthchugh/charitarthchugh/vllm-defaults-speedup
Add chunked prefill and limit mm per prompt options
2025-10-06 13:23:47 -07:00
Jake Poznanski
81be6f5c1f Transformers version 2025-10-06 19:52:55 +00:00
Jake Poznanski
9b517a02be Git lfs in docker image 2025-10-06 19:47:19 +00:00
Jake Poznanski
9feb41af82 New docker file approach for vllm 0.11 2025-10-06 18:57:16 +00:00
Jake Poznanski
f4356de091 deepinfra readme improved 2025-09-29 17:56:03 +00:00
Jake Poznanski
8982bae756 Bump version to v0.3.6 for release v0.3.6 2025-09-29 17:37:25 +00:00
Jake Poznanski
fb1ef9e38a Release script fix 2025-09-29 17:37:14 +00:00
Jake Poznanski
c587eb9050 Ugh, release script adds all files by default 2025-09-29 17:36:41 +00:00
Jake Poznanski
a0bc5a4690 Deepinfra readme 2025-09-29 17:29:28 +00:00
Jake Poznanski
0c6d889863 Adding retry code on 429 errors from exteranl providers 2025-09-29 17:26:22 +00:00
Jake Poznanski
9c750903ce Ignore files 2025-09-29 17:06:14 +00:00
Jake Poznanski
f0caa188ab
Merge pull request #344 from allenai/amanr/deepinfra
DeepInfra Support
2025-09-29 10:03:29 -07:00
aman-17
f3c4073395 added Api_key argument to pipeline pytests 2025-09-26 14:25:25 -07:00
aman-17
359abef654 updated pytests 2025-09-26 14:19:22 -07:00
aman-17
556ff26d58 fixed lint, style, ruff 2025-09-26 14:08:40 -07:00
aman-17
e7ae5e6240 fixed style 2025-09-26 13:58:34 -07:00
aman-17
90589e16de Added deepinfra usage to readme 2025-09-26 13:56:34 -07:00
aman-17
2a5792e5ed add if else for vllm local usage bug for api argument 2025-09-26 13:29:48 -07:00
aman-17
7fe3f65de7 added support for deepinfra 2025-09-26 11:06:51 -07:00
Charitarth Chugh
fe425fde20
Add chunked prefill and limit mm per prompt options 2025-09-25 14:29:49 -04:00
Jake Poznanski
8f88a98e5d prepare checkpoint script fixes 2025-09-04 22:15:55 +00:00
Jake Poznanski
c720c02d83 Cleaning up repo a bit 2025-09-02 06:45:24 +00:00
Jake Poznanski
56b08d5aa4 Bump version to v0.3.4 for release v0.3.4 2025-08-31 03:12:39 +00:00
Jake Poznanski
f3cdc78b4f Pushing new version 2025-08-31 03:12:30 +00:00
Jake Poznanski
edd098093b Reverting version changes that broke, vllm 0.10.1 is not good 2025-08-27 18:55:26 +00:00
Jake Poznanski
27792664bf Transformers version bump needed also 2025-08-27 16:35:51 +00:00
Jake Poznanski
03c7479a17 VLLM version bump 2025-08-27 16:33:37 +00:00
Jake Poznanski
3eec58012c Docker ignore 2025-08-26 17:52:50 +00:00
Jake Poznanski
6be12c2e06 Baseline tests for blanks 2025-08-25 22:01:24 +00:00
Jake Poznanski
ad33672781 fix 2025-08-25 21:04:53 +00:00
Jake Poznanski
c7aa217281 Scripts to run benchmarks better 2025-08-25 20:12:10 +00:00
Jake Poznanski
59321af018
Merge pull request #319 from haydn-jones/main
External vLLM instance
2025-08-25 08:43:25 -07:00
Haydn Jones
2c63836648 Black and mock 2025-08-23 20:07:05 -04:00
Haydn Jones
261c722f56 Update README + arg name 2025-08-21 17:49:07 -04:00
Haydn Jones
b34c3611e1 oopsy woopsy 2025-08-20 19:22:48 -04:00
Haydn Jones
b8a2b92174 External vLLM 2025-08-20 19:21:38 -04:00
Jake Poznanski
4dbf951f45
Merge pull request #313 from tongliang11/patch-1
Fix typo in README.md
2025-08-19 10:40:40 -07:00
Tong Liang
cb4f23dc0c
Fix typo in README.md 2025-08-16 21:48:07 -04:00
Jake Poznanski
c492615355 Bump version to v0.3.3 for release v0.3.3 2025-08-15 19:45:17 +00:00
Jake Poznanski
cee12ccc9f New version 2025-08-15 19:45:07 +00:00
Jake Poznanski
76405b53db Lints 2025-08-15 19:44:47 +00:00
Jake Poznanski
69c33abfcc Trying to keep queue loaded more 2025-08-15 18:44:45 +00:00
Jake Poznanski
7c98673972 Pipeline fixes for OMP_NUM_THREADS 2025-08-15 18:30:00 +00:00
Jake Poznanski
b9238b8638 Fix for floaty amount 2025-08-14 22:27:26 +00:00