Jake Poznanski
|
e12941a608
|
Version bump
|
2025-10-06 21:46:10 +00:00 |
|
Jake Poznanski
|
7fe756fe63
|
Formatting
|
2025-10-06 21:10:32 +00:00 |
|
Jake Poznanski
|
9c7c670f1f
|
Bump version to v0.3.7 for release
v0.3.7
|
2025-10-06 21:10:07 +00:00 |
|
Jake Poznanski
|
1951a849ec
|
Version bump with new vllm
|
2025-10-06 21:10:00 +00:00 |
|
Jake Poznanski
|
c75f5b98a1
|
Cleaning up pr 341 arguments to match with vllm 0.11, which only has V1 engine and thus always does chunked prefill. And fixes arg syntax
|
2025-10-06 20:26:41 +00:00 |
|
Jake Poznanski
|
e202c22822
|
Merge branch 'vllm_0_11'
|
2025-10-06 20:24:26 +00:00 |
|
Jake Poznanski
|
2b70b50312
|
Merge pull request #341 from charitarthchugh/charitarthchugh/vllm-defaults-speedup
Add chunked prefill and limit mm per prompt options
|
2025-10-06 13:23:47 -07:00 |
|
Jake Poznanski
|
81be6f5c1f
|
Transformers version
|
2025-10-06 19:52:55 +00:00 |
|
Jake Poznanski
|
9b517a02be
|
Git lfs in docker image
|
2025-10-06 19:47:19 +00:00 |
|
Jake Poznanski
|
9feb41af82
|
New docker file approach for vllm 0.11
|
2025-10-06 18:57:16 +00:00 |
|
Jake Poznanski
|
f4356de091
|
deepinfra readme improved
|
2025-09-29 17:56:03 +00:00 |
|
Jake Poznanski
|
8982bae756
|
Bump version to v0.3.6 for release
v0.3.6
|
2025-09-29 17:37:25 +00:00 |
|
Jake Poznanski
|
fb1ef9e38a
|
Release script fix
|
2025-09-29 17:37:14 +00:00 |
|
Jake Poznanski
|
c587eb9050
|
Ugh, release script adds all files by default
|
2025-09-29 17:36:41 +00:00 |
|
Jake Poznanski
|
a0bc5a4690
|
Deepinfra readme
|
2025-09-29 17:29:28 +00:00 |
|
Jake Poznanski
|
0c6d889863
|
Adding retry code on 429 errors from exteranl providers
|
2025-09-29 17:26:22 +00:00 |
|
Jake Poznanski
|
9c750903ce
|
Ignore files
|
2025-09-29 17:06:14 +00:00 |
|
Jake Poznanski
|
f0caa188ab
|
Merge pull request #344 from allenai/amanr/deepinfra
DeepInfra Support
|
2025-09-29 10:03:29 -07:00 |
|
aman-17
|
f3c4073395
|
added Api_key argument to pipeline pytests
|
2025-09-26 14:25:25 -07:00 |
|
aman-17
|
359abef654
|
updated pytests
|
2025-09-26 14:19:22 -07:00 |
|
aman-17
|
556ff26d58
|
fixed lint, style, ruff
|
2025-09-26 14:08:40 -07:00 |
|
aman-17
|
e7ae5e6240
|
fixed style
|
2025-09-26 13:58:34 -07:00 |
|
aman-17
|
90589e16de
|
Added deepinfra usage to readme
|
2025-09-26 13:56:34 -07:00 |
|
aman-17
|
2a5792e5ed
|
add if else for vllm local usage bug for api argument
|
2025-09-26 13:29:48 -07:00 |
|
aman-17
|
7fe3f65de7
|
added support for deepinfra
|
2025-09-26 11:06:51 -07:00 |
|
Charitarth Chugh
|
fe425fde20
|
Add chunked prefill and limit mm per prompt options
|
2025-09-25 14:29:49 -04:00 |
|
Jake Poznanski
|
8f88a98e5d
|
prepare checkpoint script fixes
|
2025-09-04 22:15:55 +00:00 |
|
Jake Poznanski
|
c720c02d83
|
Cleaning up repo a bit
|
2025-09-02 06:45:24 +00:00 |
|
Jake Poznanski
|
56b08d5aa4
|
Bump version to v0.3.4 for release
v0.3.4
|
2025-08-31 03:12:39 +00:00 |
|
Jake Poznanski
|
f3cdc78b4f
|
Pushing new version
|
2025-08-31 03:12:30 +00:00 |
|
Jake Poznanski
|
edd098093b
|
Reverting version changes that broke, vllm 0.10.1 is not good
|
2025-08-27 18:55:26 +00:00 |
|
Jake Poznanski
|
27792664bf
|
Transformers version bump needed also
|
2025-08-27 16:35:51 +00:00 |
|
Jake Poznanski
|
03c7479a17
|
VLLM version bump
|
2025-08-27 16:33:37 +00:00 |
|
Jake Poznanski
|
3eec58012c
|
Docker ignore
|
2025-08-26 17:52:50 +00:00 |
|
Jake Poznanski
|
6be12c2e06
|
Baseline tests for blanks
|
2025-08-25 22:01:24 +00:00 |
|
Jake Poznanski
|
ad33672781
|
fix
|
2025-08-25 21:04:53 +00:00 |
|
Jake Poznanski
|
c7aa217281
|
Scripts to run benchmarks better
|
2025-08-25 20:12:10 +00:00 |
|
Jake Poznanski
|
59321af018
|
Merge pull request #319 from haydn-jones/main
External vLLM instance
|
2025-08-25 08:43:25 -07:00 |
|
Haydn Jones
|
2c63836648
|
Black and mock
|
2025-08-23 20:07:05 -04:00 |
|
Haydn Jones
|
261c722f56
|
Update README + arg name
|
2025-08-21 17:49:07 -04:00 |
|
Haydn Jones
|
b34c3611e1
|
oopsy woopsy
|
2025-08-20 19:22:48 -04:00 |
|
Haydn Jones
|
b8a2b92174
|
External vLLM
|
2025-08-20 19:21:38 -04:00 |
|
Jake Poznanski
|
4dbf951f45
|
Merge pull request #313 from tongliang11/patch-1
Fix typo in README.md
|
2025-08-19 10:40:40 -07:00 |
|
Tong Liang
|
cb4f23dc0c
|
Fix typo in README.md
|
2025-08-16 21:48:07 -04:00 |
|
Jake Poznanski
|
c492615355
|
Bump version to v0.3.3 for release
v0.3.3
|
2025-08-15 19:45:17 +00:00 |
|
Jake Poznanski
|
cee12ccc9f
|
New version
|
2025-08-15 19:45:07 +00:00 |
|
Jake Poznanski
|
76405b53db
|
Lints
|
2025-08-15 19:44:47 +00:00 |
|
Jake Poznanski
|
69c33abfcc
|
Trying to keep queue loaded more
|
2025-08-15 18:44:45 +00:00 |
|
Jake Poznanski
|
7c98673972
|
Pipeline fixes for OMP_NUM_THREADS
|
2025-08-15 18:30:00 +00:00 |
|
Jake Poznanski
|
b9238b8638
|
Fix for floaty amount
|
2025-08-14 22:27:26 +00:00 |
|