-
-
Notifications
You must be signed in to change notification settings - Fork 7.6k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Misc] improve web section group title display
documentation
Improvements or additions to documentation
#18684
opened May 25, 2025 by
reidliu41
Loading…
[CI/Build][Doc] Update Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
gte-Qwen2-1.5B-instruct
usage
documentation
#18683
opened May 25, 2025 by
DarkLight1337
Loading…
[CI/Build][Bugfix] Ensure compatibility with transformers 4.52
ci/build
multi-modality
Related to multi-modality (#4194)
[Doc] Move examples and further reorganize user guide
ci/build
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#18666
opened May 24, 2025 by
DarkLight1337
Loading…
[Doc] Convert Sphinx directives ( Related to multi-modality (#4194)
v1
{class}
, {meth}
, {attr}
, ...) to MkDocs format for better documentation linking
frontend
multi-modality
#18663
opened May 24, 2025 by
Zerohertz
Loading…
30 tasks done
[v1] Re-init input batch for multiple kv cache groups
tpu
Related to Google TPUs
v1
#18654
opened May 24, 2025 by
heheda12345
Loading…
[v1][KVCacheManager] Add a special KVCacheNullBlock class
v1
#18652
opened May 24, 2025 by
heheda12345
Loading…
[V1][Quantization] Add CUDA graph compatible v1 GGUF support
#18646
opened May 24, 2025 by
Isotr0py
Loading…
2 tasks done
[Misc] Fixed the abnormally high TTFT issue in the PD disaggregation example
documentation
Improvements or additions to documentation
#18644
opened May 24, 2025 by
zhaohaidao
Loading…
[Bugfix][Nixl] Fix full prefix cache hit bug
v1
#18632
opened May 23, 2025 by
robertgshaw2-redhat
Loading…
[Bugfix][Failing Test] Fix test_vllm_port.py
ci/build
needs-rebase
ready
ONLY add when PR is ready to merge/full CI is needed
#18618
opened May 23, 2025 by
rabi
Loading…
Fix links in multi-modal model contributing page
documentation
Improvements or additions to documentation
#18615
opened May 23, 2025 by
hmellor
Loading…
[V1][Sampler] Improve performance of FlashInfer sampling by sampling logits instead of probs
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#18608
opened May 23, 2025 by
lgeiger
Loading…
[Bugfix][ROCm] Fix ROCm FP8 Quantization Padding Issue
#18606
opened May 23, 2025 by
vllmellm
Loading…
[Hardware][AMD] integrate aiter into vll
needs-rebase
v1
#18596
opened May 23, 2025 by
Zzz9990
Loading…
FIX: NixlConnector: do not skip short do_remote_prefill requests
v1
#18590
opened May 23, 2025 by
juncgu
Loading…
[CUDA] Enable full cudagraph for FlashMLA
needs-rebase
v1
#18581
opened May 23, 2025 by
ProExpertProg
Loading…
Support datasets in
vllm bench serve
and sync with benchmark_[serving,datasets].py
#18566
opened May 22, 2025 by
mgoin
Loading…
feat(rocm-support): support mamba2 on rocm
ci/build
#18565
opened May 22, 2025 by
almersawi
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.