Pull requests: vllm-project/vllm
Pull requests list
Bugfix: Prevent reasoning_content leak
bug
Something isn't working
frontend
tool-calling
#32997
opened Jan 24, 2026 by
RohanDisa
Feature/silu block quant fusion v1
ci/build
performance
Performance-related issues
#32996
opened Jan 24, 2026 by
Monishver11
Draft
[docs] Update governance process links
documentation
Improvements or additions to documentation
#32995
opened Jan 24, 2026 by
esmeetu
5 tasks
[Feature] Support CPU Offloading without Pytorch Pinned Memory that leads to doubled allocation
nvidia
#32993
opened Jan 24, 2026 by
wzhao18
5 tasks
[BugFix] Fix CPU Offloading Bug with UVA
bug
Something isn't working
nvidia
#32991
opened Jan 24, 2026 by
wzhao18
5 tasks
Indicate compile mode in the benchmark results
performance
Performance-related issues
#32990
opened Jan 24, 2026 by
huydhn
[Misc]Consolidate RoPE-related parsing into ModelArchitectureConfig
#32989
opened Jan 24, 2026 by
charlotte12l
Draft
5 tasks
[Docs] Update README with uv recommendation and Python version requirements
documentation
Improvements or additions to documentation
#32987
opened Jan 23, 2026 by
sjhddh
[Tests] Replace flaky sleep with polling in test_background_cancel
v1
#32986
opened Jan 23, 2026 by
sjhddh
[Fix] Include list index in multimodal validation error messages
frontend
#32985
opened Jan 23, 2026 by
sjhddh
[Perf] Cache exc.errors() result in validation exception handler
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
#32984
opened Jan 23, 2026 by
sjhddh
[CI] Add pip and pre-commit caching to pre-commit workflow
ci/build
#32980
opened Jan 23, 2026 by
sjhddh
[CI] Add pip caching to cleanup_pr_body workflow
ci/build
#32979
opened Jan 23, 2026 by
sjhddh
[Docs] Sync quantization list between README files
documentation
Improvements or additions to documentation
#32978
opened Jan 23, 2026 by
sjhddh
[Docs] Fix Apple silicon include path in CPU installation docs
cpu
Related to CPU backends
documentation
Improvements or additions to documentation
#32977
opened Jan 23, 2026 by
sjhddh
[Perf] Optimize detokenizer python logic
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#32975
opened Jan 23, 2026 by
yewentao256
[Attention][WIP] FA4 integration
ci/build
nvidia
v1
#32974
opened Jan 23, 2026 by
LucasWilkinson
[Perf] Overlap workspace_buffer.fill_(0) with compute in MLA attention
#32973
opened Jan 23, 2026 by
robertgshaw2-redhat
5 tasks
[Bugfix][VLM] Fix transformers backend embed_multimodal for Qwen2.5-VL profiling
bug
Something isn't working
qwen
Related to Qwen models
#32969
opened Jan 23, 2026 by
AndreasKaratzas
[Misc] Add run one batch script that supports profiling
documentation
Improvements or additions to documentation
#32968
opened Jan 23, 2026 by
LucasWilkinson
[Frontend] Use init_app_state and FrontendArgs from api_server in run_batch
frontend
#32967
opened Jan 23, 2026 by
pooyadavoodi
5 tasks
Add TUI Monitor: Real-time Terminal Dashboard for vLLM Metrics
documentation
Improvements or additions to documentation
#32966
opened Jan 23, 2026 by
sjhddh