Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Bugfix: Prevent reasoning_content leak bug Something isn't working frontend tool-calling
#32997 opened Jan 24, 2026 by RohanDisa Loading…
Feature/silu block quant fusion v1 ci/build performance Performance-related issues
#32996 opened Jan 24, 2026 by Monishver11 Draft
[docs] Update governance process links documentation Improvements or additions to documentation
#32995 opened Jan 24, 2026 by esmeetu Loading…
5 tasks
[Bugfix][Core] Fix use audio in video bug bug Something isn't working qwen Related to Qwen models v1
#32994 opened Jan 24, 2026 by xsank Loading…
[BugFix] Fix CPU Offloading Bug with UVA bug Something isn't working nvidia
#32991 opened Jan 24, 2026 by wzhao18 Loading…
5 tasks
Indicate compile mode in the benchmark results performance Performance-related issues
#32990 opened Jan 24, 2026 by huydhn Loading…
[Docs] Update README with uv recommendation and Python version requirements documentation Improvements or additions to documentation
#32987 opened Jan 23, 2026 by sjhddh Loading…
[Perf] Cache exc.errors() result in validation exception handler frontend ready ONLY add when PR is ready to merge/full CI is needed
#32984 opened Jan 23, 2026 by sjhddh Loading…
[Docs] Sync quantization list between README files documentation Improvements or additions to documentation
#32978 opened Jan 23, 2026 by sjhddh Loading…
[Docs] Fix Apple silicon include path in CPU installation docs cpu Related to CPU backends documentation Improvements or additions to documentation
#32977 opened Jan 23, 2026 by sjhddh Loading…
[AMD][Kernel][BugFix] Use correct scale in concat_and_cache_ds_mla_kernel when on gfx942 bug Something isn't working ready ONLY add when PR is ready to merge/full CI is needed rocm Related to AMD ROCm
#32976 opened Jan 23, 2026 by rasmith Loading…
2 tasks
[Perf] Optimize detokenizer python logic ready ONLY add when PR is ready to merge/full CI is needed v1
#32975 opened Jan 23, 2026 by yewentao256 Loading…
[Bugfix][VLM] Fix transformers backend embed_multimodal for Qwen2.5-VL profiling bug Something isn't working qwen Related to Qwen models
#32969 opened Jan 23, 2026 by AndreasKaratzas Loading…
[Misc] Add run one batch script that supports profiling documentation Improvements or additions to documentation
#32968 opened Jan 23, 2026 by LucasWilkinson Loading…
Add TUI Monitor: Real-time Terminal Dashboard for vLLM Metrics documentation Improvements or additions to documentation
#32966 opened Jan 23, 2026 by sjhddh Loading…
[Kernel] [Helion] Helion kernel wrapper
#32964 opened Jan 23, 2026 by gmagogsfm Loading…
ProTip! Adding no:label will show everything without a label.