-
Notifications
You must be signed in to change notification settings - Fork 176
Pull requests: vllm-project/vllm-ascend
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Enable accuracy test for PR labeled with "*accuracy-test"
accuracy-test
enable all accuracy test for PR
ready-for-test
start test by label for PR
[Performance] [flash_communication_v1] DeepSeek communication optimization on A2 (reduce_scatter + all_gather)
#1034
opened May 30, 2025 by
underfituu
•
Draft
[Misc] Refactor additional_config
documentation
Improvements or additions to documentation
module:core
module:tests
#1029
opened May 30, 2025 by
wangxiyuan
Loading…
[Patch] Remove enable long term test for PR
ready-for-test
start test by label for PR
spec_decode.metrics
patch
long-term-test
#1016
opened May 29, 2025 by
shen-shanshan
Loading…
[ModelRunner]Add profile execute duration observation
documentation
Improvements or additions to documentation
module:core
#1013
opened May 29, 2025 by
depeng1994
Loading…
feat: support data parallel for deepseek
module:core
module:ops
module:quantization
module:tests
#1012
opened May 29, 2025 by
NeverRaR
Loading…
[Draft] support mooncake barebone connectorV1
module:core
module:ops
#1011
opened May 29, 2025 by
DreamerLeader
•
Draft
[ModelRunner][MultiModal] Automatically cast multi-modal input dtype
#1002
opened May 29, 2025 by
shen-shanshan
Loading…
Disable torchair view optimization | Support multistream of shared experts in FusedMoE
module:core
module:ops
#997
opened May 29, 2025 by
sdmyzlp
Loading…
[Bugfix][Worker] Clear NPU memory between test profiling
module:core
#989
opened May 28, 2025 by
shen-shanshan
Loading…
[Core][Kernel] add fix routing for performance test
module:core
module:ops
#987
opened May 28, 2025 by
hahazhky
Loading…
[BugFix] fix ep=1 etp=16
module:ops
module:quantization
#985
opened May 28, 2025 by
ttanzhiqiang
Loading…
[CI][Doctest] Add pip installation test
module:core
module:tests
#983
opened May 28, 2025 by
Potabk
Loading…
[perf] Improve Prefill Performance by Optimizing Alltoall Communication
module:core
module:ops
module:quantization
#978
opened May 27, 2025 by
SlightwindSec
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.