
Commit 20767a0

[CI/UT] Fix disaggregated prefill ci (#1313)
### What this PR does / why we need it?
Use eager mode to run the disaggregated prefill CI.

### Does this PR introduce _any_ user-facing change?
N/A

### How was this patch tested?
CI passed with the existing tests.

---------

Signed-off-by: MengqingCao <[email protected]>
1 parent 9cbce42 commit 20767a0
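
For context, `--enforce-eager` tells vLLM to skip graph capture/compilation and run the model in eager mode, which sidesteps graph-mode issues on the CI runners. A minimal sketch of a prefill-side launch using the flags touched by this patch; the model path, port, and KV-transfer JSON below are illustrative placeholders, not the values used by `setup_pd.sh`:

```bash
# Illustrative only: the flags mirror the ones in this patch, but the model
# path, port, and KV-transfer config are placeholders, not the CI's real values.
export KV_CONFIG='{"kv_connector": "ExampleConnector", "kv_role": "kv_producer"}'  # placeholder JSON

vllm serve /path/to/deepseek-model \
    --port 8100 \
    --served-model-name Deepseek \
    --max-model-len 2000 \
    --trust-remote-code \
    --enforce-eager \
    --kv-transfer-config "$KV_CONFIG"
```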

File tree

2 files changed: +7 -1 lines changed

.github/workflows/vllm_ascend_test_pd.yaml
tests/e2e/pd_disaggreate/setup_pd.sh

.github/workflows/vllm_ascend_test_pd.yaml

Lines changed: 5 additions & 1 deletion
@@ -41,7 +41,11 @@ jobs:
     if: ${{ contains(github.event.pull_request.labels.*.name, 'pd-test') && contains(github.event.pull_request.labels.*.name, 'ready-for-test') || github.event_name == 'schedule' }}
     strategy:
       matrix:
-        vllm_verison: [main, v0.9.1]
+        vllm_verison: [
+          # revert me when V1 disaggregation prefill is merged in main
+          # main,
+          v0.9.1
+        ]
     name: vLLM Ascend prefilling decoding disaggregation test
     runs-on: linux-arm64-npu-static-8
 

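With `main` commented out of the matrix, only the `v0.9.1` entry runs until V1 disaggregated prefill is merged into vLLM's main branch. Conceptually, the matrix value selects which vLLM ref the job builds against; a rough sketch of that idea follows, where the clone URL, install step, and variable wiring are assumptions rather than the workflow's actual steps:

```bash
# Hypothetical sketch -- not the real workflow steps.
# ${{ matrix.vllm_verison }} would expand to "v0.9.1" while "main" stays commented out.
VLLM_VERSION="v0.9.1"
git clone --depth 1 --branch "$VLLM_VERSION" https://github.com/vllm-project/vllm.git
pip install -e ./vllm   # build vLLM at the selected ref before running the PD test
```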
tests/e2e/pd_disaggreate/setup_pd.sh

Lines changed: 2 additions & 0 deletions
@@ -66,6 +66,7 @@ function run_prefill_instance() {
     --served-model-name Deepseek \
     --max-model-len 2000 \
     --trust-remote-code \
+    --enforce-eager \
     --kv-transfer-config "$KV_CONFIG"
 }
 
@@ -119,6 +120,7 @@ function run_decode_instance() {
     --max-num-batched-tokens 2000 \
     --trust-remote-code \
     --gpu-memory-utilization 0.9 \
+    --enforce-eager \
     --kv-transfer-config "$KV_CONFIG"
 }
 

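Once both instances come up with `--enforce-eager`, they expose vLLM's OpenAI-compatible HTTP API under the served model name `Deepseek`. A generic smoke-test request against one instance might look like the sketch below; the port and payload are placeholders, and the repository's real test wiring (proxy, ports, prompts) is not shown here:

```bash
# Generic smoke test against a served instance -- port and prompt are placeholders;
# this is not the request the repository's PD disaggregation test actually sends.
curl -s http://localhost:8100/v1/completions \
    -H "Content-Type: application/json" \
    -d '{
          "model": "Deepseek",
          "prompt": "Hello, world",
          "max_tokens": 16
        }'
```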