[CI][Benchmark] Optimize performance benchmark workflow #1039

Potabk · 2025-05-31T09:43:51Z

What this PR does / why we need it?

This is a post patch of #1014, for some convenience optimization

Set cached dataset path for speed
Use pypi to install escli-tool
Add benchmark results convert script to have a developer-friendly result
Patch the benchmark_dataset.py to disable streaming load for internet
Add more trigger ways for different purpose, pr for debug, schedule for daily test, dispatch and pr-labled for manual testing of a single(current) commit
Disable latency test for qwen-2.5-vl, (This script does not support multi-modal yet)

Does this PR introduce any user-facing change?

No

How was this patch tested?

CI passed

Signed-off-by: wangli <[email protected]>

.github/workflows/nightly_benchmarks.yaml

benchmarks/scripts/convert_json_to_markdown.py

vllm_ascend/worker/model_runner_v1.py

demo.py

benchmarks/scripts/patch_benchmark_dataset.py

benchmarks/tests/latency-tests.json

Signed-off-by: wangli <[email protected]>

Yikun · 2025-06-03T15:25:09Z

.github/workflows/nightly_benchmarks.yaml

@@ -64,6 +64,7 @@ jobs:
      env:
        HF_ENDPOINT: https://hf-mirror.com
        HF_TOKEN: ${{ secrets.HF_TOKEN }}
+        HF_HOME: /github/home/.cache/huggingface


Why we need specified explictly?

.github/workflows/nightly_benchmarks.yaml

Signed-off-by: Yikun Jiang <[email protected]>

…#1039) ### What this PR does / why we need it? This is a post patch of vllm-project#1014, for some convenience optimization - Set cached dataset path for speed - Use pypi to install escli-tool - Add benchmark results convert script to have a developer-friendly result - Patch the `benchmark_dataset.py` to disable streaming load for internet - Add more trigger ways for different purpose, `pr` for debug, `schedule` for daily test, `dispatch` and `pr-labled` for manual testing of a single(current) commit - Disable latency test for `qwen-2.5-vl`, (This script does not support multi-modal yet) ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? CI passed --------- Signed-off-by: wangli <[email protected]> Signed-off-by: wangxiaoxin (A) <[email protected]>

…#1039) This is a post patch of vllm-project#1014, for some convenience optimization - Set cached dataset path for speed - Use pypi to install escli-tool - Add benchmark results convert script to have a developer-friendly result - Patch the `benchmark_dataset.py` to disable streaming load for internet - Add more trigger ways for different purpose, `pr` for debug, `schedule` for daily test, `dispatch` and `pr-labled` for manual testing of a single(current) commit - Disable latency test for `qwen-2.5-vl`, (This script does not support multi-modal yet) No CI passed --------- Signed-off-by: wangli <[email protected]> Signed-off-by: wangxiaoxin (A) <[email protected]>

Potabk changed the title ~~[CI][Benchmark] Set cached dataset path for speed~~ [CI][Benchmark][WIP] Set cached dataset path for speed May 31, 2025

Potabk force-pushed the dev-bench branch 2 times, most recently from 78e396f to 2e124d4 Compare June 1, 2025 15:46

github-actions bot added the documentation Improvements or additions to documentation label Jun 1, 2025

Potabk added 25 commits June 3, 2025 15:59

cache dataset for speed

474d08f

Signed-off-by: wangli <[email protected]>

pr trigger for test

aa92f4e

Signed-off-by: wangli <[email protected]>

fix dataset path

b740cce

Signed-off-by: wangli <[email protected]>

rename job

dee9f54

Signed-off-by: wangli <[email protected]>

add hf_home

47ee5ad

Signed-off-by: wangli <[email protected]>

fix

00eda3c

Signed-off-by: wangli <[email protected]>

fix curl

579ef0c

Signed-off-by: wangli <[email protected]>

fake testing

bdf87e5

Signed-off-by: wangli <[email protected]>

test

94cc7fa

Signed-off-by: wangli <[email protected]>

add benchmark patch

35e93e3

Signed-off-by: wangli <[email protected]>

add patch

546d383

Signed-off-by: wangli <[email protected]>

fix path

5979e30

Signed-off-by: wangli <[email protected]>

fix format

19de361

Signed-off-by: wangli <[email protected]>

fix

53bb071

Signed-off-by: wangli <[email protected]>

fix isort

edf44ac

Signed-off-by: wangli <[email protected]>

fix bug

df3b385

Signed-off-by: wangli <[email protected]>

testing

efc0c1b

Signed-off-by: wangli <[email protected]>

test

c143119

Signed-off-by: wangli <[email protected]>

test

7890585

Signed-off-by: wangli <[email protected]>

testing

acc5c3f

Signed-off-by: wangli <[email protected]>

fix dataset path

23bb2e5

Signed-off-by: wangli <[email protected]>

fix convert name

4751457

Signed-off-by: wangli <[email protected]>

add convert script

3f77fa4

Signed-off-by: wangli <[email protected]>

add step summary

d753de0

Signed-off-by: wangli <[email protected]>

use pypi install escli

8b857b9

Signed-off-by: wangli <[email protected]>

Potabk force-pushed the dev-bench branch from 09476fb to 8b857b9 Compare June 3, 2025 07:59

Potabk changed the title ~~[CI][Benchmark][WIP] Set cached dataset path for speed~~ [CI][Benchmark] Set cached dataset path for speed Jun 3, 2025

Potabk changed the title ~~[CI][Benchmark] Set cached dataset path for speed~~ [CI][Benchmark] Optimize performance benchmark workflow Jun 3, 2025

fix path

19543b2

Signed-off-by: wangli <[email protected]>

Yikun reviewed Jun 3, 2025

View reviewed changes

Potabk added 4 commits June 3, 2025 18:52

remove redundant files

6a95c81

Signed-off-by: wangli <[email protected]>

fix yapf

34e0275

Signed-off-by: wangli <[email protected]>

fix

a765910

Signed-off-by: wangli <[email protected]>

fix

4de2996

Signed-off-by: wangli <[email protected]>

wangxiyuan approved these changes Jun 3, 2025

View reviewed changes

Yikun reviewed Jun 3, 2025

View reviewed changes

.github/workflows/nightly_benchmarks.yaml Outdated Show resolved Hide resolved

Apply suggestions from code review

7f5bdb2

Signed-off-by: Yikun Jiang <[email protected]>

Yikun added performance-test enable performance test for PR ready-for-test start test by label for PR and removed documentation Improvements or additions to documentation performance-test enable performance test for PR ready-for-test start test by label for PR labels Jun 3, 2025

Yikun merged commit 76dacf3 into vllm-project:main Jun 3, 2025
15 of 16 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[CI][Benchmark] Optimize performance benchmark workflow #1039

[CI][Benchmark] Optimize performance benchmark workflow #1039

Uh oh!

Potabk commented May 31, 2025 •

edited by Yikun

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Yikun Jun 3, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

[CI][Benchmark] Optimize performance benchmark workflow #1039

[CI][Benchmark] Optimize performance benchmark workflow #1039

Uh oh!

Conversation

Potabk commented May 31, 2025 • edited by Yikun Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Yikun Jun 3, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Potabk commented May 31, 2025 •

edited by Yikun

Loading