Skip to content

[CI][Benchmark] Optimize performance benchmark workflow #1039

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 31 commits into from
Jun 3, 2025

Conversation

Potabk
Copy link
Contributor

@Potabk Potabk commented May 31, 2025

What this PR does / why we need it?

This is a post patch of #1014, for some convenience optimization

  • Set cached dataset path for speed
  • Use pypi to install escli-tool
  • Add benchmark results convert script to have a developer-friendly result
  • Patch the benchmark_dataset.py to disable streaming load for internet
  • Add more trigger ways for different purpose, pr for debug, schedule for daily test, dispatch and pr-labled for manual testing of a single(current) commit
  • Disable latency test for qwen-2.5-vl, (This script does not support multi-modal yet)

Does this PR introduce any user-facing change?

No

How was this patch tested?

CI passed

@Potabk Potabk changed the title [CI][Benchmark] Set cached dataset path for speed [CI][Benchmark][WIP] Set cached dataset path for speed May 31, 2025
@Potabk Potabk force-pushed the dev-bench branch 2 times, most recently from 78e396f to 2e124d4 Compare June 1, 2025 15:46
@github-actions github-actions bot added the documentation Improvements or additions to documentation label Jun 1, 2025
Potabk added 25 commits June 3, 2025 15:59
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
@Potabk Potabk changed the title [CI][Benchmark][WIP] Set cached dataset path for speed [CI][Benchmark] Set cached dataset path for speed Jun 3, 2025
@Potabk Potabk changed the title [CI][Benchmark] Set cached dataset path for speed [CI][Benchmark] Optimize performance benchmark workflow Jun 3, 2025
Signed-off-by: wangli <[email protected]>
Potabk added 4 commits June 3, 2025 18:52
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: wangli <[email protected]>
@@ -64,6 +64,7 @@ jobs:
env:
HF_ENDPOINT: https://hf-mirror.com
HF_TOKEN: ${{ secrets.HF_TOKEN }}
HF_HOME: /github/home/.cache/huggingface
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why we need specified explictly?

@Yikun Yikun added performance-test enable performance test for PR ready-for-test start test by label for PR and removed documentation Improvements or additions to documentation performance-test enable performance test for PR ready-for-test start test by label for PR labels Jun 3, 2025
@Yikun Yikun merged commit 76dacf3 into vllm-project:main Jun 3, 2025
15 of 16 checks passed
momo609 pushed a commit to momo609/vllm-ascend that referenced this pull request Jun 4, 2025
…#1039)

### What this PR does / why we need it?

This is a post patch of vllm-project#1014, for some convenience optimization
- Set cached dataset path for speed
- Use pypi to install escli-tool
- Add benchmark results convert script to have a developer-friendly
result
- Patch the `benchmark_dataset.py` to disable streaming load for
internet
- Add more trigger ways for different purpose, `pr` for debug,
`schedule` for daily test, `dispatch` and `pr-labled` for manual testing
of a single(current) commit
- Disable latency test for `qwen-2.5-vl`, (This script does not support
multi-modal yet)

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
CI passed

---------

Signed-off-by: wangli <[email protected]>
Signed-off-by: wangxiaoxin (A) <[email protected]>
momo609 pushed a commit to momo609/vllm-ascend that referenced this pull request Jun 4, 2025
…#1039)

This is a post patch of vllm-project#1014, for some convenience optimization
- Set cached dataset path for speed
- Use pypi to install escli-tool
- Add benchmark results convert script to have a developer-friendly
result
- Patch the `benchmark_dataset.py` to disable streaming load for
internet
- Add more trigger ways for different purpose, `pr` for debug,
`schedule` for daily test, `dispatch` and `pr-labled` for manual testing
of a single(current) commit
- Disable latency test for `qwen-2.5-vl`, (This script does not support
multi-modal yet)

No

CI passed

---------

Signed-off-by: wangli <[email protected]>
Signed-off-by: wangxiaoxin (A) <[email protected]>
momo609 pushed a commit to momo609/vllm-ascend that referenced this pull request Jun 4, 2025
…#1039)

This is a post patch of vllm-project#1014, for some convenience optimization
- Set cached dataset path for speed
- Use pypi to install escli-tool
- Add benchmark results convert script to have a developer-friendly
result
- Patch the `benchmark_dataset.py` to disable streaming load for
internet
- Add more trigger ways for different purpose, `pr` for debug,
`schedule` for daily test, `dispatch` and `pr-labled` for manual testing
of a single(current) commit
- Disable latency test for `qwen-2.5-vl`, (This script does not support
multi-modal yet)

No

CI passed

---------

Signed-off-by: wangli <[email protected]>
Signed-off-by: wangxiaoxin (A) <[email protected]>
momo609 pushed a commit to momo609/vllm-ascend that referenced this pull request Jun 4, 2025
…#1039)

This is a post patch of vllm-project#1014, for some convenience optimization
- Set cached dataset path for speed
- Use pypi to install escli-tool
- Add benchmark results convert script to have a developer-friendly
result
- Patch the `benchmark_dataset.py` to disable streaming load for
internet
- Add more trigger ways for different purpose, `pr` for debug,
`schedule` for daily test, `dispatch` and `pr-labled` for manual testing
of a single(current) commit
- Disable latency test for `qwen-2.5-vl`, (This script does not support
multi-modal yet)

No

CI passed

---------

Signed-off-by: wangli <[email protected]>
Signed-off-by: wangxiaoxin (A) <[email protected]>
momo609 pushed a commit to momo609/vllm-ascend that referenced this pull request Jun 5, 2025
…#1039)

This is a post patch of vllm-project#1014, for some convenience optimization
- Set cached dataset path for speed
- Use pypi to install escli-tool
- Add benchmark results convert script to have a developer-friendly
result
- Patch the `benchmark_dataset.py` to disable streaming load for
internet
- Add more trigger ways for different purpose, `pr` for debug,
`schedule` for daily test, `dispatch` and `pr-labled` for manual testing
of a single(current) commit
- Disable latency test for `qwen-2.5-vl`, (This script does not support
multi-modal yet)

No

CI passed

---------

Signed-off-by: wangli <[email protected]>
Signed-off-by: wangxiaoxin (A) <[email protected]>
momo609 pushed a commit to momo609/vllm-ascend that referenced this pull request Jun 5, 2025
…#1039)

This is a post patch of vllm-project#1014, for some convenience optimization
- Set cached dataset path for speed
- Use pypi to install escli-tool
- Add benchmark results convert script to have a developer-friendly
result
- Patch the `benchmark_dataset.py` to disable streaming load for
internet
- Add more trigger ways for different purpose, `pr` for debug,
`schedule` for daily test, `dispatch` and `pr-labled` for manual testing
of a single(current) commit
- Disable latency test for `qwen-2.5-vl`, (This script does not support
multi-modal yet)

No

CI passed

---------

Signed-off-by: wangli <[email protected]>
Signed-off-by: wangxiaoxin (A) <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ready-for-test start test by label for PR
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants