Release Repo SoCC2025 by elvingerpaul · Pull Request #16 · eth-easl/vllm_profile

elvingerpaul · 2025-11-19T23:01:56Z

socc-25 release

* add scripts for gemm shared memory interference * split up shared_mem into llm and gemm subrepo * intra_sm/shared_mem/gemm v1

* membw to use main.py instead of gcontext_test.py * fix bugs related to num_requests and num_threads_per_tb * inter_sm/mem_bw verified * remove gcontext_test.py, new universal entrypoint is main.py * fix inter_sm/l2 to use L2Kernel * fix, missing set_percentage arg * fix num_warmup vs num_request * minor adjustment README ipc

github-actions · 2025-11-19T23:02:05Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

elvingerpaul and others added 23 commits September 28, 2025 17:30

intra_sm/ipc v1 release

5ea8f3c

intra_sm/ipc v2 release

97a2594

intra_sm/shared_mem v1 release

c90c6ae

intra_sm/shared_mem v2 release

11be1ae

intra_sm/ipc v3 release

48bcdac

intra_sm/tb_scheduler v1 release

b740d44

clean up duplicated files

b19e634

inter_sm/l2_cache v1

2f4cf95

inter_sm/mem_bw

8ba49ac

delete unused files in inter_sm

3064a8f

cleanup interference kernels

78a1d76

improve logs

e0e8a63

verified intra_sm/ipc scripts

c5dcffc

verified inter_sm/shared_mem

6b015e6

specify GPU type for IPC experiment

ce874df

verified intra_sm/tb_scheduler

3366a3c

update README

711ced7

inter_sm/l2_cache v2

64ac8ba

bug fix: use cuda.synchronize in case there is no interfernece kernel

123bd45

README v1

c96c83e

bug fixes

0f64fbd

Shared Memory GEMM (#14)

e73032e

* add scripts for gemm shared memory interference * split up shared_mem into llm and gemm subrepo * intra_sm/shared_mem/gemm v1

elvingerpaul added 6 commits November 20, 2025 21:11

add requirements.txt file

94110f3

remove custom profiling folder

c0048bb

main README

25d5618

vllm/interference

eb70a98

inter_sm/l2_cache final

ec5660a

inter_sm/mem_bw final

d528887

elvingerpaul added 4 commits November 20, 2025 22:24

intra_sm/ipc final

55b92ef

intra_sm shared mem gemm

4be3bfa

intra-sm shared mem llm

f3cf237

intra sm tb scheduler final

e63edfb

elvingerpaul merged commit 1b3f975 into main Nov 20, 2025
0 of 3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Release Repo SoCC2025#16

Release Repo SoCC2025#16
elvingerpaul merged 33 commits intomainfrom
release-repo-socc25

elvingerpaul commented Nov 19, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Nov 19, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

elvingerpaul commented Nov 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Nov 19, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

elvingerpaul commented Nov 19, 2025 •

edited

Loading