Add optimization_level to TTConfig for vLLM compile #2663

odjuricicTT · 2025-12-23T15:02:44Z

Ticket

N/A

Problem description

The vLLM plugin needed support for configuring the tt-mlir optimization level through TTConfig, similar to how enable_const_eval is propagated.

What's changed

Added optimization_level field to TTConfig dataclass with default value of 0
Propagated optimization_level through get_pjrt_compile_config() to torch_xla.set_custom_compile_options()
Set optimization_level = 1 for Qwen models in the batched inference pooling test

Checklist

New/Existing tests provide coverage for changes

codecov-commenter · 2025-12-23T15:16:44Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 28.51%. Comparing base (e3d9541) to head (92492fb).

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #2663   +/-   ##
=======================================
  Coverage   28.51%   28.51%           
=======================================
  Files          31       31           
  Lines        4075     4075           
=======================================
  Hits         1162     1162           
  Misses       2913     2913

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

tests/integrations/vllm_plugin/pooling/test_batched_inference.py

- Add optimization_level field to TTConfig with default value 0 - Propagate optimization_level through get_pjrt_compile_config() - Set optimization_level = 1 for Qwen models in batched inference test

odjuricicTT requested review from AleksKnezevic, acicovicTT, ajakovljevicTT, jameszianxuTT, kmabeeTT, ljovanovicTT, mmanzoorTT, mrakitaTT, ndrakulicTT, sdjukicTT, sgligorijevicTT and vzeljkovicTT as code owners December 23, 2025 15:02

odjuricicTT force-pushed the odjuricic/vllm-optimization-level branch from f81d5b2 to b8ec277 Compare December 23, 2025 15:06

odjuricicTT requested review from acolicTT, nvukobratTT and pilkicTT as code owners December 23, 2025 15:06

odjuricicTT force-pushed the odjuricic/vllm-optimization-level branch 2 times, most recently from c9eac9c to 4043be2 Compare December 23, 2025 15:45

mmanzoorTT approved these changes Dec 23, 2025

View reviewed changes

tests/integrations/vllm_plugin/pooling/test_batched_inference.py Outdated Show resolved Hide resolved

Add optimization_level to TTConfig for vLLM compile

af5f45b

- Add optimization_level field to TTConfig with default value 0 - Propagate optimization_level through get_pjrt_compile_config() - Set optimization_level = 1 for Qwen models in batched inference test

odjuricicTT force-pushed the odjuricic/vllm-optimization-level branch from 4043be2 to af5f45b Compare December 23, 2025 15:52

Merge branch 'main' into odjuricic/vllm-optimization-level

3f5bdfd

odjuricicTT enabled auto-merge (squash) December 26, 2025 15:45

mmanzoorTT approved these changes Dec 29, 2025

View reviewed changes

Merge branch 'main' into odjuricic/vllm-optimization-level

92492fb

odjuricicTT merged commit efaac38 into main Jan 15, 2026
48 checks passed

odjuricicTT deleted the odjuricic/vllm-optimization-level branch January 15, 2026 10:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add optimization_level to TTConfig for vLLM compile #2663

Add optimization_level to TTConfig for vLLM compile #2663

Uh oh!

odjuricicTT commented Dec 23, 2025

Uh oh!

codecov-commenter commented Dec 23, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Add optimization_level to TTConfig for vLLM compile #2663

Add optimization_level to TTConfig for vLLM compile #2663

Uh oh!

Conversation

odjuricicTT commented Dec 23, 2025

Ticket

Problem description

What's changed

Checklist

Uh oh!

codecov-commenter commented Dec 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

codecov-commenter commented Dec 23, 2025 •

edited

Loading