Allow usage of fused_block_softmax_adjustment for Qwen with Lazy #246

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Draft

mswiniarsk wants to merge 1 commit into main from dev/mswiniarski/qwen_lazy_fused_block_softmax_adj

Contributor

mswiniarsk commented Jun 27, 2025

Currently Qwen with torch.compile and fused_block_softmax_adjustment has accuracy issues (in progress to resolve), so I'm relaxing current condition to allow usage of the op in Lazy mode that is properly working.


          Allow usage of fused_block_softmax_adjustment for Qwen with Lazy

d49c238

Reviewers

kzawora-intel Awaiting requested review from kzawora-intel kzawora-intel will be requested when the pull request is marked ready for review kzawora-intel is a code owner

madamczyk-intel Awaiting requested review from madamczyk-intel madamczyk-intel will be requested when the pull request is marked ready for review madamczyk-intel is a code owner

michalkuligowski Awaiting requested review from michalkuligowski michalkuligowski will be requested when the pull request is marked ready for review michalkuligowski is a code owner

mgawarkiewicz-intel Awaiting requested review from mgawarkiewicz-intel mgawarkiewicz-intel will be requested when the pull request is marked ready for review mgawarkiewicz-intel is a code owner

tzielinski-habana Awaiting requested review from tzielinski-habana tzielinski-habana will be requested when the pull request is marked ready for review tzielinski-habana is a code owner

afierka-intel Awaiting requested review from afierka-intel afierka-intel will be requested when the pull request is marked ready for review afierka-intel is a code owner

xuechendi Awaiting requested review from xuechendi xuechendi will be requested when the pull request is marked ready for review xuechendi is a code owner

jikunshang Awaiting requested review from jikunshang jikunshang will be requested when the pull request is marked ready for review jikunshang is a code owner

deepvars Awaiting requested review from deepvars deepvars will be requested when the pull request is marked ready for review deepvars is a code owner

At least 1 approving review is required to merge this pull request.

Labels

None yet

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Allow usage of fused_block_softmax_adjustment for Qwen with Lazy #246

Allow usage of fused_block_softmax_adjustment for Qwen with Lazy #246