
[QNN EP] Fix 16x16 MatMul translation #24846

Open
wants to merge 1 commit into base: main

Conversation

quic-tirupath
Contributor

Description

  • QNN's 16x16 FC op doesn't support asymmetric int16 weights.
  • QNN's 16x16 MatMul doesn't support an asymmetric int16 weight initializer.
  • Insert a Convert op to convert the asymmetric uint16 weight to a symmetric int16 weight (see the sketch below).
  • Add unit tests to verify 16x16 MatMul translations.
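
For reference, a minimal sketch of the arithmetic behind the inserted Convert op: an asymmetric uint16 value q with scale s and zero point zp encodes real = s * (q - zp), so subtracting zp while keeping the same scale yields an equivalent symmetric int16 encoding with zero point 0. The helper name below is hypothetical and is not the actual QNN EP implementation.

```cpp
// Sketch of re-encoding an asymmetric uint16 quantized weight as symmetric int16.
#include <algorithm>
#include <cstdint>
#include <vector>

std::vector<int16_t> ConvertUint16AsymToInt16Sym(const std::vector<uint16_t>& q_u16,
                                                 int32_t zero_point /* asymmetric offset */) {
  std::vector<int16_t> q_s16;
  q_s16.reserve(q_u16.size());
  for (uint16_t q : q_u16) {
    // Shift by the zero point; saturate in case the shifted value
    // falls outside the int16 range [-32768, 32767].
    int32_t shifted = static_cast<int32_t>(q) - zero_point;
    shifted = std::clamp(shifted, -32768, 32767);
    q_s16.push_back(static_cast<int16_t>(shifted));
  }
  return q_s16;  // Same scale as before, zero point now 0 (symmetric).
}
```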

Motivation and Context

  • This fix schedules 16x16 MatMul ops on the QNN HTP accelerator.
  • This improves the inference time of models that contain 16x16 MatMul operators.

@HectorSVC added the ep:QNN (issues related to QNN execution provider) label on May 23, 2025
@HectorSVC
Contributor

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows x64 QNN CI Pipeline


Azure Pipelines successfully started running 5 pipeline(s).

Labels
ep:QNN issues related to QNN execution provider
Projects
None yet
2 participants