[ROCm][TunableOp] Speed-up matmul_small_brute_force_tunableop unit test (pytorch#147659)
This PR speeds up a unit test (UT) and refactors some related tests.
A previous PR, pytorch#142422, fixed the matmul_small_brute_force_tunableop test for the FP16 data type by adding TunableOp numerical checks. As an unfortunate side effect, it significantly increased the execution time for the FP32 and FP64 data types. This PR *reduces* the execution time by 20+ minutes.
We also move a hipBLASLt version check to a different tunableop UT for simplicity.
Pull Request resolved: pytorch#147659
Approved by: https://github.com/jeffdaily