Open
Description
Running the example from the posts/tensor-cores
folder as discussed at https://devblogs.nvidia.com/programming-tensor-cores-cuda-9/, it appears the nubmers are not as close as expected. I am getting the following output
./TCGemm
M = 16384, N = 16384, K = 16384. alpha = 2.000000, beta = 2.000000
Running with wmma...
Running with cuBLAS...
Checking results...
8266.587891 8267.766602
8240.230469 8241.420898
8242.393555 8243.574219
8209.478516 8210.649414
8100.519043 8101.664062
8251.499023 8252.675781
8189.156738 8190.297852
8260.410156 8261.580078
8311.802734 8313.015625
WMMA does not agree with cuBLAS! 268435456 errors!
Metadata
Metadata
Assignees
Labels
No labels