Description
Related to the issue #1066.
We uses the the SYCL barrier to measure the Triton kernel performance which is not accurate especially after the immediate command list function is enabled.
The Triton XPU micro-benchmark now switch to use the SYCL event and IPEX profiler extension to measure the performance. But the IPEX profiler extension is going to be removed.
We need to formally support the Triton Proton profiler tools.