You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
"CSThreadGroups", "Items", "Total number of thread groups."
117
-
"CSWavefronts", "Items", "The total number of wavefronts used for the CS."
118
-
"CSThreads", "Items", "The number of CS threads processed by the hardware."
116
+
"CSThreadGroupsLaunched", "Items", "Total number of thread groups launched."
117
+
"CSWavefrontsLaunched", "Items", "The total number of wavefronts launched for the CS."
118
+
"CSThreadsLaunched", "Items", "The number of CS threads launched and processed by the hardware."
119
119
"CSThreadGroupSize", "Items", "The number of CS threads within each thread group."
120
120
"CSVALUInsts", "Items", "The average number of vector ALU instructions executed per work-item (affected by flow control)."
121
121
"CSVALUUtilization", "Percentage", "The percentage of active vector ALU threads in a wave. A lower number can mean either more thread divergence in a wave or that the work-group size is not a multiple of the wave size. Value range: 0% (bad), 100% (ideal - no thread divergence)."
@@ -127,16 +127,10 @@ ComputeShader Group
127
127
"CSVALUBusyCycles", "Cycles", "Number of GPU cycles where vector ALU instructions are processed."
128
128
"CSSALUBusy", "Percentage", "The percentage of GPUTime scalar ALU instructions are processed. Value range: 0% (bad) to 100% (optimal)."
129
129
"CSSALUBusyCycles", "Cycles", "Number of GPU cycles where scalar ALU instructions are processed."
130
-
"CSMemUnitBusy", "Percentage", "The percentage of GPUTime the memory unit is active. The result includes the stall time (MemUnitStalled). This is measured with all extra fetches and writes and any cache or memory effects taken into account. Value range: 0% to 100% (fetch-bound)."
131
-
"CSMemUnitBusyCycles", "Cycles", "Number of GPU cycles the memory unit is active. The result includes the stall time (MemUnitStalled). This is measured with all extra fetches and writes and any cache or memory effects taken into account."
132
-
"CSMemUnitStalled", "Percentage", "The percentage of GPUTime the memory unit is stalled. Try reducing the number or size of fetches and writes if possible. Value range: 0% (optimal) to 100% (bad)."
133
-
"CSMemUnitStalledCycles", "Cycles", "Number of GPU cycles the memory unit is stalled. Try reducing the number or size of fetches and writes if possible."
134
-
"CSWriteUnitStalled", "Percentage", "The percentage of GPUTime the write unit is stalled."
135
-
"CSWriteUnitStalledCycles", "Cycles", "Number of GPU cycles the write unit is stalled."
136
130
"CSGDSInsts", "Items", "The average number of GDS read or GDS write instructions executed per work item (affected by flow control)."
137
131
"CSLDSInsts", "Items", "The average number of LDS read/write instructions executed per work-item (affected by flow control)."
138
132
"CSALUStalledByLDS", "Percentage", "The percentage of GPUTime ALU units are stalled by the LDS input queue being full or the output queue being not ready. If there are LDS bank conflicts, reduce them. Otherwise, try reducing the number of LDS accesses if possible. Value range: 0% (optimal) to 100% (bad)."
139
-
"CSALUStalledByLDSCycles", "Cycles", "Number of GPU cycles the ALU units are stalled by the LDS input queue being full or the output queue being not ready. If there are LDS bank conflicts, reduce them. Otherwise, try reducing the number of LDS accesses if possible."
133
+
"CSALUStalledByLDSCycles", "Cycles", "Number of GPU cycles each wavefronts' ALU units are stalled by the LDS input queue being full or the output queue being not ready. If there are LDS bank conflicts, reduce them. Otherwise, try reducing the number of LDS accesses if possible."
140
134
"CSLDSBankConflict", "Percentage", "The percentage of GPUTime LDS is stalled by bank conflicts. Value range: 0% (optimal) to 100% (bad)."
141
135
"CSLDSBankConflictCycles", "Cycles", "Number of GPU cycles the LDS is stalled by bank conflicts. Value range: 0 (optimal) to GPUBusyCycles (bad)."
"PreTessVALUInstCount", "Items", "Average number of vector ALU instructions executed for the VS and HS in a pipeline that uses tessellation. Affected by flow control."
63
+
"PreTessSALUInstCount", "Items", "Average number of scalar ALU instructions executed for the VS and HS in a pipeline that uses tessellation. Affected by flow control."
64
+
"PreTessVALUBusy", "Percentage", "The percentage of GPUTime vector ALU instructions are being processed for the VS and HS in a pipeline that uses tessellation."
65
+
"PreTessVALUBusyCycles", "Cycles", "Number of GPU cycles vector where ALU instructions are being processed for the VS and HS in a pipeline that uses tessellation."
66
+
"PreTessSALUBusy", "Percentage", "The percentage of GPUTime scalar ALU instructions are being processed for the VS and HS in a pipeline that uses tessellation."
67
+
"PreTessSALUBusyCycles", "Cycles", "Number of GPU cycles where scalar ALU instructions are being processed for the VS and HS in a pipeline that uses tessellation."
62
68
"PreTessVerticesIn", "Items", "The number of vertices processed by the VS and HS when using tessellation."
63
69
64
70
PostTessellation Group
@@ -69,6 +75,12 @@ PostTessellation Group
69
75
:widths: 15, 10, 75
70
76
71
77
"PostTessPrimsOut", "Items", "The number of primitives output by the DS and GS when using tessellation."
78
+
"PostTessVALUInstCount", "Items", "Average number of vector ALU instructions executed for the DS and GS in a pipeline that uses tessellation. Affected by flow control."
79
+
"PostTessSALUInstCount", "Items", "Average number of scalar ALU instructions executed for the DS and GS in a pipeline that uses tessellation. Affected by flow control."
80
+
"PostTessVALUBusy", "Percentage", "The percentage of GPUTime vector ALU instructions are being processed for the DS and GS in a pipeline that uses tessellation."
81
+
"PostTessVALUBusyCycles", "Cycles", "Number of GPU cycles vector where ALU instructions are being processed for the DS and GS in a pipeline that uses tessellation."
82
+
"PostTessSALUBusy", "Percentage", "The percentage of GPUTime scalar ALU instructions are being processed for the DS and GS in a pipeline that uses tessellation."
83
+
"PostTessSALUBusyCycles", "Cycles", "Number of GPU cycles where scalar ALU instructions are being processed for the DS and GS in a pipeline that uses tessellation."
"CSThreadGroups", "Items", "Total number of thread groups."
105
-
"CSWavefronts", "Items", "The total number of wavefronts used for the CS."
106
-
"CSThreads", "Items", "The number of CS threads processed by the hardware."
116
+
"CSThreadGroupsLaunched", "Items", "Total number of thread groups launched."
117
+
"CSWavefrontsLaunched", "Items", "The total number of wavefronts launched for the CS."
118
+
"CSThreadsLaunched", "Items", "The number of CS threads launched and processed by the hardware."
107
119
"CSThreadGroupSize", "Items", "The number of CS threads within each thread group."
108
-
"CSMemUnitBusy", "Percentage", "The percentage of GPUTime the memory unit is active. The result includes the stall time (MemUnitStalled). This is measured with all extra fetches and writes and any cache or memory effects taken into account. Value range: 0% to 100% (fetch-bound)."
109
-
"CSMemUnitBusyCycles", "Cycles", "Number of GPU cycles the memory unit is active. The result includes the stall time (MemUnitStalled). This is measured with all extra fetches and writes and any cache or memory effects taken into account."
110
-
"CSMemUnitStalled", "Percentage", "The percentage of GPUTime the memory unit is stalled. Try reducing the number or size of fetches and writes if possible. Value range: 0% (optimal) to 100% (bad)."
111
-
"CSMemUnitStalledCycles", "Cycles", "Number of GPU cycles the memory unit is stalled. Try reducing the number or size of fetches and writes if possible."
112
-
"CSWriteUnitStalled", "Percentage", "The percentage of GPUTime the write unit is stalled."
113
-
"CSWriteUnitStalledCycles", "Cycles", "Number of GPU cycles the write unit is stalled."
120
+
"CSVALUInsts", "Items", "The average number of vector ALU instructions executed per work-item (affected by flow control)."
121
+
"CSVALUUtilization", "Percentage", "The percentage of active vector ALU threads in a wave. A lower number can mean either more thread divergence in a wave or that the work-group size is not a multiple of the wave size. Value range: 0% (bad), 100% (ideal - no thread divergence)."
122
+
"CSSALUInsts", "Items", "The average number of scalar ALU instructions executed per work-item (affected by flow control)."
123
+
"CSVFetchInsts", "Items", "The average number of vector fetch instructions from the video memory executed per work-item (affected by flow control)."
124
+
"CSSFetchInsts", "Items", "The average number of scalar fetch instructions from the video memory executed per work-item (affected by flow control)."
125
+
"CSVWriteInsts", "Items", "The average number of vector write instructions to the video memory executed per work-item (affected by flow control)."
114
126
"CSGDSInsts", "Items", "The average number of GDS read or GDS write instructions executed per work item (affected by flow control)."
115
127
"CSLDSInsts", "Items", "The average number of LDS read/write instructions executed per work-item (affected by flow control)."
116
128
"CSALUStalledByLDS", "Percentage", "The percentage of GPUTime ALU units are stalled by the LDS input queue being full or the output queue being not ready. If there are LDS bank conflicts, reduce them. Otherwise, try reducing the number of LDS accesses if possible. Value range: 0% (optimal) to 100% (bad)."
117
-
"CSALUStalledByLDSCycles", "Cycles", "Number of GPU cycles the ALU units are stalled by the LDS input queue being full or the output queue being not ready. If there are LDS bank conflicts, reduce them. Otherwise, try reducing the number of LDS accesses if possible."
129
+
"CSALUStalledByLDSCycles", "Cycles", "The average number of GPU cycles the each wavefronts' ALU units are stalled by the LDS input queue being full or the output queue being not ready. If there are LDS bank conflicts, reduce them. Otherwise, try reducing the number of LDS accesses if possible."
118
130
"CSLDSBankConflict", "Percentage", "The percentage of GPUTime LDS is stalled by bank conflicts. Value range: 0% (optimal) to 100% (bad)."
119
131
"CSLDSBankConflictCycles", "Cycles", "Number of GPU cycles the LDS is stalled by bank conflicts. Value range: 0 (optimal) to GPUBusyCycles (bad)."
0 commit comments