pcm-latency reports small latency #518
Unanswered
majid0jalili
asked this question in
Q&A
Replies: 1 comment
-
There are a few differences: The DDR latency reported by pcm-latency is measured at the memory controller. The pcm main utility has LLCRDMISSLAT metric which is similar to the mentioned CHA miss latency. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi,
I was trying to measure the DRAM latency with pcm-latency, and I noticed that the number it reports is a bit off.
For example, I ran GUPS and STREAM and below is the output for one iteration, although it is roughly similar for other samples:
GUPS:
Average latency per socket
L1 Cache Miss Latency(ns) [Adding 5 clocks for L1 Miss]
Socket0: 4.97
Socket1: 5.10
DDR read Latency(ns)
Socket0: 27.27
Socket1: 27.22
STREAM
Average latency per socket
L1 Cache Miss Latency(ns) [Adding 5 clocks for L1 Miss]
Socket0: 21.26
Socket1: 21.40
DDR read Latency(ns)
Socket0: 118.13
Socket1: 106.93
There is no way this can be true; a fully random access pattern has that low latency. I digged further and used
perf stat
to see if this can be solved. I noticedperf stat
uses 3 hw counters to measure latencyC1=
cha/event=0x36\,umask=0x21\,config=0x40433/
C2=
cha/event=0x35\,umask=0x21\,config=0x40433/
C3=
cha_0/event=0x0/
Then, does
C1/C2/C3*1billion
to report the DRAM latency. I took the below from the Kernel websiteI tried on two different machines SKX and casecadelake and observe the same behavior.
Beta Was this translation helpful? Give feedback.
All reactions