SHORT L3 cache miss rate/ratio

EVENTSET
PMC0  INSTRUCTIONS_RETIRED
PMC1  L3_READ_REQUEST_ALL_ALL_CORES
PMC2  L3_MISSES_ALL_ALL_CORES

METRICS
Runtime (RDTSC) [s] time
L3 request rate  PMC1/PMC0
L3 miss rate   PMC2/PMC0
L3 miss ratio  PMC2/PMC1

LONG
Formulas:
L3 request rate =  L3_READ_REQUEST_ALL_ALL_CORES / INSTRUCTIONS_RETIRED
L3 miss rate  = L3_MISSES_ALL_ALL_CORES / INSTRUCTIONS_RETIRED
L3 miss ratio =  L3_MISSES_ALL_ALL_CORES / L3_READ_REQUEST_ALL_ALL_CORES
-
This group measures the locality of your data accesses with regard to the
L3 Cache. L3 request rate tells you how data intensive your code is
or how many Data accesses you have in average per instruction.
The L3 miss rate gives a measure how often it was necessary to get
cachelines from memory. And finally L3 miss ratio tells you how many of your
memory references required a cacheline to be loaded from a higher level.
While the Data cache miss rate might be given by your algorithm you should
try to get Data cache miss ratio as low as possible by increasing your cache reuse.
Note: As this group measures the accesses from all cores it only makes sense
to measure with one core per socket, similiar as with the Intel Nehalem Uncore events.
This group was taken from the whitepaper -Basic Performance Measurements for AMD Athlon 64,
AMD Opteron and AMD Phenom Processors- from Paul J. Drongowski.


