I am trying to use perf on my device with an AMD cpu, but I can't really find any information about how to get, let's say, cache-misses from AMD. I read that you need to write -e rNNN, where NNN is a hex-code of event, but I didn't manage to find any table or something to look at those codes. Could you help me with this, because it seems that there is no information in the internet at all! Actually, in the manual for perf there are some links, but they are not valid :(
AMD perf events
2.9k Views Asked by Baz Pasha At
1
There are 1 best solutions below
Related Questions in X86
- Why do we need AX instead of MOV DS, data directly with a segment?
- Drawing circles of increasing radius
- Assembly print on screen using pop ecx
- How to add values from vector to each other
- Intel x64 instructions CMPSB/CMPSW/CMPSD/CMPSQ
- Compact implementation of logical AND in x86 assembly
- Can feenableexcept hurt a program performance?
- How do I display the result and remainder in ax and dx in Assembly (tasm)
- ASM : Trouble using int21h on real machine
- jmp instruction *%eax
- What steps are needed to load a second stage bootloader by name on a FAT32 file system in x86 Assembly?
- Assembly code to print a new line string
- Write System Call Argument Registers
- How to jump to an address saved in a register in intel assembly?
- Find middle value of a list
Related Questions in PERF
- How to get results each sec from "perf stat -d sleep 1000"
- QEMU-KVM and Perf Statistics
- How to measure if a program was run in parallel over multiple cores in Linux?
- perf: comparing "CPU performance / CPU impact" of two binaries?
- How can I capture combined kernel and userspace stacks with perf
- Monitoring Processes/Inter-process communication with Linux
- Compile Linux perf with ncurses interface?
- Why is IPC lower than one on a modern processor?
- Determining Instruction Size of A process / A function call in Linux
- Extract single line from command output in terminal
- Can't sample hardware cache events with linux perf
- Can I get the python call stack with the linux perf?
- perf stat frontend and backend cycles showing greater than 100%
- Understanding number of loads and stores retired in a x86 micro-benchmark
- Why are number of instructions non-deterministic in Linux performance counters
Related Questions in AMD-PROCESSOR
- Effective way to extract from SSE vector on AMD processors
- JAVA_HOME: /bin/java vs /bin/amd64/java
- How to enable virtualization in bios windows 8 using AMD E1-2100 APU G505?
- Bad results with a simple FFT 1D OpenCL example
- How should I declare a vector variable in OpenCL that can fully utilize GPU's vectorized feature
- Unable to run virtualmachine - vbox amd-v
- Method to get GPU information for OS or OpenGL API
- AMD perf events
- What are the costs of failed store-to-load forwarding on x86?
- OpenCL bicubic interpolation kernel failed with error CL_EXEC_STATUS_ERROR_FOR_EVENTS_IN_WAIT_LIST
- How to debug OpenCL on Nvidia GPUs?
- How do I increase the "global memory" available to the Intel CPU OpenCL driver?
- Getting wrong values returned from cl::BufferGL after a upgrade from HD6790 to HD7850
- Can the AMD64 ISA work without licensing the x86 ISA?
- Intel OpenMP library slows down memory bandwidth significantly on AMD platforms by setting KMP_AFFINITY=scatter
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Check
perf listoutput, in modern Linux kernel versions it may report some architecture-specific hardware events. Some generic hardware events may be always reported byperf list(especially with older kernels), but not all of them are mapped to some real hardware event. Thecache-missesandcyclesare such generic perf hw events, not always mapped (mapping is in perf source code around http://elixir.free-electrons.com/linux/latest/source/arch/x86/events/amd/core.c for amd - withcache-missesmapped to[PERF_COUNT_HW_CACHE_MISSES] = 0x077e,).Also try different events from perf list with
perf stat -e event1,cycles,instructions,cpu-clockwhere event1 is the event you want to check and there are some working events.To encode raw events it can be easier to use processor docs, perf sources (for exact hex encoding) and some external tools. For Intel there is
ocperf.pyfrom http://github.com/andikleen/pmu-tools site; and there is generic raw generator in perfmon2/libpfm4, described at http://www.bnikolic.co.uk/blog/hpc-prof-events.html "How to monitor the full range of CPU performance events" by Bojan Nikolic withshowevtinfoutil (it is also recommended way of getting rXXXX codes for perf in FAQ: http://web.eece.maine.edu/~vweaver/projects/perf_events/faq.html#q2e Q2e. How do I determine the proper "raw" event value):