I would like to profile my GPU application to see what limits its performance. The application runs on a remote server, so I have to use nvprof there and create a file that I can import locally into the Visual Profiler. I've tried creating the file with nvprof -o file_name <app> <params> and with nvprof --analysis-metrics --output-profile file_name <app> <params>, but when I import these files into the Visual Profiler, some fields in the Analysis section are empty: "insufficient global memory load data", "insufficient global memory store data", "insufficient kernel SM data", and so on. How can I generate a file (or several files) containing all the information the Analysis section needs? I compile the CUDA code with nvcc using the flags -lineinfo -arch compute_20 -code sm_20 --ptxas-options=-v.
[Screenshot: examples of empty fields in the Analysis section]
Export CUDA nvprof output to the Visual Profiler
6k views. Asked by Stefano Sandonà
1 Answer
Instead of importing the profile file into the Visual Profiler, you can try adding a new session. I ran into a similar problem. What I did was add a session following the instructions here, and after that you should be able to see all the information.
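If creating a session against the remote machine is not an option, the import route from the question may still work when the timeline and the analysis metrics are collected in two separate runs and both files are selected together in the Visual Profiler's import dialog. The following is only a dry-run sketch under that assumption: it prints the two commands instead of running them, and the application path and the file names timeline.nvprof and analysis.nvprof are hypothetical placeholders. The -o and --analysis-metrics options are the ones already used in the question; their behaviour can differ between CUDA toolkit versions.

```shell
#!/bin/sh
# Dry-run sketch: prints the nvprof commands, does not execute them.
# APP and PARAMS are placeholders for the real application and its arguments.
APP="${APP:-./my_app}"
PARAMS="${PARAMS:-}"

nvprof_cmds() {
    # Run 1: timeline only (hypothetical output name timeline.nvprof)
    echo "nvprof -o timeline.nvprof $APP $PARAMS"
    # Run 2: events/metrics for the Analysis views (hypothetical output name)
    echo "nvprof --analysis-metrics -o analysis.nvprof $APP $PARAMS"
}

nvprof_cmds
```

After checking the printed commands, run them on the server, copy both files to the local machine, and import them together rather than one at a time.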