I am trying to calculate the throughput of my kernel, which is written in OpenCL, but I am not sure how to do that. I found a file generated after compilation (the .attrb file) that reports a throughput of 0.435, but I am not sure what that means. Is there any other way to find the throughput?
Throughput calculation in OpenCL
Asked by user10149483
There are 2 answers below.
Answer by Marko Bencik:
This is a very vague question.
Do you mean only the kernel, without loading the data?
What is the kernel going to do, what kind of hardware are you running it on, how is your data organized, and how do you manage your buffers?
Is everything in global memory? Are you also defining latencies? Do you need to maximize the throughput? Are you going to optimize for specific hardware?
For me, this raises many questions.
The throughput of a kernel in OpenCL is calculated as:
To measure the time, use a cl_event.
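The formula itself did not survive in this answer, so the following is only a sketch under one common assumption: throughput is taken as the number of bytes the kernel processes divided by its execution time. The helper names `elapsed_seconds` and `throughput_gb_per_s` are hypothetical. The two timestamps are assumed to come from `clGetEventProfilingInfo` on the kernel's `cl_event`, which reports them in nanoseconds.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Getting the timestamps on the host (not compiled here, needs an OpenCL
 * runtime):
 *   - create the queue with the CL_QUEUE_PROFILING_ENABLE property
 *   - pass a cl_event to clEnqueueNDRangeKernel, then clWaitForEvents
 *   - read CL_PROFILING_COMMAND_START and CL_PROFILING_COMMAND_END with
 *     clGetEventProfilingInfo; both are cl_ulong values in nanoseconds
 */

/* Kernel execution time in seconds from two profiling timestamps (ns). */
double elapsed_seconds(uint64_t start_ns, uint64_t end_ns)
{
    assert(end_ns >= start_ns);
    return (double)(end_ns - start_ns) * 1e-9;
}

/*
 * Assumed definition: bytes processed / execution time, reported in GB/s
 * (1 GB = 1e9 bytes). Returns 0.0 when the interval is empty.
 */
double throughput_gb_per_s(uint64_t bytes, uint64_t start_ns, uint64_t end_ns)
{
    double seconds = elapsed_seconds(start_ns, end_ns);
    if (seconds <= 0.0) {
        return 0.0;
    }
    return (double)bytes / seconds / 1e9;
}
```

For example, a kernel that touches 4000 bytes and runs for 1000 ns works out to roughly 4 GB/s under this definition. This measures the kernel only; if host-to-device transfers should count, their events would have to be timed as well.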