I know my code is running in the GPU because performance profiler says so, but I'm getting mixed info about what actually causes it to run in the GPU. This reputable Microsoft developer says parallel_for is CPU and parallel_for_each is GPU. This reputable Microsoft developer implies parallel_for and parallel_for_each are interchangeable (with slight changes in how they are used) but doesn't even mention the GPU or C++amp, although he does compare both to OpenMP. MSDN has articles in each vein as well. Is it which 'restrict' clause one uses? I guess I could do some experiments, but that's not the official word. Any comments will be appreciated.
1
There are 1 best solutions below
Related Questions in C++
- How to immediately apply DISPLAYCONFIG_SCALING display scaling mode with SetDisplayConfig and DISPLAYCONFIG_PATH_TARGET_INFO
- Why can't I use templates members in its specialization?
- How to fix "Access violation executing location" when using GLFW and GLAD
- Dynamic array of structures in C++/ cannot fill a dynamic array of doubles in structure from dynamic array of structures
- How do I apply the interface concept with the base-class in design?
- File refuses to compile std::erase() even if using -std=g++23
- How can I do a successful map when the number of elements to be mapped is not consistent in Thrust C++
- Can std::bit_cast be applied to an empty object?
- Unexpected inter-thread happens-before relationships from relaxed memory ordering
- How i can move element of dynamic vector in argument of function push_back for dynamic vector
- Brick Breaker Ball Bounce
- Thread-safe lock-free min where both operands can change c++
- Watchdog Timer Reset on ESP32 using Webservers
- How to solve compiler error: no matching function for call to 'dmhFS::dmhFS()' in my case?
- Conda CMAKE CXX Compiler error while compiling Pytorch
Related Questions in C++-AMP
- Is the order of 'restrict's significant?
- c++ AMP Concurrency::runtime_exception occurs when creating array_view
- How many times is data copied in a C++ AMP array?
- Run AMP C++ kernel thread per row
- Get FLOAT texture from C++ AMP instead of TYPELESS
- C++ AMP Division Quirk When Stressing GPU
- Does C++Amp require GPU hardware before it will build / execute?
- 'Concurrency': a namespace with this name does not exist
- Weird performance in matrix multiplication using AMP dependent on memory layout
- Best way to leverage the GPU
- Problems in exit code using C++ AMP
- Is restrict(...) not supported with lambda functions written in template classes?
- Using Gaussian Filter in C++ AMP returning wrong colours
- FFT in C++ AMP Throw CLIPBRD_E_CANT_OPEN error
- Using c ++ amp to speed up the program in which the MPIR library (GMP) is used. Is this possible?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular # Hahtags
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
If you want to be sure you are running on the GPU select the GPU to run your code on.
But to be more specific the restrict key word will attempt to run on the accelerator if one is available. if it is not the same code will run on the CPU yourself. In some of our project we detect the accelerator (GPU) with the most memory and make it the "chosen" accelerator.
Hope this helps.