CUDA graph stream capture with thrust::reduce

1.2k Views Asked by Cos_ma At 01 April 2020 at 12:00

When I try to capture stream execution to build a CUDA graph, a call to thrust::reduce causes the runtime error cudaErrorStreamCaptureUnsupported: operation not permitted when stream is capturing. I have tried returning the reduction result to both host and device variables, and I am issuing the reduction on the proper stream by means of thrust::cuda::par.on(stream). Is there any way to add Thrust function calls to CUDA graphs?


There is 1 best solution below

heapoverflow On 02 April 2020 at 13:23 BEST ANSWER

Thrust's reduction is a blocking operation on the host side: thrust::reduce returns its result by value, so the call has to wait for the reduction kernel to finish before it can return. I am assuming that you are using the result of the reduction as a parameter to one of your subsequent kernels. In that case, when you capture a CUDA graph, it cannot instantiate the graph executable, because you depend on a host-side variable that is not available until the reduction kernel has finished executing. As a solution, you can try adding a host node to your graph that returns the result of the reduction.
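A minimal sketch of what the host-node suggestion could look like, under some assumptions that are not from the question. Since thrust::reduce itself returns to the host, the sketch swaps in cub::DeviceReduce::Sum, which writes its result to device memory and takes caller-provided temporary storage, so nothing has to block during capture. The names consume_result, d_out and h_sum are hypothetical, and cudaGraphInstantiateWithFlags assumes CUDA 11.4 or newer. Whether a given CUB version is fully capture-safe is worth verifying on your toolkit.

```cuda
#include <cstdio>
#include <cstdlib>
#include <vector>
#include <cuda_runtime.h>
#include <cub/cub.cuh>

// Abort with a message if a CUDA runtime call fails.
#define CHECK(call)                                                   \
    do {                                                              \
        cudaError_t err_ = (call);                                    \
        if (err_ != cudaSuccess) {                                    \
            std::fprintf(stderr, "%s failed: %s\n", #call,            \
                         cudaGetErrorString(err_));                   \
            std::exit(EXIT_FAILURE);                                  \
        }                                                             \
    } while (0)

// Hypothetical host-node callback. It runs on a CPU thread once the
// preceding graph nodes (reduction and copy) have completed.
void CUDART_CB consume_result(void* user_data) {
    const int* h_sum = static_cast<const int*>(user_data);
    std::printf("sum = %d\n", *h_sum);
}

int main() {
    const int n = 1 << 20;
    std::vector<int> h_in(n, 1);  // n ones, so the sum should equal n

    cudaStream_t stream;
    CHECK(cudaStreamCreate(&stream));

    int *d_in = nullptr, *d_out = nullptr, *h_sum = nullptr;
    CHECK(cudaMalloc(&d_in, n * sizeof(int)));
    CHECK(cudaMalloc(&d_out, sizeof(int)));
    CHECK(cudaMallocHost(&h_sum, sizeof(int)));  // pinned, for the async copy
    CHECK(cudaMemcpy(d_in, h_in.data(), n * sizeof(int), cudaMemcpyHostToDevice));

    // Query and allocate CUB's temporary storage BEFORE capture begins.
    // With a null storage pointer this call only reports the size.
    void* d_temp = nullptr;
    size_t temp_bytes = 0;
    CHECK(cub::DeviceReduce::Sum(d_temp, temp_bytes, d_in, d_out, n, stream));
    CHECK(cudaMalloc(&d_temp, temp_bytes));

    // Capture: reduction -> copy to pinned host memory -> host node.
    cudaGraph_t graph;
    CHECK(cudaStreamBeginCapture(stream, cudaStreamCaptureModeGlobal));
    CHECK(cub::DeviceReduce::Sum(d_temp, temp_bytes, d_in, d_out, n, stream));
    CHECK(cudaMemcpyAsync(h_sum, d_out, sizeof(int), cudaMemcpyDeviceToHost, stream));
    CHECK(cudaLaunchHostFunc(stream, consume_result, h_sum));  // captured as a host node
    CHECK(cudaStreamEndCapture(stream, &graph));

    cudaGraphExec_t exec;
    CHECK(cudaGraphInstantiateWithFlags(&exec, graph, 0));  // assumes CUDA 11.4+
    CHECK(cudaGraphLaunch(exec, stream));
    CHECK(cudaStreamSynchronize(stream));

    const bool ok = (*h_sum == n);

    CHECK(cudaGraphExecDestroy(exec));
    CHECK(cudaGraphDestroy(graph));
    CHECK(cudaFree(d_temp));
    CHECK(cudaFree(d_out));
    CHECK(cudaFree(d_in));
    CHECK(cudaFreeHost(h_sum));
    CHECK(cudaStreamDestroy(stream));
    return ok ? EXIT_SUCCESS : EXIT_FAILURE;
}
```

If the reduction result is only needed by a later kernel, that kernel could instead take the d_out pointer and read the value on the device, in which case the copy and the host node are unnecessary.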
