With CUDA 12.0, support has been added for loading libraries of kernels dynamically, from disk or from memory: Driver API, § 6.12 Library Management. From these libraries, one can obtain "kernels" without an associated device or context. Their handle type is CUkernel, as opposed to CUfunction for proper, in-context kernels.
Now, in the § 6.22 Execution Control section of the Driver API, various launch functions are described as taking either "a CUDA function CUfunction or a CUDA kernel CUkernel": cuLaunchKernel, cuLaunchKernelEx, cuLaunchCooperativeKernel, and perhaps others.
The thing is, when I look at their signatures, they all still take plain old CUfunction's, not CUkernel's - and there is no overloaded function differing in the choice of this parameter.
So, what gives? Can we launch CUkernel's, or can't we?
You can cast a CUkernel to a CUfunction when using it with cuLaunchKernel, as indicated in this part of the CUDA Driver API documentation. For more information, check out this blog post on context-independent module loading. Thanks!
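To make the cast concrete, here is a minimal sketch of the whole flow. It assumes CUDA 12.0+, a hypothetical compiled module "kernels.cubin" containing a parameterless kernel named "myKernel", and it elides error checking on every driver call for brevity:

```c
#include <cuda.h>

int main(void) {
    cuInit(0);

    // Load a library of kernels; no device or context is required yet.
    CUlibrary lib;
    cuLibraryLoadFromFile(&lib, "kernels.cubin",
                          NULL, NULL, 0,   // no JIT options
                          NULL, NULL, 0);  // no library options

    // Obtain a context-independent kernel handle (CUkernel, not CUfunction).
    CUkernel kernel;
    cuLibraryGetKernel(&kernel, lib, "myKernel");

    // A context is still needed at launch time; create one the usual way.
    CUdevice dev;
    CUcontext ctx;
    cuDeviceGet(&dev, 0);
    cuCtxCreate(&ctx, 0, dev);

    // cuLaunchKernel's first parameter is declared as CUfunction, but a
    // CUkernel may be cast to it; the driver resolves the kernel against
    // the current context.
    cuLaunchKernel((CUfunction)kernel,
                   1, 1, 1,     // grid dimensions
                   1, 1, 1,     // block dimensions
                   0, NULL,     // dynamic shared memory, stream
                   NULL, NULL); // kernel params (none here), extra
    cuCtxSynchronize();

    cuLibraryUnload(lib);
    cuCtxDestroy(ctx);
    return 0;
}
```

Note that the cast does not remove the need for a current context at launch time; it only defers the kernel-to-context binding from load time to launch time.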