The canonical way to force runtime API context establishment is to call cudaFree(0). If you have multiple devices, call cudaSetDevice() with the ID of the device you want to establish a context on, then cudaFree(0) to establish the context.
EDIT: Note that as of CUDA 5.0, it appears that the heuristics of context establishment are slightly different and cudaSetDevice() itself establishes context on the device is it called on. So the explicit cudaFree(0) call is no longer necessary (although it won't hurt anything).
3
ChrisV
On
Using the runtime API: cudaDeviceSynchronize, cudaDeviceGetLimit, or anything that actually accesses the context should work.
I'm quite certain you're not using the driver API, as it doesn't do that sort of lazy initialization, but for others' benefit the driver call would be cuCtxCreate.
The canonical way to force runtime API context establishment is to call
cudaFree(0)
. If you have multiple devices, callcudaSetDevice()
with the ID of the device you want to establish a context on, thencudaFree(0)
to establish the context.EDIT: Note that as of CUDA 5.0, it appears that the heuristics of context establishment are slightly different andcudaSetDevice()
itself establishes context on the device is it called on. So the explicitcudaFree(0)
call is no longer necessary (although it won't hurt anything).