Local Memory: CUDA presentation

187 Views Asked by Thierry Brown At 08 August 2025 at 03:33

I was reading this presentation document: http://on-demand.gputechconf.com/gtc-express/2011/presentations/register_spilling.pdf

On page 3 of the presentation, the author states:

"A store always happens before a load. Only GPU threads can access LMEM addresses."

Can anybody explain to me why? Does he mean when the local memory is first initialised?

There are 2 best solutions below

Robert Crovella On 13 March 2017 at 17:49

In this respect, local memory is something like shared memory.

In order to do anything useful with shared memory, you have to initialize it (store something to it) first. The same is true for local memory.
Only CUDA thread code can access local memory. There are no CUDA API calls like cudaMemcpy that can access local memory. It is not possible to initialize local memory from host code.

The same comments are basically true for shared memory.
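
To illustrate the shared memory side of this comparison, here is a minimal sketch, not taken from the presentation. The kernel name `reverse_tile` and the host helper `reverse_on_gpu` are hypothetical names chosen for the example, and the tile size of 256 is an assumption. The point is only that the `__shared__` array has no defined contents until kernel threads store to it, and the host can reach it only indirectly, through global memory buffers.

```cuda
#include <cassert>
#include <cuda_runtime.h>

// Hypothetical kernel: one block reverses the first n elements of `in`.
// `tile` has no defined contents when the block starts, so every thread
// stores into it before any thread loads from it.
__global__ void reverse_tile(const int *in, int *out, int n)
{
    __shared__ int tile[256];               // uninitialised at block start
    int t = threadIdx.x;
    if (t < n) tile[t] = in[t];             // store first
    __syncthreads();                        // make the stores visible block-wide
    if (t < n) out[t] = tile[n - 1 - t];    // load afterwards
}

// Hypothetical host helper: returns true if the GPU output is the reversed input.
// cudaMemcpy only touches the global buffers d_in and d_out; there is no
// API call here (or anywhere) that addresses `tile` from the host.
bool reverse_on_gpu(int n)
{
    assert(n >= 1 && n <= 256);
    int h_in[256], h_out[256];
    for (int i = 0; i < n; ++i) h_in[i] = 3 * i + 1;

    int *d_in = nullptr, *d_out = nullptr;
    size_t bytes = n * sizeof(int);
    if (cudaMalloc(&d_in, bytes) != cudaSuccess) return false;
    if (cudaMalloc(&d_out, bytes) != cudaSuccess) { cudaFree(d_in); return false; }
    cudaMemcpy(d_in, h_in, bytes, cudaMemcpyHostToDevice);

    reverse_tile<<<1, 256>>>(d_in, d_out, n);
    cudaError_t err = cudaMemcpy(h_out, d_out, bytes, cudaMemcpyDeviceToHost);
    cudaFree(d_in);
    cudaFree(d_out);
    if (err != cudaSuccess) return false;

    for (int i = 0; i < n; ++i)
        if (h_out[i] != h_in[n - 1 - i]) return false;
    return true;
}
```

Calling `reverse_on_gpu(256)` from a `main` compiled with `nvcc` should return `true` on a working device. Removing the store line would leave the loads reading whatever happened to be in shared memory.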

tera On 13 March 2017 at 17:50

"Does he mean when the local memory is first initialised?" - Yes.

You cannot cudaMemcpy() to local memory, because it is outside of the global address space. If you try to explicitly initialise local variables, the compiler generates stores to local memory, because local memory is private to each thread and the initialisation has to be repeated for every thread. So there is no way to have a defined value in local memory without writing it there first.
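
A rough sketch of what this looks like in code, under some assumptions: the names `local_array_demo` and `local_array_on_gpu` are made up for the example, and whether `scratch` actually ends up in local memory is the compiler's decision. A per-thread array indexed with a value only known at run time is a common case where it does, but the optimiser may still keep it in registers. Compiling with `nvcc -Xptxas -v` reports local memory usage per kernel, which is one way to check.

```cuda
#include <cassert>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical kernel: each thread fills a private array, then reads one
// element chosen at run time. If the compiler places `scratch` in local
// memory, the loop below becomes the stores that precede the load.
__global__ void local_array_demo(const int *idx, int *out, int n)
{
    int t = blockIdx.x * blockDim.x + threadIdx.x;
    if (t >= n) return;

    int scratch[32];                    // private to this thread
    for (int i = 0; i < 32; ++i)
        scratch[i] = t + i;             // stores issued by this thread

    out[t] = scratch[idx[t] & 31];      // load with a run-time index
}

// Hypothetical host helper: returns true if every thread read back the
// value it stored. The host only copies the global buffers d_idx and
// d_out; it has no way to name or fill `scratch`.
bool local_array_on_gpu(int n)
{
    assert(n >= 1);
    std::vector<int> h_idx(n), h_out(n, -1);
    for (int i = 0; i < n; ++i) h_idx[i] = (i * 7) % 32;

    int *d_idx = nullptr, *d_out = nullptr;
    size_t bytes = n * sizeof(int);
    if (cudaMalloc(&d_idx, bytes) != cudaSuccess) return false;
    if (cudaMalloc(&d_out, bytes) != cudaSuccess) { cudaFree(d_idx); return false; }
    cudaMemcpy(d_idx, h_idx.data(), bytes, cudaMemcpyHostToDevice);

    int threads = 128;
    int blocks = (n + threads - 1) / threads;
    local_array_demo<<<blocks, threads>>>(d_idx, d_out, n);
    cudaError_t err = cudaMemcpy(h_out.data(), d_out, bytes, cudaMemcpyDeviceToHost);
    cudaFree(d_idx);
    cudaFree(d_out);
    if (err != cudaSuccess) return false;

    for (int i = 0; i < n; ++i)
        if (h_out[i] != i + h_idx[i]) return false;
    return true;
}
```

Each thread runs its own copy of the initialisation loop, which is the sense in which a store always comes before a load: nothing else could have put a defined value into that thread's `scratch`.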
