Consider the following code:
// In the interrupt handler file:
volatile uint32_t gSampleIndex = 0; // declared 'extern'
void HandleSomeIrq()
{
gSampleIndex++;
}
// In some other file
void Process()
{
uint32_t localSampleIndex = gSampleIndex; // will this be optimized away?
PrevSample = RawSamples[(localSampleIndex + 0) % NUM_RAW_SAMPLE_BUFFERS];
CurrentSample = RawSamples[(localSampleIndex + 1) % NUM_RAW_SAMPLE_BUFFERS];
NextSample = RawSamples[(localSampleIndex + 2) % NUM_RAW_SAMPLE_BUFFERS];
}
My intention is that PrevSample
, CurrentSample
and NextSample
are consistent, even if gSampleIndex
is updated during the call to Process()
.
Will the assignment to the localSampleIndex
do the trick, or is there any chance it will be optimized away even though gSampleIndex
is volatile?
In your function you access
volatile
variable just once (and it's the onlyvolatile
one in that function) so you don't need to worry about code reorganization that compiler may do (andvolatile
prevents). What standard says for these optimizations at §5.1.2.3 is:Note last sentence: "...no needed side effects are produced (...accessing a volatile object)".
Simply
volatile
will prevent any optimization compiler may do around that code. Just to mention few: no instruction reordering respect othervolatile
variables. no expression removing, no caching, no value propagation across functions.BTW I doubt any compiler may break your code (with or without
volatile
). Maybe local stack variable will be elided but value will be stored in a registry (for sure it won't repeatedly access a memory location). What you needvolatile
for is value visibility.EDIT
I think some clarification is needed.
Let me safely assume you know what you're doing (you're working with interrupt handlers so this shouldn't be your first C program): CPU word matches your variable type and memory is properly aligned.
Let me also assume your interrupt is not reentrant (some magic
cli
/sti
stuff or whatever your CPU uses for this) unless you're planning some hard-time debugging and tuning.If these assumptions are satisfied then you don't need atomic operations. Why? Because
localSampleIndex = gSampleIndex
is atomic (because it's properly aligned, word size matches and it'svolatile
), with++gSampleIndex
there isn't any race condition (HandleSomeIrq
won't be called again while it's still in execution). More than useless they're wrong.One may think: "OK, I may not need atomic but why I can't use them? Even if such assumption are satisfied this is an *extra* and it'll achieve same goal" . No, it doesn't. Atomic has not same semantic of
volatile
variables (and seldomvolatile
is/should be used outside memory mapped I/O and signal handling). Volatile (usually) is useless with atomic (unless a specific architecture says it is) but it has a great difference: visibility. When you updategSampleIndex
inHandleSomeIrq
standard guarantees that value will be immediately visible to all threads (and devices). with atomic_uint standard guarantees it'll be visible in a reasonable amount of time.To make it short and clear: volatile and atomic are not the same thing. Atomic operations are useful for concurrency, volatile are useful for lower level stuff (interrupts, devices). If you're still thinking "hey they do *exactly* what I need" please read few useful links picked from comments: cache coherency and a nice reading about atomics.
To summarize:
In your case you may use an atomic variable with a lock (to have both atomic access and value visibility) but no one on this earth would put a lock inside an interrupt handler (unless absolutely definitely doubtless unquestionably needed, and from code you posted it's not your case).