I have a case where there are two processes which act on the same file - one as a writer and one as a reader. The file is a one line text file and the writer re-writes the line in a loop. reader reads the line. The pseudo code looks like this:
Writer Process
char buf[][18] = {
"xxxxxxxxxxxxxxxx",
"yyyyyyyyyyyyyyyy"
};
i = 0;
while (1) {
pwrite(fd, buf[i], 18, 0);
i = (i + 1) % 2;
}
Reader Process
while(1) {
pread(fd, readbuf, 18, 0);
//check if readbuf is either buf[0] or buf[1]
}
After a while of running both processes, I could see that the readbuf
is either xxxxxxxxxxxxxxxxyy
or yyyyyyyyyyyyyyyyxx
.
My understanding was that the write would be atomic for sizes upto 512 bytes. but from my experiment, it looks like the atomicity is only for 16 bytes.
man pages doesn't say anything about atomicity for normal files, it mentions only about pipe atomicity for 512 bytes.
I have tried this with tmpfs and ext4 and the result are the same. with O_SYNC
, ext4 writes become atomic and I understand it because writes don't return until it hits the disk, but O_SYNC
doesn't help for tmpfs (/dev/shm
).
POSIX doesn't give any minimum guarantee of atomic operations for
read
andwrite
except for writes on a pipe (where a write of up toPIPE_BUF
(≥ 512) bytes is guaranteed to be atomic, but reads have no atomicity guarantee). The operation ofread
andwrite
is described in terms of byte values; apart from pipes, awrite
operation offers no extra guarantees compared to a loop around single-bytewrite
operations.I'm not aware of any extra guarantee that Linux would give, neither 16 nor 512. In practice I'd expect it to depend on the kernel version, on the filesystem, and possibly on other factors such as the underlying block device, the number of CPUs, the CPU architecture, etc.
The
O_SYNC
,O_RSYNC
andO_DSYNC
guarantees (synchronized I/O data integrity completion, given forread
andwrite
in the optional SIO feature of POSIX) are not what you need. They guarantee that writes are committed to persistent storage before theread
orwrite
system call, but do not make any claim regarding awrite
that is started while theread
operation is in progress.In your scenario, reading and writing files doesn't look like the right toolset.
mmap
if it is. This doesn't magically solve the atomicity problem, but is likely to improve the performance of a proper synchronization mechanism. To perform synchronization, there are two basic approaches:mmap
+msync
) or a different channel (e.g. pipe).msync
). Then the producer writes a well-known value to one machine word (asig_atomic_t
will typically work, even though its atomicity is formally guaranteed only for signals — or, in practice, auintptr_t
). The consumer reads that one machine word and only processes the corresponding data if this word has an acceptable value.