Below are the pseudo-code of the producer-consumer problem as given Operating System Concepts by Galvin, et. al.
We assume that the pool consists of
n
buffers, each capable of holding one item. The mutex binary semaphore provides mutual exclusion for accesses to the buffer pool and is initialized to the value1
.
//initialization
int n;
semaphore mutex = 1;
semaphore empty = n;
semaphore full = 0;
Below is the pseudo-code for the structure of the producer:
while (true)
{
. . .
/* produce an item in next_produced */
. . .
wait(empty);
wait(mutex);
. . .
/* add next produced to the buffer */
. . .
signal(mutex);
signal(full);
}
Below is the pseudo-code for the structure of the consumer:
while (true)
{
wait(full);
wait(mutex);
. . .
/* remove an item from buffer to next_consumed */
. . .
signal(mutex);
signal(empty);
. . .
/* consume the item in next consumed */
. . .
}
The above is as far as the text is concerned. Let me go a bit further and include the working of the buffer as well. Let char buffer[n]
be a shared array between the two processes acting as the buffer. So we have:
/*Structure of the producer- elaborated by me*/
int i=0;
while (true)
{
. . .
/* produce an item in next_produced */
. . .
wait(empty);
//wait(mutex);
. . .
buffer[i]=next_produced;
/* add next produced to the buffer */
i=(i+1)%n;
. . .
//signal(mutex);
signal(full);
}
/*Structure of the consumer elaborated by me*/
int i=0;
while (true)
{
wait(full);
//wait(mutex);
. . .
next_consumer=buffer[i];
/* remove an item from buffer to next_consumed */
i=(i+1)%n;
. . .
//signal(mutex);
signal(empty);
. . .
/* consume the item in next_consumed */
. . .
}
Though the text uses mutex locks before accessing the buffers, (as per their logic, since the buffer is a shared item, it should be accessed in a mutually exclusive manner), I do not think that is strictly necessary to use mutex while accessing the buffer elements, because, the producer and the consumer though can access the buffer simultaneously, but I guess they can never access the same the location of the buffer
array simultaneously. Since they cannot access the same location simultaneously, there is no possibility of race condition...
This is what I feel. Please correct me if I am wrong. Also please point me to the portion where I made the mistake. Thank you.
After digging the web for days, I think I've finally found the answer!
It has to do with "memory consistency", and as @jom-rogers noted, multiple readers/writers.
The solution was first offered by Dijkstra [1], who first introduces the solution without the mutex. Then he says:
Honestly, I don't understand the problem. However, to solve the problem, he adds the mutex to the solution. Then, he goes on to say (emphasis mine):
So, one aspect of the solution is to handle multiple producers and consumers.
But I also found the following excerpt in a thesis [2] (emphasis mine):
So, as he says, if some memory consistency assumptions are correct, the mutex can be removed. I checked the original paper by Lamport [3] that he cites, but the notation was too difficult for me to understand.
[1] Dijkstra, Edsger W. "Cooperating sequential processes." The origin of concurrent programming. Springer, New York, NY, 1968. 65-138.
[2] Frank, Reginald Austin. Designing a High Throughput Bounded Multi-Producer, Multi-Consumer Queue. Diss. 2021.
[3]Lamport, Leslie. "Specifying concurrent program modules." ACM Transactions on Programming Languages and Systems (TOPLAS) 5.2 (1983): 190-222.