CRITICAL_SECTION locking (enter) and unlocking (leave) are efficient because CS testing is performed in user space without making the kernel system call that a mutex makes. Unlocking is performed entirely in user space, whereas ReleaseMutex requires a system call.
I just read these sentences in this book.
What the kernel system call mean? Could you give me the function's name?
I'm a English newbie. I interpreted them like this.
- CS testing doesn't use a system call.
- Mutex testing uses a system call.(But I don't know the function name. Let me know)
- CS unlocking doesn't call a system call.
- Mutex unlocking requires a system call.(But I don't know the function name. Let me know)
Another question.
- I think CRITICAL_SECTION might call WaitForSingleObject or family functions. Don't these functions require a system call? I guess they do. So CS testing doesn't use a system call is very weird to me.
The implementation of critical sections in Windows has changed over the years, but it has always been a combination of user-mode and kernel calls.
The CRITICAL_SECTION is a structure that contains a user-mode updated values, a handle to a kernel-mode object - EVENT or something like that, and debug information.
EnterCriticalSection uses an interlocked test-and-set operation to acquire the lock. If successful, this is all that is required (almost, it also updates the owner thread). If the test-and-set operation fails to aquire, a longer path is used which usually requires waiting on a kernel object with
WaitForSignleObject
. If you initialized withInitializeCriticalSectionAndSpinCount
thenEnterCriticalSection
may spin an retry to acquire using interlocked operation in user-mode.Below is a diassembly of the "fast" / uncontended path of
EnterCriticialSection
in Windows 7 (64-bit) with some comments inlineSo the bottom line is that if the thread does not need to block it will not use a system call, just an interlocked test-and-set operation. If blocking is required, there will be a system call. The release path also uses an interlocked test-and-set and may require a system call if other threads are blocked.
Compare this to Mutex which always requires a system call
NtWaitForSingleObject
andNtReleaseMutant