I'm studying Tarjan's algorithm for strongly-connected components and the way it works is clear to me. Anyway there's a line I don't understand:
// Consider successors of v
for each (v, w) in E do
if (w.index is undefined) then
// Successor w has not yet been visited; recurse on it
strongconnect(w)
v.lowlink := min(v.lowlink, w.lowlink)
else if (w.onStack) then
// Successor w is in stack S and hence in the current SCC
v.lowlink := min(v.lowlink, w.index) // *************
end if
end for
I marked the line with asterisks. Why should we take the discovery index/time of a node when encountering a back-edge
v.lowlink := min(v.lowlink, w.index)
rather than just grabbing its component value?
v.lowlink := min(v.lowlink, w.lowlink)
I can't think of a case where this would be a problem.
Can someone enlighten me please? Edit: I suspect this is only a semantic requirement, i.e. lowlink being defined as the earliest ancestor reachable from the node with only one back-edge, but this is just a wild guess.
The correctness proof goes through if
w.lowlink
is at least the lowest index reachable fromw
and at most the lowest index reachable fromw
using at most one back edge. Component detection just requires us to know if we can "escape" to a lower index.Probably the reason that it's presented the way that it is is that one can imagine
lowlink
only being set in post-order, and then your variation wouldn't be well defined. (The Wikipedia pseudocode initializes it in pre-order toindex
.)