Given the following list of redirects
[
{
"old": "a",
"target": "b"
},
{
"old": "b",
"target": "c"
},
{
"old": "c",
"target": "d"
},
{
"old": "d",
"target": "a"
},
{
"old": "o",
"target": "n"
},
{
"old": "n",
"target": "b"
},
{
"old": "j",
"target": "x"
},
{
"old": "whatever",
"target": "something"
}
]
Here we can see that the first item "a" should redirect to "b". If we follow the list we can see the following pattern:
a -> b
b -> c
c -> d
d -> a
So we would end up with a circular reference since "a" would end up pointing to "d" and "d" points to "a".
What would be the most efficient way of finding the circular references?
I've come up with the following algorithm in C#
var items = JsonConvert.DeserializeObject<IEnumerable<Item>>(json)
.GroupBy(x => x.Old)
.Select(x => x.First())
.ToDictionary(x => x.Old, x => x.Target);
var circulars = new Dictionary<string, string>();
foreach (var item in items)
{
var target = item.Value;
while (items.ContainsKey(target))
{
target = items[target];
if (target.Equals(item.Key))
{
circulars.Add(target, item.Value);
break;
}
}
}
This will give me a list containing 4 items looking like this:
[
{
"old": "a",
"target": "b"
},
{
"old": "b",
"target": "c"
},
{
"old": "c",
"target": "d"
},
{
"old": "d",
"target": "a"
}
]
But im only interested in telling the user something like
"Hey you can't do that, It will be a circular reference because of "a" points to "b" which points to "c" which points to "d" which points to "a"
So, do you guys have any suggestions? Im sure that some other(better) algorithms exists for doing this... :)
While the generic graph-cycle-finding algorithms will work, your case is a bit special due to the "Old is unique, target is not" constraint. It effectively means, that each node can only have a single successor and thus it can at most be part of one cycle. Also, when DFS-Traversing the nodes, there won't be any fork, so an iterative DFS implementation becomes very easy.
Given an arbitrary starting node, this function can find a cycle that is reachable from the starting node:
In order to maintain efficiency, it can be extended to early-return when an already checked node is reached (using
previouslyVisited
):The following function is used to maintain consistency of the visited sets
It is called for each
old
node in order to check for cycles. When a cycle starting node is detected, the cycle information can be separately created:Output your found cycles any way you want. For example
The implementation of
GetCycleMembers
is pretty easy - it depends on a correct starting node: