I would like to ask if someone knows a performant way to store the path from the root node to a new node of a multiway tree during the insertion of the new node. E.g., if I have the following tree:
For each node, I currently store an array of the path to the node from the root node during insertion in the following way by assigning a unique int
ID to each children on the same depth:
Root node -> [1]
Depth 1, child 1 of root -> [1, 1]
Depth 1, child 2 of root -> [1, 2]
Depth 2, child 1 of parent 1 -> [1, 1, 1]
Depth 2, child 2 of parent 1 -> [1, 1, 2]
Depth 2, child 3 of parent 1 -> [1, 1, 3]
Depth 2, child 1 of parent 2 -> [1, 2, 4]
Depth 2, child 2 of parent 2 -> [1, 2, 5]
Depth 3, child 1 of parent 3 -> [1, 1, 3, 1]
...
If I now insert a new node from the leaf node 1
on depth 3, I would have to create a new path array for it storing all the nodes of the parent 1
(i.e. [1, 1, 3, 1]
) plus the new child ID, which is 1
for the first child:
Depth 4, child 1 of parent 1 -> [1, 1, 3, 1, 1]
As my tree grows a lot in height (the number of children per depth is relatively low, but the depth can be high), the slow part of this algorithm would be this array recreation process. Just imagine a tree of depth 1.000.000
, if I insert a new node from a node of depth 1.000.000
, I would have to create a new array for this new node storing all the 1.000.001
IDs of the parent plus appending the new node's ID:
Depth 1.000.001, child 1 of parent x -> [...1 million and one IDs... , 1]
Is there a more efficient way to store the path on each node during node's insertion?
I basically need this to determine if any given node is a child of a possible parent node in the tree and as I have the path stored in each node, I can easily do that by checking the path array of the child, like this:
// Ex. 1
Is node 4 of depth 2 the parent/ancestor of node 1 of depth 3?
node 1 of depth 3 has the following path array: pathArray = [1, 1, 3, 1]
Its ancestor ID on depth 2 is: pathArray[2] -> 3
3 != 4 and therefore I know that node 4 of depth 2
is not a parent of node 1 of depth 3.
// Ex. 2
Is node 1 of depth 1 the parent/ancestor of node 1 of depth 3?
node 1 of depth 3 has the following path array: pathArray = [1, 1, 3, 1]
Its ancestor ID on depth 1 is: pathArray[1] -> 1
1 == 1 and therefore I know that node 1 of depth 1
is a parent of node 1 of depth 3.
This lookup operation would be fast, the problem is the creation of the path array as the tree goes deeper.
Any suggestions would be appreciated.
Thank you for the attention.
Arrays have all their values stored contiguously in the memory. If you want to retain this property you must use them. Or, if you are OK with hoping through several memory locations, you can store in every node only its immediate parent and trace up to the required level to do the check required.