Readying here and there how to solve my use case, i came up with the following to deduplicate the result of a query i am performing.
Basically, in the code below, in the Links Table, we have Nodes and Controls.
Controls are things that Link Nodes together with a directionality expressed in the nType.
However Each row only gives half of the info. In other words, each row is a link between a node and a control.
I am trying to create a table where there is both nodes and the control in the middle.
The issue is repetition, i.e. duplicate.
If we have
Protein01 Biding Ntype0
Molecule01 Binding Ntype 1
In the end i want one row
Protein01 Biding Ntype0 Molecule01 Ntype 1
The problem is given the self join here, i get
Protein01 Biding Ntype0 Molecule01 Ntype 1
Molecule01 Biding Ntype1 Protein01 Ntype 0
I need to eliminate one of the row, and i wrote the code below. But then i get the nasty error.
Ora-ORA-01445: cannot select ROWID from, or sample, a join view without a key-preserved table
What am i doing wrong ?
With dup AS
(
select "Links"."idControl",
"ControlTypes"."sObjectTypeName" as "ControlType",
"Links"."idNode",
"NodeTypes"."sObjectTypeName" as "NodeType",
"Links"."nLinkType" as "nLinkType1",
"Links2"."idNode" as "idNode2",
"Links2"."sObjectTypeName" as "Node2Type",
"Links2"."nLinkType" as "nLinkType2"
from "Links"
join "Nodes" on "Links"."idNode" = "Nodes".ID
join "Controls" on "Links"."idControl" = "Controls".ID
join "ObjectTypes" "NodeTypes" on "Nodes"."idObjectType" = "NodeTypes".ID
join "ObjectTypes" "ControlTypes" on "Controls"."idObjectType" = "ControlTypes"."ID"
join (select "Links2"."idNode",
"Links2"."idControl",
"NodeTypes2"."sObjectTypeName",
"Links2"."nLinkType"
from "Links" "Links2"
join "Nodes" "Nodes2" on "Links2"."idNode" = "Nodes2".ID
join "Controls" "Controls2" on "Links2"."idControl" = "Controls2".ID
join "ObjectTypes" "NodeTypes2" on "Nodes2"."idObjectType" = "NodeTypes2".ID
join "ObjectTypes" "ControlTypes2" on "Controls2"."idObjectType" = "ControlTypes2"."ID"
) "Links2" On "Links2"."idControl" = "Links"."idControl" and "Links2"."idNode" != "Links"."idNode"
)
select * from dup a where rowid in ( select max(rowid) from dup b where a."idNode" = b."idNode2");
Note that the Links Table has a primary key which Links.ID
Your sample view is quite large and sample data is very unclear so I will use some synthetic data and view.
To deduplicate the rows you just need to have some other identifier instead of rowid. Also you can apply some analytic function over the view to determine which row to leave. From the question it looks like you need to leave random row. In this case you can use
row_number
to create identifier within the column group which should be deduplicated. Assume that you have the view with columnskey1, key2, tail1, tail2, tail3
and thekey1, key2
combination should be unique. In this case you can applyAs the other option - keeping the code like in question, you can just use
rownum
pseudocolumn inside view to create unique row identifier and then use it instead of rowid likeSee also fiddle