Let's say we have two tables:
CREATE TABLE element (
pk1 BIGINT NOT NULL,
pk2 BIGINT NOT NULL,
pk3 BIGINT NOT NULL,
-- other columns ...
PRIMARY KEY (pk1, pk2, pk3)
);
CREATE TYPE element_pk_t AS (
pk1 BIGINT,
pk2 BIGINT,
pk3 BIGINT
);
CREATE TABLE collection (
id BIGINT,
elements element_pk_t[] NOT NULL,
);
The element
has a composite PK. The custom type element_pk_t
registers a matching composite type. The collection
table contains array of element_pk_t
.
I want to query all rows from table element
where the PK matches an element in a chosen collection.elements
, in a single query.
What I've tried:
SELECT *
FROM element
WHERE (pk1, pk2, pk3) IN (SELECT unnest(elements)
FROM collection
WHERE id = 1);
I get an error in the IN
clause:
ERROR: subquery has too few columns
However, this works:
SELECT *
FROM element
WHERE (pk1, pk2, pk3) IN ((1, 2, 3), (4, 5, 6));
So it seems that the problem is how to expand the customized type element_pk_t
to 3 columns that can match (pk1, pk2, pk3)
.
This works:
Or more verbose, but preferable:
More robust and avoids evaluating
unnest()
multiple times. See:This works, too:
The core of the problem is that
IN
taking a subquery knows two separate forms. Quoting the manual:Your failing query resolves to the second form, while you (understandably) expect the first. But the second form does this:
My first and second query make it work by decomposing the row type to the right of the operator. So Postgres has three
bigint
values left and right and is satisfied.My third query makes it work by nesting the row type to the left in another row constructor. Postgres only decomposes the first level and ends up with a single composite type - matching the single composite type to the right.
Note that the keyword
ROW
is required for the single field we are wrapping. The manual:Your working query is subtly different as it provides a list of values to the right instead of a subquery (set). That's a different implementation taking a different code path. It even gets a separate chapter in the manual. This variant has no special treatment for a ROW constructor to the left. So it just works as expected (by you).
More equivalent (working) syntax variants with
= ANY
:Also valid with
(pk1, pk2, pk3)::element_pk_t
orROW(pk1, pk2, pk3)::element_pk_t
See:
Since your source is an array, Daniel's second query with
(e.pk1, e.pk2, e.pk3) = ANY(c.elements)
lends itself naturally.But for a bet on the fastest query, my money is on my second variant, because I expect it to use the PK index optimally.
Just as proof of concept. Like a_horse commented: a normalized DB design will probably scale best.