I have the following table:
CREATE TABLE tbl (tbl_id int, messages jsonb);
CREATE INDEX index_tbl ON tbl USING gin (messages);
And the JSON messages as array:
[{"user_id":1,"created_at":"2016-12-20","content":"Suspendisse accumsan tortor quis turpis. Sed ante."},
{"user_id":2,"created_at":"2016-12-20","content":"Morbi sem mauris tibulum sagittis sapien."},
{"user_id":1,"created_at":"2016-10-21","content":"In blandit ultrices enim. Phasellus id sapien in sapien iaculis congue."},
{"user_id":3,"created_at":"2016-11-20","content":"Quisque ut erat. Curabitur gravida nisi at nibh. In hac habitasse platea dictumst."},
{"user_id":4,"created_at":"2016-12-21","content":"Nunc rhoncus dui vel sem. Sed sagittis. ectus. Pellentesque at nulla. Suspendisse potenti. Cras in purus eu magna vulputate luctus."},
{"user_id":6,"created_at":"2016-12-21","content":"Praesent id massa id nisl venenatis lacinia. iaculis congue."}]
How can I select the messages where the user is in only one month? For example, users of the month of December only return me the messages of users 2, 4 and 6 because user 1 is in the month of November as well.
This returns all ..
Unnest the JSON array with
jsonb_array_elements()
.Extract values for relevant keys. Only the month of the date is relevant, use
date_trunc()
and cast the result back todate
.Run the actual query on the CTE
msg
resulting from step 1. and 2. Identify users that posted in only one month - across the whole table, not just within one row. You did not clarify, I chose this interpretation.Self-join to the same CTE
msg
to return all messages of identifies users.The GIN index is not going to help, since you need to inspect every single array element either way.
Storing messages in a normalized table to begin with would be simpler and faster for your query: