I have a query that returns either a 1 or 0 based on whether or not an event occurred on a given date. This is ordered by date. Basically, a simple result set is:
Date | Type
---------------------
2010-09-27 1
2010-10-11 1
2010-11-29 0
2010-12-06 0
2010-12-13 1
2010-12-15 0
2010-12-17 0
2011-01-03 1
2011-01-04 0
What I would now like to be able to do is to count the number of separate, non-concurrent instances of '0's there are - i.e. count how many different groups of 0s appear.
In the above instance, the answer should be 3 (1 group of 2, then another group of 2, then finally 1 to end with).
Hopefully, the above example illustrates what I am trying to get at. I have been searching for a while, but am finding it difficult to succinctly describe what I am looking for, and hence haven't found anything of relevance.
Thanks in advance,
Josh
This is a variant of the "islands" problem. My first answer uses Itzik Ben Gan's double
row_number
trick to identify contiguous groups of data efficiently. The combination ofType,Grp
identifies each individual island in the data.You can read more about the different approaches to tackling this problem here.
My second answer requires a single pass through the data. It is not guaranteed to work but is on the same principle as a technique that many people successfully use to concatenate strings without problems.