R's duplicated
returns a vector showing whether each element of a vector or data frame is a duplicate of an element with a smaller subscript. So if rows 3, 4, and 5 of a 5-row data frame are the same, duplicated
will give me the vector
FALSE, FALSE, FALSE, TRUE, TRUE
But in this case I actually want to get
FALSE, FALSE, TRUE, TRUE, TRUE
that is, I want to know whether a row is duplicated by a row with a larger subscript too.
duplicated
has afromLast
argument. The "Example" section of?duplicated
shows you how to use it. Just callduplicated
twice, once withfromLast=FALSE
and once withfromLast=TRUE
and take the rows where either areTRUE
.Some late Edit: You didn't provide a reproducible example, so here's an illustration kindly contributed by @jbaums
Edit: And an example for the case of a data frame: