Given the following example
status <- c("Open", "In Progress", "DevTest", "Stage Test: mw", "Stage Test: customer", "DevDone", "Done")
a <- c("Open, Open")
b <- c("Open, In Progress, DevTest, DevTest")
c <- c("DevTest, Done")
d <- c("Done, Done")
data <- tibble(status = c(a, b, c, d))
Now I want mutate an additional column with the following condition
- If status only contains "Open" -> Open
- If status contains "In Progress" or "DevTest" -> "In Progress"
- If status contains only "Done" -> "Done"
So the result should look like
status | status_simple |
---|---|
Open, Open | Open |
Open, In Progress, DevTest, DevTest | In Progress |
DevTest, Done | In Progress |
Done, Done | Done |
This is a case where
case_when
can be very useful, to sort through the conditions and their consequence, in combination withgrepl
, to search for patterns in the character strings (and returns boolean vectors, which is whatcase_when
expects):And the result: