I have a dataframe that contains mostly characters that I'm matching to a list of patterns. I'm trying to add a column on the end that counts how many times there's a hit to the list in that row. Basically, what I'm going for is
patterns <- c("Yes", "No", "Maybe")
df <- data.frame (first_column = c("Why", "Sure", "But", ...),
second_column = c("Yes", "Okay", "If Only" ...),
third_column = c("No", "When", "Maybe so" ...),
fourth_column = c("But", "I won't", "Truth" ...)
)
and after running the code, down a fifth column labeled "counter", you'd see 2, 0, 1,... Right now I'm accomplishing this with a pair of nested for loops, with an if statement inside. It works on toy datasets, but I think it will break if I try it on the full size data. Is there a better way using dplyr, grepl, or lapply? My instinct says dplyr, but I'm not sure how to do it. My code is below:
filename = choose.files(caption='Select File')
cases = read.csv(filename)
cases = cbind(cases, counter=0)
l = nrow(cases)
col = ncol(cases)
for (i in 1:l){
for (j in 1:col){
if(cases[i,j] %in% patterns)
{
cases$counter[i]=cases$counter[i]+1
}
}
}
Try this
Output