Imagine a very simple csv file that looks like this:
output,input,tags
0,0,sunday-hot
1,3,sunday
5,1,hot-random
where the tags column is made up of a series of keywords all separated by "-".
What I would like is to turn it into a dataset that looks like this:
output,input,sunday,hot,random
0,0,T,T,F
1,3,T,F,F
5,1,F,T,T
Here is one way to do it with dplyr/tidyr. Use
separate_rows()
to separate the pasted tags column to a normal column which contains one value in each cell; Create a value column that contains all TRUE to spread on; Fill missing values with FALSE: