This has been asked in various forms before, but I'm attempting it in a slightly different way and can't seem to get it exactly right. When I use this code:
d %>%
drop_na(attend) %>%
count(race, attend) %>%
group_by(race) %>%
mutate(percent = n/sum(n)*100) %>%
ggplot(aes(race, percent, fill = race)) +
geom_col(position = "dodge")
I get this figure:
The 'attend' variable is just 0s and 1s, and I want to display the percent of 1s within each race. I think that those lines that are showing up inside the charts are actually correct, but what's going on with the rest of those columns? I can't quite figure out that last step.
To achieve your desired result filter your data for
attend == 1
values after computing the percentages.Note: The blacks lines appear because of overplotting, i.e. as you set
position = "dodge"
the bars forattend=0
andattend=1
are plotted on top of each other.Using some random example data: