How do I add a percentage to a countplot?

1.4k Views Asked by At

I need to show a percentage for my bar graph. However I am not sure how to do it.

sns.set_style('whitegrid')
sns.countplot(y='type',data=df,palette='colorblind')
plt.xlabel('Count')
plt.ylabel('Type')
plt.title('Movie Type in Disney+')
plt.show()

enter image description here

2

There are 2 best solutions below

0
On BEST ANSWER
ax = sns.countplot(y='type', data=df, palette='colorblind')

# get the total count of the type column
total = df['type'].count()

# annotate the bars with fmt from matplotlib v3.7.0
ax.bar_label(ax.containers[0], fmt=lambda x: f'{(x/total)*100:0.1f}%')

# add space at the end of the bar for the labels
ax.margins(x=0.1)

ax.set(xlabel='Count', ylabel='Type', title='Movie Type in Disney+')
plt.show()

enter image description here

  • The same implementation also works for vertical bars
ax = sns.countplot(x='type', data=df, palette='colorblind')

# get the total count of the type column
total = df['type'].count()

# annotate the bars with fmt from matplotlib v3.7.0
ax.bar_label(ax.containers[0], fmt=lambda x: f'{(x/total)*100:0.1f}%')
plt.show()

enter image description here

  • v3.4.0 <= matplotlib < v3.7.0 use the labels parameter.
# for horizontal bars
labels = [f'{(w/total)*100:0.1f}%' if (w := v.get_width()) > 0 else '' for v in ax.containers[0]]

# for vertical bars
# labels = [f'{(h/total)*100:0.1f}%' if (h := v.get_height()) > 0 else '' for v in ax.containers[0]]

ax.bar_label(ax.containers[0], labels=labels)
0
On

The beginning code looks great! You just have to calculate the percentage values for each bar by dividing the width of the bar by the total count and multiplying by 100. Then use the annotate function to add those values you calculated as text to the bar. Try the code below and see if it works out for you!

import seaborn as sns
import matplotlib.pyplot as plt

sns.set_style('whitegrid')

# Create the countplot and naming it 'plot'. 
plot = sns.countplot(y='type', data=df, palette='colorblind')

plt.xlabel('Count')
plt.ylabel('Type')
plt.title('Movie Type in Disney+')

total = len(df['type']) 

for p in plot.patches:
    percentage = '{:.1f}%'.format(100 * p.get_width() / total)
    x = p.get_x() + p.get_width() + 0.02
    y = p.get_y() + p.get_height() / 2
    plot.annotate(percentage, (x, y))
plt.show()