How to remove outliers from Seaborn boxplot charts

In this article, I am going to show you how to remove outliers from Seaborn boxplots. First, I am going to plot a boxplot without modifications. Then, I will remove all of the outliers. In the end, I am going to restore outliers, but this time I am going to make them less prominent.

Boxplot with outliers

Let’s start with plotting the data I already have.

import seaborn as sb

sb.boxplot(x = 'Value', data = with_merged)

Would you like to help fight youth unemployment while getting mentoring experience?

Develhope is looking for tutors (part-time, freelancers) for their upcoming Data Engineer Courses.

The role of a tutor is to be the point of contact for students, guiding them throughout the 6-month learning program. The mentor supports learners through 1:1 meetings, giving feedback on assignments, and responding to messages in Discord channels—no live teaching sessions.

Expected availability: 15h/week. You can schedule the 1:1 sessions whenever you want, but the sessions must happen between 9 - 18 (9 am - 6 pm) CEST Monday-Friday.

Check out their job description.

(free advertisement, no affiliate links)

Boxplot without outliers

To remove the outliers from the chart, I have to specify the “showfliers” parameter and set it to false.

sb.boxplot(x = 'Value', data = with_merged, showfliers = False)

Change the outliers style

In the next example, I am going to change the size of the outliers markers to make them less distracting for people who look at the chart.

sb.boxplot(x = 'Value', data = with_merged, flierprops = dict(markerfacecolor = '0.50', markersize = 2))

Remember to share on social media!
If you like this text, please share it on Facebook/Twitter/LinkedIn/Reddit or other social media.

If you want to contact me, send me a message on LinkedIn or Twitter.

Bartosz Mikulski
Bartosz Mikulski * MLOps Engineer / data engineer * conference speaker * co-founder of Software Craft Poznan & Poznan Scala User Group

Subscribe to the newsletter and get access to my free email course on building trustworthy data pipelines.