In the previous article, I wrote about outlier detection using a simple statistical technique called Z-score. While that’s an easy way to create a filter for screening outliers, there’s even a better way to do it — using boxplots.

Boxplots are an excellent statistical technique to understand the distribution, dispersion and variation of univariate and categorical data— all in a single plot.

The purpose of this article is to introduce boxplot as a tool for outlier detection, and I’m doing so focusing on the following areas:

  • the statistical intuition behind boxplots
  • how they are used in outlier detection
  • a tiny bit of programming

#machine-learning #data-science #anomaly-detection #outlier-detection #algorithms

Boxplot for anomaly detection
1.35 GEEK