Box Plot (original) (raw)

Last Updated : 8 Dec, 2025

A Box Plot is also known as a Box and Whisker Plot and it is a graphical tool used to understand the distribution of numerical data. It shows the median, quartiles and possible outliers in a simple visual form. It helps compare distributions between groups with ease.

Interquartile Range (IQR)

The box in a box plot represents the Interquartile Range (IQR).

\mathbf{IQR} = Q_3 - Q_1

It covers the middle 50% of the data, making it a strong measure of spread that is resistant to extreme values.

Identifying Outliers in a Box Plot

Outliers are values that lie outside the upper and lower limits:

\text{Lower Limit} = Q_1 - 1.5 \times \text{IQR}

\text{Upper Limit} = Q_3 + 1.5 \times \text{IQR}

Components of a Box Plot

A Box Plot visually represents the five-number summary of a dataset:

419171619

Box Plot

The box plot shown in the above diagram is a perfect plot with no skewness. The plots can have skewness and the median might not be at the center of the box.

How to Create a Box Plot

Consider the runs scored by a cricket team in 12 matches: 100, 120, 110, 150, 110, 140, 130, 170, 120, 220, 140, 110

**Step 1: Arrange Data in Ascending Order

**Step 2: **Find Quartiles and Median

**Step 3: Compute IQR

**Step 4: Calculate Limits

**Step 5: Identify Min, Max and Outliers

This information is then used to draw the box plot.

419171618

Box Plot

Use-Cases of Box Plots

Common use-cases include:

How to Compare Box Plots

Comparing multiple box plots helps understand differences between groups in terms of center, spread, skewness and outliers.

plot

Box Plot

1. Compare the Medians

2. Compare the Dispersion (Spread)

3. Compare Outliers

4. Compare Skewness

Difference Between Vertical and Horizontal Box Plot

Here we compare vertical and horizontal box plot

Feature Vertical Box Plot Horizontal Box Plot
Orientation Box is drawn vertically along the y-axis Box is drawn horizontally along the x-axis
Best For Comparing numerical values across categories. Showing distribution when category labels are long.
Readability Can become cramped with many categories. More readable when dataset has many labels.
Common Use Default plot in most statistical tools. Useful for survey data, ratings and long text labels.
Space Utilization Requires more vertical space. Utilizes horizontal space more efficiently.

Advantages

Limitation