What is a five number summary?

A five number summary describes a dataset using five values: the minimum, first quartile (Q1), median, third quartile (Q3), and maximum. Together they show the center and spread of a dataset without relying on the mean or standard deviation, which makes the summary useful for skewed data.

How do you calculate Q1 and Q3 by hand?

Sort the data, find the median, then split the dataset into a lower half and an upper half. Q1 is the median of the lower half and Q3 is the median of the upper half. Different textbooks handle the middle value differently when the dataset has an odd number of points, which is why two calculators can occasionally give slightly different quartiles for the same data.

How do you tell if a value is an outlier?

Calculate the interquartile range (IQR = Q3 − Q1), then build fences at Q1 − 1.5×IQR and Q3 + 1.5×IQR. Anything outside those fences is flagged as an outlier on a standard box plot. This rule, devised by John Tukey, is the most common outlier rule taught alongside the five number summary.

Five Number Summary Visualizer | Free Box Plot & Quartile Tool

Five Number Summary Visualizer

Computes Min, Q1, Median, Q3, Max, IQR, Outlier Fences

Paste your data — comma, space, or newline separated

Quartile method

Plot orientation

Chart title (optional)

Or upload a CSV / text file Reads the first column of numeric values it finds.

Mode Enter your own five numbers directly

Minimum

Median

Maximum

Plot orientation

Chart title (optional)

Mode Two box plots on one aligned axis

Dataset 1

Label

Data

Dataset 2

Label

Data

Chart title (optional)

Five Number Summary

The five values Min, Q1, Median, Q3, Max

IQR Q3 − Q1

Lower fence Q1 − 1.5 × IQR

Upper fence Q3 + 1.5 × IQR

Reading the Box Plot

The box spans Q1 to Q3 — the middle 50% of your data.
The line inside the box marks the median.
Whiskers reach to the most extreme points that aren't outliers.
Dots beyond the whiskers are flagged outliers.

Five Number Summary Guide

Read the full written explanation

Related Tools

Five Number Summary Examples

Browse worked datasets or build your own above

View:

What a Five Number Summary Actually Tells You

A five number summary is five values — minimum, first quartile (Q1), median, third quartile (Q3), and maximum — that describe where a dataset sits and how spread out it is. Unlike the mean and standard deviation, every one of these five numbers is a real data point (or sits exactly between two real data points), so a single huge value can't drag the whole summary off course the way it drags a mean.

That's the main reason statisticians reach for it on messy or skewed data: household income, hospital wait times, response times on a server, anything with a long tail. The five number summary is also what feeds directly into a box plot, so once you have the five values, the chart practically draws itself.

The Five Values, One at a Time

Value	Percentile	What it marks	Where it sits on a box plot
Minimum	0th	The smallest value in the dataset	End of the lower whisker (unless it's an outlier)
Q1	25th	Median of the lower half	Left/bottom edge of the box
Median	50th	Middle value of the sorted dataset	Line drawn inside the box
Q3	75th	Median of the upper half	Right/top edge of the box
Maximum	100th	The largest value in the dataset	End of the upper whisker (unless it's an outlier)

How to Calculate It by Hand

1. Sort the data. Smallest to largest. This step is non-negotiable — every later step assumes the list is in order.

2. Pull off the minimum and maximum. They're just the first and last entries in your sorted list.

3. Find the median. For an odd number of values, it's the middle one. For an even number, average the two middle values.

4. Split the data into a lower half and an upper half. Some methods include the median itself in both halves (the "inclusive" or Tukey hinge method); others leave it out of both halves (the "exclusive" method, used by Moore and McCabe and by most US statistics textbooks). This tool lets you pick either one, and it's worth checking which one your course or software uses, since the two methods can give different Q1/Q3 values on small odd-sized datasets.

5. Q1 is the median of the lower half. Q3 is the median of the upper half. Apply the same odd/even rule from step 3 to each half.

Worked Example

Nine students' exam scores: 84, 67, 92, 75, 54, 88, 79, 98, 71.

Sorted 54, 67, 71, 75, 79, 84, 88, 92, 98

With 9 values, the median is the 5th value: 79. Using the exclusive method, the lower half is {54, 67, 71, 75} and the upper half is {84, 88, 92, 98}. Q1 is the average of 67 and 71, which is 69. Q3 is the average of 88 and 92, which is 90. The minimum is 54 and the maximum is 98, so the five number summary reads 54, 69, 79, 90, 98.

From there, IQR = 90 − 69 = 21. The lower fence is 69 − 1.5(21) = 37.5 and the upper fence is 90 + 1.5(21) = 121.5. Both 54 and 98 fall inside those fences, so this dataset has no outliers.

The 1.5×IQR Outlier Rule

John Tukey proposed this rule in the 1970s as part of his work on exploratory data analysis, and it's still the default outlier check on almost every box plot drawn today. Take the interquartile range, multiply by 1.5, and use that as a buffer below Q1 and above Q3. Anything outside those two fences gets flagged and plotted as a separate point rather than absorbed into the whisker.

It's a useful default, not a law of nature. On a small dataset, one slightly unusual value can trip the fence even though nothing went wrong with the measurement. On a large dataset with heavy tails, the rule can flag dozens of points that are genuinely part of the distribution. Treat a flagged point as a prompt to look closer, not as automatic grounds to delete it.

A Second Worked Example, With an Outlier

Ten e-commerce order values: 12, 15, 22, 120, 18, 25, 31, 14, 19, 28.

Sorted 12, 14, 15, 18, 19, 22, 25, 28, 31, 120

With 10 values (even), the median is the average of the 5th and 6th values: (19 + 22) / 2 = 20.5. The lower half {12, 14, 15, 18, 19} has a median of 15 for Q1. The upper half {22, 25, 28, 31, 120} has a median of 28 for Q3. IQR = 28 − 15 = 13, so the upper fence sits at 28 + 1.5(13) = 47.5. The order of 120 is well past that fence, so it's plotted as an outlier and the upper whisker stops at 31, the largest value that isn't flagged.

Five Number Summary vs. Mean and Standard Deviation

Both approaches summarize a dataset, but they answer slightly different questions and behave differently when data misbehaves.

Question	Five Number Summary	Mean & Standard Deviation
Sensitive to one extreme value?	No — it's based on rank, not magnitude	Yes — one huge value moves both
Assumes a particular shape?	No — works on skewed or lumpy data	Most interpretation assumes roughly symmetric data
Easy to read off a chart?	Yes — that's exactly what a box plot shows	Needs a separate calculation or error bar
Uses every data point's exact value?	No — only rank order matters	Yes — every value contributes to the calculation

In practice, a lot of analysts report both. The five number summary (via a box plot) gives a fast visual read on skew and outliers; the mean and SD support the inferential tests — t-tests, confidence intervals, regression — that assume something closer to a normal distribution.

Common Mistakes

Forgetting to sort first.

Pulling the "middle" value from an unsorted list gives a number that has nothing to do with the actual median.

Mixing inclusive and exclusive quartile methods mid-comparison.

If you're comparing your hand calculation to a calculator or to Excel's QUARTILE function, check which convention each one uses. Small odd-sized datasets are where the two methods diverge most.

Drawing whiskers to the fence values instead of to actual data points.

The fences (Q1 − 1.5×IQR and Q3 + 1.5×IQR) are boundaries for the outlier test. The whisker itself should stop at the most extreme real value that falls inside those boundaries, not at the fence number itself.

Treating IQR as the same thing as variance.

IQR measures the width of the middle 50% of the data. It's a spread measure, but it isn't computed the same way as variance or standard deviation and the two won't generally match.

Quick Reference Code

If you're computing this in a script rather than by hand, here's the same calculation in Python and R.

# Python — pandas/numpy
import numpy as np

data = [54, 67, 71, 75, 79, 84, 88, 92, 98]
q1, median, q3 = np.percentile(data, [25, 50, 75])
iqr = q3 - q1
print(min(data), q1, median, q3, max(data), iqr)

# R
data <- c(54, 67, 71, 75, 79, 84, 88, 92, 98)
fivenum(data)  # returns min, Q1 (Tukey hinge), median, Q3 (Tukey hinge), max

Note that R's fivenum() uses Tukey's hinge method, while NumPy's default percentile() call uses linear interpolation — neither matches the exclusive method taught in many introductory statistics courses exactly. Always check which method a tool is using before comparing results across software.

Five Number Summary Visualizer

Five Number Summary Visualizer

Five Number Summary

Spread & Outlier Fences

Spread

Outlier Fences

Dataset 1 Summary

Dataset 2 Summary

Median Comparison

Five Number Summary Examples

What a Five Number Summary Actually Tells You

The Five Values, One at a Time

How to Calculate It by Hand

Worked Example

The 1.5×IQR Outlier Rule

A Second Worked Example, With an Outlier

Five Number Summary vs. Mean and Standard Deviation

Common Mistakes

Quick Reference Code

Related Topics

Frequently Asked Questions