BY: Statistics Fundamentals Team
Reviewed By: Minsa A (Senior Statistics Editor)

Box Plot Comparison Tool

Compare two to ten datasets side by side using box plots. Paste your data, upload a CSV, or load a sample set, and the tool works out the five-number summary, interquartile range, and outliers for each group automatically, then draws every box on the same scale so differences in center and spread are easy to see.

Box Plot Comparison Tool

Outlier rule Beyond Q1 − 1.5×IQR or Q3 + 1.5×IQR
Format Row 1 = dataset names, each column = one dataset

Example format: a header row like Class A,Class B,Class C, followed by one row of numbers per line, one value per column. Blank or non-numeric cells are skipped automatically. Loading a file switches you to the Enter Data tab with the columns filled in and the comparison generated.

Load one of these ready-made comparisons to see the tool in action, then edit the numbers to match your own data.

Box Plot Comparison Examples

Browse ready-made comparisons or generate your own above

View:

What Is a Box Plot Comparison Tool?

A box plot comparison tool takes several datasets and draws their box plots — also called box-and-whisker plots — on one shared scale, so you can line up medians, spreads, and outliers across groups without recalculating anything by hand. Each box plot on its own summarizes a dataset using five numbers: the minimum, the first quartile, the median, the third quartile, and the maximum. Put several of them side by side on the same axis, and a comparison that would otherwise take a table of statistics — or a wall of raw numbers — becomes something you can read in a few seconds.

This tool computes the five-number summary, the interquartile range, and any outliers for each dataset you enter, then renders the plots as a downloadable SVG. It's part of the free calculators and guides at Statistics Fundamentals, alongside the single-dataset Box Plot Generator and the site's other visual tools.

The Five-Number Summary

Every box plot is built from the same five values. Using a worked example of 12 quiz scores — 62, 68, 71, 74, 75, 77, 79, 80, 82, 85, 88, 95 — here's what each one means and how it's found:

StatisticWhat it tells youHow it's foundExample value
MinimumThe smallest value in the datasetFirst value once the data is sorted62
Q1 (first quartile)25% of the data falls below this valueMedian of the lower half of the sorted data72.5
Median (Q2)The middle value; 50% of the data falls below itMiddle of the sorted data78
Q3 (third quartile)75% of the data falls below this valueMedian of the upper half of the sorted data83.5
MaximumThe largest value in the datasetLast value once the data is sorted95

From there, the interquartile range is IQR = Q3 − Q1 = 83.5 − 72.5 = 11, and the outlier fences sit at Q1 − 1.5×IQR = 56 and Q3 + 1.5×IQR = 100. Since none of the 12 scores fall outside that range, this particular dataset has no outliers under the standard rule.

How to Compare Box Plots

Start with clean, grouped data. Each group you want to compare — a class, a region, a treatment arm — needs its own list of numeric values. This tool accepts pasted text (comma, space, or newline separated) or a CSV where each column is one group.
Generate the plots. The tool sorts each dataset, works out the five-number summary, and draws every box on the same vertical scale so heights and positions are directly comparable.
Compare the medians first. The line inside each box marks the median. A higher median means that group's typical value is larger, but check the spread before drawing conclusions from the median alone.
Compare the box heights, or IQR. A taller box means the middle 50% of that group's data is more spread out. Two groups can share a median and still have very different amounts of variability.
Look at the whiskers and any points beyond them. Whiskers show the typical range once outliers are set aside. Points plotted individually beyond the whiskers are outliers, flagged here using the 1.5×IQR rule by default.
Check whether a box is lopsided. If the median line sits closer to the top or bottom of the box, or one whisker is much longer than the other, that group's data is skewed rather than symmetric.

Applications of Box Plot Comparisons

Education: Teachers compare box plots of test scores across sections or years to see whether the whole distribution shifted, not just the average. A tight box with a high median points to a class doing well consistently; a wide box with a similar median points to a bigger gap between top and bottom performers.
Research: A study with a control group and one or more treatment groups can plot the outcome variable for each arm side by side. Boxes that barely overlap are an early visual signal worth following up with a formal test such as a t-test or ANOVA.
Business analytics: Analysts compare revenue, order size, or session duration across regions or channels. A box plot makes it easy to spot which segment has the widest spread, and which has outliers worth investigating, such as a handful of unusually large orders skewing a region's average.
Manufacturing & quality control: In statistical process control, box plots of a measured dimension from different machines or shifts reveal which source is running tighter to spec. A narrow box centered on the target value indicates a well-controlled process; points beyond the whiskers flag parts worth a closer look.
Healthcare: Clinical studies compare outcomes, such as blood pressure change, between treatment groups. Box plots surface non-responders and unusually strong responders as outliers, which matters clinically even when the group medians look similar.
Finance & data science: Before training a model or comparing portfolios, analysts use box plots to check for skew and outliers in each feature or return series. A feature with far more spread than the others, or several points beyond the fences, often needs a closer look before it goes into a model.

Box Plot Comparisons vs Other Charts

A box plot isn't always the right chart. Here's how it stacks up against the other charts people reach for when comparing groups or exploring a distribution.

FeatureBox PlotHistogramViolin PlotDot PlotScatter PlotBar Chart
Shows distribution shapePartialYesYesPartialNoNo
Shows outliers directlyYesNoPartialYesYesNo
Shows quartiles / IQRYesNoPartialNoNoNo
Shows the median directlyYesNoPartialNoNoNo
Compares many groups cleanlyExcellentFairGoodFairFairGood (means only)
Works well with large samplesYesYesYesNoYesYes

Glossary

TermDefinition
Box plotA chart that displays a dataset's five-number summary as a box with whiskers extending to either side.
Five-number summaryThe minimum, first quartile, median, third quartile, and maximum of a dataset.
QuartileOne of three values (Q1, Q2, Q3) that split ranked data into four equal-sized groups.
Interquartile range (IQR)The distance between the third and first quartiles (Q3 − Q1); the height of the box.
WhiskerThe line extending from the box to the smallest or largest value that isn't classified as an outlier.
OutlierA value far enough from the rest of the data, conventionally beyond 1.5×IQR from the nearest quartile, that it's plotted as an individual point.
FenceThe cutoff value (Q1 − 1.5×IQR or Q3 + 1.5×IQR) used to decide whether a point counts as an outlier.
SkewnessA measure of how asymmetric a distribution is; on a box plot it shows up as an off-center median or whiskers of very different lengths.

Related Topics

Sources & further reading:

  • NIST/SEMATECH Engineering Statistics Handbook — Box Plot
  • Tukey, J.W. (1977). Exploratory Data Analysis. Addison-Wesley. [Origin of the box-and-whisker plot and the 1.5×IQR outlier rule]
  • Khan Academy — Box Plot Review

Frequently Asked Questions

It's a calculator that takes two or more lists of numbers, works out the five-number summary — minimum, Q1, median, Q3, maximum — for each one, and draws the results as box plots on a shared scale. That makes it possible to see at a glance whether groups differ in their typical value, their spread, or the number of unusual observations, without doing the arithmetic by hand.

Line the two plots up on the same axis and look at three things: where the median sits, how tall each box is, and whether the boxes overlap. If the boxes barely overlap, the two groups are probably genuinely different; heavy overlap suggests the difference could just be noise. Enter both datasets in the Enter Data tab above to see this side by side.

A single box plot shows five things about a dataset: the minimum, the first quartile (Q1), the median, the third quartile (Q3), and the maximum. The box spans Q1 to Q3 — the middle 50% of the data — with a line at the median. Whiskers extend out to the smallest and largest values that aren't flagged as outliers, and anything beyond that is plotted as an individual point.

The standard method, often called Tukey's rule, flags any value more than 1.5 times the interquartile range (IQR) below Q1 or above Q3. The lower fence is Q1 − 1.5×IQR and the upper fence is Q3 + 1.5×IQR — anything outside that range is drawn as a separate point rather than folded into the whisker. This tool applies that rule automatically and can also flag extreme outliers past 3×IQR if you turn that option on in the Enter Data tab.

Both are conventions for deciding when a value is unusual enough to plot separately rather than extend the whisker to it. The 1.5×IQR fence, the default in most statistics courses and software, catches moderate outliers. The 3×IQR fence is stricter and only flags values that are far more extreme, sometimes labeled "extreme outliers" to distinguish them from the milder ones caught by 1.5×IQR.

Use a box plot when you're comparing three or more groups and want medians, spread, and outliers side by side without the chart getting cluttered. Use a histogram when you care about the actual shape of a single distribution — whether it's bimodal, for instance — since a box plot compresses that shape down to five numbers. For a single-distribution histogram, see our Histogram Maker.