Descriptive Statistics Central Tendency Frequency Distribution 20 min read May 24, 2026
BY: Statistics Fundamentals Team
Reviewed By: Minsa A (Senior Statistics Editor)

What is Mode in Statistics? The Ultimate Guide to Central Tendency

A shoe retailer stocks a new model in every size from 5 to 13. After one month, sizes 9 and 10 have sold out while sizes 5 and 13 still sit on the shelf. The mean shoe size sold tells you little about which box to reorder. The mode — the size that sold most often — tells you exactly what to put back in stock.

This guide defines the mode precisely, shows you how to find it for both raw data and grouped frequency tables, explains what bimodal and multimodal distributions mean, and gives you an interactive calculator to work through your own datasets. It also covers Karl Pearson's empirical relationship connecting the mode, median, and mean.

What You'll Learn
  • ✓ The precise definition of mode and the two-word version that works for featured snippets
  • ✓ A four-step method for ungrouped data and the interpolation formula for grouped data
  • ✓ How to classify a dataset as unimodal, bimodal, multimodal, or no-mode
  • ✓ How mode, mean, and median shift on skewed versus symmetric distributions
  • ✓ Pearson's empirical formula and exactly when it breaks down
  • ✓ When the mode is more informative than the mean (with a decision guide)

What Is Mode in Statistics? (Simple Definition)

Definition — Measure of Central Tendency
In statistics, the mode (symbol: Mo) is the value — or values — that appears most often in a dataset. It is the only measure of central tendency that can be calculated for both numerical and categorical data. A dataset can have one mode, multiple modes, or no mode at all.
Mo = the value x where f(x) is maximum

Statisticians place the mode alongside the mean and median as the three standard measures of central tendency. Of the three, the mode is the simplest to compute and the only one that answers "what is the single most common value?" rather than some arithmetic average. According to the NIST/SEMATECH e-Handbook of Statistical Methods, the mode is particularly appropriate when the data are not quantitative — that is, when you are working with labels, categories, or names where arithmetic operations have no meaning.

The word "mode" comes from the French la mode, meaning "in fashion" — and the statistical sense follows: the mode is the most fashionable value, the one that shows up most in the crowd.

⚡ Quick Reference — Mode Key Facts
  • Symbol: Mo (mode) — no Greek letter; sometimes written as Z in older British textbooks
  • Data types: Works on nominal, ordinal, interval, and ratio data — the only central tendency measure that works on categorical labels
  • Outlier resistance: Completely unaffected by extreme values; adding 10,000 to {2, 5, 5, 7} leaves the mode at 5
  • Multiple modes: A dataset can be unimodal (1 mode), bimodal (2 modes), multimodal (3+), or have no mode if all values are distinct
  • Symmetric distributions: In a perfect normal distribution, Mean = Median = Mode
  • Grouped data: Requires the interpolation formula Mo = l + ((f₁−f₀)/(2f₁−f₀−f₂)) × h
Mo
Standard Symbol
0+
Modes Possible
4
Data Types Supported
100%
Outlier Resistant

How to Find the Mode: Ungrouped Data vs. Grouped Data

The procedure for finding the mode depends on whether your data is ungrouped (a list of raw values) or grouped into class intervals in a frequency table. Both are covered below with fully worked examples.

Mode for Ungrouped Data: Step-by-Step

📋
The 4-Step Method — Works for Any Raw Dataset

Step 1: Arrange values in ascending order. Step 2: Count how often each distinct value appears (its frequency). Step 3: Find the value with the highest frequency. Step 4: If one value has the highest count → unimodal. If two values tie → bimodal. If all values appear equally → no mode.

Worked Example 1 — Unimodal Dataset

Seven students' quiz scores: 72, 85, 85, 90, 68, 85, 91. Find the mode.

1

Arrange in ascending order: 68, 72, 85, 85, 85, 90, 91

2

Count each value's frequency: 68 → 1  |  72 → 1  |  85 → 3  |  90 → 1  |  91 → 1

3

Find the maximum frequency: The value 85 appears 3 times — more than any other value.

4

State the mode: Mo = 85. This is a unimodal dataset.

✓ Mode = 85. Three students scored 85, which is more than any other single score in the set.

Worked Example 2 — Bimodal Dataset

Daily high temperatures (°F) for 10 days: 72, 75, 75, 78, 80, 80, 82, 84, 86, 88. Find the mode.

1

Already sorted. Count each value: 72→1, 75→2, 78→1, 80→2, 82→1, 84→1, 86→1, 88→1

2

Maximum frequency: Both 75 and 80 appear twice — tied for the highest count.

3

State the modes: Mo₁ = 75 and Mo₂ = 80. This is a bimodal dataset.

✓ Both 75°F and 80°F are modes. When the temperature data is plotted as a histogram, two peaks appear at these two values.

The frequency table format — listing each value beside its count — is the most systematic way to find the mode without missing anything in a larger dataset. Penn State's introductory statistics course (STAT 100) presents frequency tables as the standard first step in identifying the mode for discrete data (Penn State STAT 100, Lesson 2.2).

Mode for Grouped Data: The Interpolation Formula

When data has been grouped into class intervals — the format you encounter in census data, income brackets, and exam grade ranges — the individual values are not visible. You cannot simply spot the most-frequent number. Instead, you first identify the modal class (the interval with the highest frequency), then apply an interpolation formula to estimate where within that interval the mode falls.

Mode Formula for Grouped Data
Mo = l + [ (f₁ − f₀) / (2f₁ − f₀ − f₂) ] × h
Apply when data is organized into class intervals with known frequencies
l = lower class limit of the modal class f₁ = frequency of the modal class f₀ = frequency of the class preceding the modal class f₂ = frequency of the class succeeding the modal class h = class width (size of each interval)
💡
Why an approximation, not an exact value?

Grouping data into intervals destroys information about individual values. The formula uses the relative frequency difference between the modal class and its neighbors to "pull" the estimate toward whichever side has the steeper drop. If f₀ is very small relative to f₂, the mode estimate sits closer to the lower limit l. If f₂ is very small, the estimate sits closer to l + h (the upper limit of the modal class).

Worked Example: Grouped Data Mode Calculation

Worked Example 3 — Grouped Data (Salary Brackets)

A company reports employee monthly salaries in the frequency table below. Find the modal salary.

Salary Class ($) Frequency (f) Notes
2,000 – 3,0008
3,000 – 4,00014← f₀ (class before modal)
5,000 – 6,00010← f₂ (class after modal)
6,000 – 7,0006
1

Identify the modal class: The class 4,000–5,000 has the highest frequency: f₁ = 22.

2

Read the required values: l = 4,000  |  f₁ = 22  |  f₀ = 14  |  f₂ = 10  |  h = 1,000

3

Plug into the formula:
Mo = 4,000 + [(22 − 14) / (2×22 − 14 − 10)] × 1,000
Mo = 4,000 + [8 / (44 − 24)] × 1,000
Mo = 4,000 + [8 / 20] × 1,000
Mo = 4,000 + 0.4 × 1,000 = 4,000 + 400

✓ Estimated Mode = $4,400 per month. The formula places the mode 40% of the way across the modal class interval, pulled toward the lower limit because f₀ (14) is larger than f₂ (10).

⚠️
When the formula becomes unstable

The denominator (2f₁ − f₀ − f₂) must never equal zero. This happens when f₀ + f₂ = 2f₁, which means the modal class has equal slopes on both sides. In practice, this is rare, but if it occurs, report the midpoint of the modal class (l + h/2) as a fallback estimate.

Classification of Distributions: From Unimodal to No Mode

Counting the modes of a distribution tells you something concrete about its shape. A single peak means data clusters around one central value. Two peaks suggest two distinct sub-groups within the data. The examples below use small, explicit datasets so you can trace the logic by hand.

Unimodal

One Mode

{3, 5, 5, 5, 8, 12}

The value 5 appears three times; all others appear once. Mode = 5. The histogram has one peak.

Bimodal

Two Modes

{10, 10, 15, 20, 25, 25, 30}

Both 10 and 25 appear twice; all others appear once. Modes = 10 and 25. Two histogram peaks.

Multimodal

Three or More Modes

{2, 2, 4, 4, 6, 6, 9}

Values 2, 4, and 6 each appear twice; 9 appears once. Modes = 2, 4, and 6. Three peaks.

No Mode

Uniform / No Repeat

{1, 2, 3, 4, 5, 6}

Every value appears exactly once. No frequency is higher than any other. This dataset has no mode.

Real-World Example — Bimodal Distribution

Restaurant Traffic by Hour

A restaurant records customers arriving per hour across a 12-hour day. Traffic peaks between 12–1pm (lunch) and again between 7–8pm (dinner). The hourly count distribution is bimodal, with two distinct modes. If a manager calculated only the mean arrival rate and used it for staffing, they would be systematically understaffed at both peak hours and overstaffed mid-afternoon. In this case, the bimodal structure contains more actionable information than any single average.

What "No Mode" Actually Means

A common misconception is that a dataset with no mode has a mode of zero. That is incorrect. "No mode" means the concept does not apply — every value is equally (un)common. The set {1, 2, 3, 4, 5} has no mode, not a mode of 0. This distinction matters when you are writing statistical reports: state "the distribution has no mode" rather than "the mode is zero," because zero is a valid data value in many contexts.

A uniform probability distribution — where every outcome has exactly the same probability — is the theoretical version of the no-mode scenario. Rolling a fair six-sided die produces values 1 through 6 with equal probability (1/6 each), so there is no modal value over the long run (MIT 18.600 Lecture Notes on Probability, Sheffield).

Interactive Mode Calculator

Enter your data points as comma-separated numbers. The calculator finds all modes, classifies the distribution type, and builds a frequency table with a full step-by-step breakdown.

Mode Calculator — Ungrouped Data

▶ Show frequency table

Visualizing the Mode: Peaks on Histograms and Probability Curves

On any histogram or probability density curve, the mode corresponds to the peak — the bar with the tallest height in a histogram, or the highest point on a continuous curve. This geometric interpretation is useful because it connects the abstract definition ("most frequent value") to a shape you can see and compare.

Symmetric bell curve showing mean, median, and mode all equal at the center peak

Symmetric Distributions: Mean = Median = Mode

In a perfectly symmetric, unimodal distribution (such as the normal distribution), the mean, median, and mode all coincide at exactly the same value — the center of the curve. This is because symmetry ensures no value is systematically pulled left or right, so the most common value, the middle value, and the arithmetic average all land at the same point. This is covered formally in any mathematical statistics text, including Wackerly, Mendenhall, and Scheaffer's Mathematical Statistics with Applications (8th ed., Cengage, 2008), which is standard reading at most university statistics programs.

Skewed Distributions: How the Mode, Median, and Mean Separate

Skewness pulls the mean and, to a lesser degree, the median away from the mode. The mode always sits at the peak — where the data is densest. The mean gets dragged toward the tail because it responds to every value arithmetically. The median lands between them. This ordering gives rise to Pearson's empirical rule, covered in the next section.

Side-by-side diagrams of positively skewed and negatively skewed distributions showing positions of mode, median, and mean on each curve

Case Study — Income Data in the United States

Why median household income is reported instead of mean income

The U.S. Census Bureau reports median household income rather than mean household income in its official summary tables (U.S. Census Bureau, Income and Poverty). The reason is skewness: because a small number of households earn very high incomes, the mean is consistently higher than the median. The mode of the income distribution — the income level that appears most frequently — sits lower still, typically near the most common income bracket. Reporting the mean alone would give a figure that exceeds the actual income of the majority of households.

The Empirical Relationship: Mean, Median, and Mode

Karl Pearson derived an approximate relationship between the three measures of central tendency that holds for moderately skewed, unimodal distributions. This formula lets you estimate any one of the three values from the other two — useful when only summary statistics are available and the raw data is not.

Pearson's Empirical Relationship
Mode ≈ 3(Median) − 2(Mean)
Valid for moderately skewed, unimodal, continuous distributions.
Rearranges to: Median ≈ (Mode + 2·Mean) / 3  |  Mean ≈ (3·Median − Mode) / 2
Worked Example — Estimating Mode from Mean and Median

A dataset has a mean of 60 and a median of 65. Estimate the mode.

1

Apply the formula: Mode ≈ 3(Median) − 2(Mean) = 3(65) − 2(60) = 195 − 120 = 75

2

Check the direction: Mode (75) > Median (65) > Mean (60). This ordering is consistent with a negatively skewed distribution, where the left tail drags the mean down below the median and mode. The empirical relationship is internally consistent.

✓ Estimated Mode ≈ 75. Note that this is an approximation. For exact results, you need the raw data or the full frequency distribution.

⚠️
When the empirical formula does not work

Pearson's rule breaks down for bimodal or multimodal distributions (it assumes a single peak), for heavily skewed datasets, and for discrete distributions with very few distinct values. Do not apply it mechanically without first checking that the distribution is roughly unimodal and moderately skewed. If the raw data is available, compute the mode directly.

Ordering of Mean, Median, and Mode by Skewness

Distribution Shape Ordering Example Context Pearson Rule Applies?
Perfectly symmetric (normal)Mean = Median = ModeHeights in a large homogeneous populationTrivially — all three equal
Positively skewed (right tail)Mode < Median < MeanIncome distributions, house pricesYes, for moderate skew
Negatively skewed (left tail)Mean < Median < ModeAge at retirement; exam scores with a ceilingYes, for moderate skew
BimodalNo fixed orderingHeights of adults (male + female combined)No — formula assumes unimodal
UniformMean = Median; No modeOutcomes of a fair die rollNot applicable (no mode)

When Should You Use the Mode Instead of the Mean?

The mean is the go-to measure of central tendency for most numerical data, but three specific situations make the mode the better — or the only — choice.

👗

Categorical / Nominal Data

What is the most popular shirt color sold this month? The most common voting choice? These questions have no numeric order, so mean and median are undefined. The mode is the only valid answer.

📦

Inventory & Demand Planning

A store stocking shoe sizes reorders based on the mode, not the mean. Knowing the average shoe size is 9.3 does not tell you what to put back on the shelf. The mode — the size that sold most often — does.

🔢

Discrete Data with Few Values

Survey responses on a 1–5 scale rarely have a meaningful mean (3.4 stars does not correspond to a real rating). The mode — the most common rating selected — is more directly interpretable.

📊

Outlier-Contaminated Datasets

When one or two extreme values would distort the mean, the mode is stable. The mode of {2, 5, 5, 7, 10,000} is 5, regardless of the outlier. The mean is 2003.8, which misrepresents the bulk of the data.

🏥

Healthcare: Most Common Diagnosis

In a clinic's monthly patient log, the modal diagnosis tells staff which condition to prepare for most often — a planning input that neither mean nor median diagnosis codes would provide.

🗳️

Voting and Elections

In a plurality election (first-past-the-post), the winner is the candidate chosen by the most voters — i.e., the modal choice. Mean and median are not defined for named candidates.

Mode Formula Glossary: All Notations Defined

The table below consolidates every term used in mode calculations so the formulas can be read without referring back to earlier sections. This format is designed for quick lookup when working through exam problems or grouped-data calculations.

Symbol / Term Notation Meaning Example Value
Mode (ungrouped)MoThe value that appears most often in raw dataMo = 85 in {68, 72, 85, 85, 85, 90, 91}
Modal classThe class interval with the highest frequency in a grouped table4,000–5,000 (frequency = 22)
Lower class limitlThe lowest value that can belong to the modal classl = 4,000
Modal class frequencyf₁The count of observations in the modal classf₁ = 22
Pre-modal frequencyf₀Frequency of the class immediately before the modal classf₀ = 14
Post-modal frequencyf₂Frequency of the class immediately after the modal classf₂ = 10
Class widthhThe range of each class interval (upper limit minus lower limit)h = 1,000
Empirical mode estimateMo ≈ 3Md − 2x̄Pearson's formula to approximate mode from median (Md) and mean (x̄)3(65) − 2(60) = 75
Unimodal|Mo| = 1Exactly one mode exists{3, 5, 5, 5, 8} → Mo = 5
Bimodal|Mo| = 2Exactly two modes tied at the highest frequency{10, 10, 25, 25} → Mo = 10, 25
Multimodal|Mo| ≥ 3Three or more modes{2, 2, 4, 4, 6, 6} → Mo = 2, 4, 6
No modeMo = ∅All values appear the same number of times (usually once){1, 2, 3, 4, 5, 6}

Mode vs. Mean vs. Median: Full Comparison

The three measures of central tendency answer different questions about a dataset. None is universally superior — the right choice depends on the data type, the distribution's shape, and what the number will be used for. The table below captures the key distinctions that appear on exams and in applied statistics work.

Property Mode (Mo) Median (Md) Mean (x̄)
DefinitionMost frequent valueMiddle value when sortedSum ÷ count
Works on categorical data?✅ Yes❌ No (needs order)❌ No (needs numbers)
Affected by outliers?No — completely immuneSlightly — resistant but not immuneYes — highly sensitive
Unique value guaranteed?No — can have 0, 1, or manyYes — always one valueYes — always one value
Best forCategorical data; most common value; inventorySkewed numerical data; ranked data; incomesSymmetric numerical data; further calculations (e.g., SD)
Symmetric normal distribution= Median = Mean= Mode = Mean= Mode = Median
Positively skewed distributionLowest of threeMiddle valueHighest of three
Used in further statistics?Rarely (no algebra)Percentile calculationsSD, variance, t-tests, regression

For a side-by-side comparison with worked examples across all three measures, see the Mean vs. Median vs. Mode guide on this site. If you need to compute all three simultaneously, the mean, median, and mode calculator accepts any dataset and returns all three values plus the range.

Frequently Asked Questions about the Mode

Can nominal or categorical data have a mode?

Yes — the mode is the only measure of central tendency that applies to nominal data. If a survey of 100 people about their favorite primary color returns 40 votes for red, 35 for blue, and 25 for yellow, the mode is red. There is no way to compute a "mean" or "median" primary color, but the mode is well-defined. This is covered in UCLA's OARC Statistical Methods guide, which lists the mode as the standard summary statistic for nominal measurement scales.

How do outliers affect the mode compared to the mean?

The mode is completely unaffected by outliers. Consider the dataset {3, 5, 5, 7}: mode = 5, mean = 5. Now add the outlier 10,000: {3, 5, 5, 7, 10,000}. The mode is still 5. The mean is now 2,004. This immunity is one reason the mode is preferred over the mean in datasets where extreme values occur but should not define the "typical" observation — for example, in quality control, where a single catastrophic failure should not define the typical product performance.

What does a uniform distribution mean for the mode?

A uniform distribution has no mode. By definition, every value in a uniform distribution occurs with equal frequency. Since no value occurs more than any other, there is no single "most common" value. For example, in a perfectly fair coin flip (heads or tails, each 50%), neither outcome is the mode. Similarly, a dataset {4, 4, 7, 7, 9, 9} has three equally frequent values — it could be called trimodal, but some statisticians prefer to say it has no single dominant mode.

What is the difference between bimodal and multimodal distributions?

Bimodal means exactly two modes — two values share the highest frequency. Multimodal is the broader term covering any distribution with two or more modes (so bimodal is a specific type of multimodal). A dataset with three tied-highest frequencies is trimodal, which is multimodal but not bimodal. The distinction matters in practice: a bimodal income distribution suggests two distinct population sub-groups, while a trimodal or higher pattern may suggest a more complex underlying structure in the data.

When should you use the mode instead of the mean?

Use the mode when: (1) your data is categorical and arithmetic is undefined, (2) you need to know what is most common rather than the average level, (3) your dataset has outliers that would distort the mean, or (4) your data is discrete with few possible values and a mean would not correspond to a real observation. Use the mean when the data is numerical, roughly symmetric, and you need a value for further calculations like standard deviation or hypothesis tests.

Sources and Further Reading

Primary Sources Cited in This Guide

Academic and Institutional References

NIST/SEMATECH e-Handbook of Statistical Methods. "Measures of Location: Mode." National Institute of Standards and Technology. Available at: itl.nist.gov.

Penn State STAT 100: Statistical Concepts and Reasoning. Lesson 2.2: "Describing Data: Measures of Center." Available at: online.stat.psu.edu.

Sheffield, S. (2022). MIT 18.600: Probability and Random Variables, Lecture 3 Notes. Massachusetts Institute of Technology. Available at: math.mit.edu.

U.S. Census Bureau. Income and Poverty Statistics Documentation. Available at: census.gov.

UCLA Statistical Consulting Group (OARC). "What Statistical Analysis Should I Use?" — measurement scale guidance. Available at: stats.oarc.ucla.edu.

Wackerly, D., Mendenhall, W., & Scheaffer, R.L. (2008). Mathematical Statistics with Applications, 7th ed. Thomson Brooks/Cole. (Standard university text covering Pearson's empirical relationship, Chapter 1–2.)

The mode is one piece of a broader toolkit in descriptive statistics. The guides below cover the concepts that appear most often alongside the mode in coursework and applied analysis.

μ

Mean (Average)

Arithmetic, geometric, and weighted means with full formulas and step-by-step examples.

Md

Median

The middle value for sorted data, resistant to outliers, and the standard for skewed distributions.

σ

Standard Deviation

The primary measure of spread — how far individual values sit from the mean.

📊

Variance

The squared average of deviations from the mean — the building block of standard deviation.

🔢

Types of Data

Nominal, ordinal, interval, and ratio scales — and which statistical operations apply to each.

📈

Normal Distribution

The bell curve where mean = median = mode — and how the 68-95-99.7 rule applies.

Visit Statistics Fundamentals for the full library of guides, calculators, and statistical tables covering everything from outlier detection to hypothesis testing and regression analysis.

Citation guidance: When referencing this page in academic work, cite it as: Statistics Fundamentals Team (2026). "What is Mode in Statistics? Definition, Formulas & Examples." Statistics Fundamentals. Retrieved from https://statisticsfundamentals.com/descriptive-statistics/mode/. For primary statistical theory, cite the NIST Handbook and Penn State STAT 100 sources listed in the Sources section above.