Descriptive Statistics Outlier Detection IQR & Z-Score 28 min read May 20, 2026
BY: Statistics Fundamentals Team
Reviewed By: Minsa A (Senior Statistics Editor)

Outliers in Statistics: Meaning, Detection Methods, and Real Examples

Imagine you survey ten employees about their annual salaries. Nine earn between $40,000 and $80,000. The tenth is the CEO, earning $5 million. That single number does not represent typical pay — it is an outlier, and including it makes the average salary look like $538,000, misleading everyone.

This guide explains what outliers are, why they occur, how to detect them using the IQR method and z-scores, and — critically — when to remove them versus when to keep them. The interactive calculator lets you find outliers in your own data instantly.

What You'll Learn
  • ✓ The plain-English definition with numerical examples and everyday analogies
  • ✓ How outliers distort the mean, inflate standard deviation, and bias regression models
  • ✓ Step-by-step IQR method — the best beginner technique — with a worked example
  • ✓ Z-score detection with thresholds, assumptions, and when it breaks down
  • ✓ Real-world case studies: fraud detection, medical research, housing markets
  • ✓ The "Signal vs. Noise" decision framework for handling outliers ethically

What Are Outliers?

Definition — Statistical Anomaly
An outlier is a data point that lies significantly far from the other values in a dataset. Outliers break the pattern of the data — they sit well outside the range where most observations cluster. They may result from errors, unusual events, or genuine rare phenomena, and they have a disproportionate effect on statistical summaries.
Outlier if: x < Q₁ − 1.5·IQR   or   x > Q₃ + 1.5·IQR

Simple Definition of an Outlier

In plain terms, an outlier is a value that does not fit with the rest. If a class of 25 students scores between 60 and 90 on an exam, but one student scores 12 or 99, those scores stand apart from the group. They are outliers — not because they are "wrong," but because they are statistically distant from the main cluster.

Numerically, outliers are defined relative to the spread of the data. The most widely used definition comes from John Tukey's 1977 work on Exploratory Data Analysis (Addison-Wesley), which established the 1.5×IQR rule still used today in box plots and statistical software. According to the NIST/SEMATECH e-Handbook of Statistical Methods, outliers are extreme observations that warrant careful scrutiny before any statistical analysis proceeds. The team at Statistics Fundamentals has built this guide to make that scrutiny approachable for beginners and practitioners alike.

⚡ Quick Reference — Outlier Key Facts
  • IQR fences: Lower = Q₁ − 1.5×IQR  |  Upper = Q₃ + 1.5×IQR
  • Z-score threshold: |z| > 3 flags an extreme outlier in normally distributed data
  • Effect on mean: One extreme value can shift the mean dramatically — the median is resistant
  • Not always errors: Outliers can signal fraud, discoveries, or rare events
  • Best beginner method: IQR — works on skewed data without normality assumptions
  • Tukey's 1.5 rule: Flags ~0.7% of data as outliers in a perfectly normal distribution

Why Outliers Matter in Statistics

Outliers matter because they affect nearly every descriptive statistic you compute. The mean shifts toward them. The standard deviation inflates. Regression lines get pulled in their direction. Correlation coefficients weaken. A single extreme value can change your conclusions entirely — which is why every rigorous data analysis includes outlier detection as a standard step.

↑↑
Effect on Mean
↑↑
Effect on SD
~0
Effect on Median
↑↑
Regression Bias

Real-World Examples of Outliers

💰

Billionaire in Income Dataset

Jeff Bezos in a U.S. household income sample would push the mean income to an absurd figure, making "average" income meaningless.

🌡️

Faulty Sensor Reading

A malfunctioning temperature sensor records −999°C. That single value will destroy any trend analysis if left unchecked.

🏥

Medical Anomaly

A patient with an unusually rare enzyme level — dismissed as a data error — turned out to signal a previously undocumented metabolic condition.

📈

Stock Market Crash

On Black Monday (1987), the Dow Jones fell 22.6% in one day — a statistical outlier relative to all historical daily returns.

Why Outliers Occur

Not all outliers are the same. Before deciding what to do with one, you must understand where it came from. The cause determines the response. A typing error should be corrected. A genuine rare event should be preserved — and possibly studied separately.

Human and Measurement Errors

The most common source of outliers in real datasets is human error. A data entry operator types 85,000 instead of 8,500 for a salary record. A lab technician forgets to recalibrate a scale. A survey respondent misreads a question and enters age as 220 instead of 22. These outliers carry no real information — they are noise that corrupts your data, and correcting or removing them is both statistically and ethically appropriate.

⚠️
Check Before You Delete

Never remove an outlier based on its value alone. First verify the original source — the paper form, sensor log, or database record. If the original confirms the value, the outlier may be real.

Natural Variability and Rare Events

Some outliers are genuine. A 7-foot basketball player in a population height dataset is unusual, but biologically real. An earthquake of magnitude 9.0 appears as an outlier in a seismic dataset dominated by magnitude 2–4 tremors, but it represents one of the most important events in the record. According to the U.S. Geological Survey (USGS), extreme seismic events follow a power-law distribution where rare, high-magnitude quakes are statistically expected but infrequent.

Fraud, System Failure, or Unusual Events

In financial and cybersecurity data, outliers often carry the most important signal. A credit card transaction for $14,000 at 3 AM in a country the cardholder has never visited is an outlier relative to their spending history — and likely a fraud alert. In manufacturing, a bolt with a diameter 3 standard deviations above specification is a quality defect. These outliers are not errors; they are the events the system was built to detect.

Sampling Problems

Small samples are particularly vulnerable to apparent outliers. With only 10 data points, a value that would be unremarkable in a dataset of 1,000 can appear extreme. This is especially important to recognize in preliminary analyses: what looks like an outlier in n=8 may dissolve into the normal range once the full sample is collected.

How Outliers Affect Data Analysis

Understanding the mechanism by which outliers distort statistics makes you a better analyst. Here is a before-and-after breakdown across the four statistics most affected.

Effect on the Mean

The arithmetic mean divides the sum of all values by the count. Because every value contributes equally to the sum, a single extreme value can drag the mean far from where the bulk of the data lives. Consider seven employee salaries: $42k, $45k, $48k, $51k, $54k, $57k, and a CEO earning $2,000k.

Scenario Values Mean Median
Without CEO42k–57k (6 employees)$49,500$49,500
With CEO ($2M)42k–57k + $2,000k$334,000$51,000
Distortion from one outlier+576%+3%

The mean jumped from $49,500 to $334,000 — a 576% change — because of a single CEO salary. The median moved only 3%. This is why median is used to report income distributions rather than the mean.

Arithmetic Mean — Sensitive to Outliers
x̄ = (Σ xᵢ) / n
Every value xᵢ has equal weight — one extreme value shifts x̄ disproportionately
= sample mean xᵢ = each data point n = number of values Σ = sum of all

Why the Median Is More Resistant

The median is the middle value of a sorted dataset. Because it depends only on position — not magnitude — extreme values at either tail cannot move it far. This makes the median a "robust" or "resistant" statistic, and it is recommended when distributions are skewed or when outliers are present but cannot be removed.

Impact on Standard Deviation

Standard deviation squares every deviation from the mean: s = √[Σ(xᵢ − x̄)² / (n−1)]. Squaring amplifies large deviations. A value 4 standard deviations from the mean contributes 16 times more to the sum of squares than a value 1 standard deviation away. One extreme outlier can double a dataset's measured variability even when 99% of the data is tightly clustered.

How Outliers Distort Regression Models

In simple linear regression, the least-squares method minimizes the sum of squared residuals. An outlier with a large residual is penalized disproportionately, pulling the regression line toward it. This is called an influential observation, and it can change the slope and intercept of a regression model substantially — altering predictions for every other value in the dataset.

🚨
Regression Warning

A single influential outlier can flip the sign of a regression coefficient. If your regression R² drops significantly after removing one point, that point was controlling your model — which is rarely valid.

Types of Outliers

Not every unusual value is the same kind of unusual. Statisticians distinguish three main types, each with different implications for detection and handling.

Type Definition Example
Global Outlier A point extreme relative to the entire dataset A $5M salary in a dataset of $40k–$80k salaries
Contextual Outlier Normal globally, but abnormal in context 30°C temperature in Helsinki in December (not unusual in July)
Collective Outlier A group of points that are individually normal but collectively unusual 10 consecutive transactions of exactly $999 (credit card structuring)
Influential Observation A point that strongly affects model parameters A single data point that changes regression slope by 40%

This taxonomy was formalized in the machine learning literature, including Chandola, Banerjee, and Kumar's survey Anomaly Detection: A Survey (ACM Computing Surveys, 2009), which remains the standard academic reference on the subject. Understanding which type you are dealing with shapes both your detection approach and your response strategy.

How to Detect Outliers

The method you choose for outlier detection depends on your data's distribution, size, and the nature of your analysis. There is no single universal method — each makes different assumptions and has different strengths.

Visual Detection Using Box Plots

A box plot (or box-and-whisker plot) is the fastest visual tool for spotting outliers. The box represents the interquartile range (IQR) from Q₁ to Q₃. Whiskers extend to the last point within 1.5×IQR of the box edges. Any point beyond the whiskers is plotted as an individual dot and flagged as a potential outlier.

Box Plot Anatomy — Understanding the Visual

10 20 30 40 50 57 Outlier Outlier Min Q₁ Median Q₃ Max ← IQR →

Red dots outside the whiskers are potential outliers. The box spans Q₁ to Q₃; the blue line is the median.

IQR Method (Best Beginner Method)

The interquartile range method is the most accessible and robust outlier detection technique for beginners. It does not require the data to follow a normal distribution, making it usable on skewed datasets where the z-score method would fail.

IQR Outlier Fences — Tukey's Method (1977)
IQR = Q₃ − Q₁
Lower Fence = Q₁ − 1.5 × IQR
Upper Fence = Q₃ + 1.5 × IQR
Values below the lower fence or above the upper fence are flagged as potential outliers
Q₁ = 25th percentile (first quartile) Q₃ = 75th percentile (third quartile) IQR = interquartile range 1.5 = Tukey's constant

Why 1.5? John Tukey chose this multiplier specifically so that in a perfectly normal distribution, approximately 0.7% of observations fall outside the fences — a meaningful but not overly sensitive threshold. For extreme outlier detection, some analysts use 3×IQR instead. The interquartile range guide on this site walks through quartile calculations in full detail.

Worked Example — IQR Outlier Detection

Dataset: Annual salaries (in $000s): 42, 45, 48, 51, 54, 57, 60, 63, 66, 210

1

Sort the data (already sorted): 42, 45, 48, 51, 54, 57, 60, 63, 66, 210

2

Find Q₁ (25th percentile): Lower half = 42, 45, 48, 51, 54 → Median = 48. So Q₁ = 48.

3

Find Q₃ (75th percentile): Upper half = 57, 60, 63, 66, 210 → Median = 63. So Q₃ = 63.

4

Calculate IQR: IQR = Q₃ − Q₁ = 63 − 48 = 15

5

Calculate fences: Lower = 48 − 1.5×15 = 48 − 22.5 = 25.5  |  Upper = 63 + 1.5×15 = 63 + 22.5 = 85.5

6

Flag outliers: Is any value below 25.5? No. Is any value above 85.5? Yes — $210k exceeds the upper fence of $85.5k.

✓ $210k is flagged as an outlier by the IQR method. All other values (42–66) fall within the fences and are considered typical for this dataset.

Z-Score Method

The z-score measures how many standard deviations a value lies from the mean. A z-score of +3 means a value sits 3 SDs above the mean. Conventionally, values with |z| > 3 are flagged as outliers in datasets that approximate a normal distribution.

Z-Score Formula — Distance from Mean in SD Units
z = (x − μ) / σ
Flag as outlier when |z| > 3 (in normal distributions, this covers 99.73% of values)
z = z-score (standardized value) x = the data point μ = population mean σ = standard deviation

The z-score method has a critical limitation: it assumes the data follows a normal (bell-shaped) distribution. On heavily skewed datasets, it underperforms — and worse, the presence of outliers themselves inflates both the mean and standard deviation, making the z-score of those same outliers appear smaller than they really are. This is called "masking." Penn State's STAT 501 regression course notes document this masking effect and recommend the modified z-score for non-normal data (Penn State STAT 501).

⚠️
Z-Score Warning: Assumes Normality

Never apply the z-score method to income data, housing prices, or any right-skewed distribution. Use IQR instead, or the modified z-score (based on the median absolute deviation).

Scatter Plot Detection

In bivariate data (two variables), outliers reveal themselves as points far from the main cluster in a scatter plot. A point with an unusually large residual in a regression scatter plot is an influential observation — it is not just extreme in one variable but inconsistent with the relationship between both variables. Scatter plots are explored in depth in the scatter plots and correlation guide on this site.

Modified Z-Score and Robust Statistics

The modified z-score, developed by Iglewicz and Hoaglin for the American Society for Testing and Materials (ASTM), replaces the mean and standard deviation with the median and median absolute deviation (MAD). Because the median is robust to outliers, this method avoids the masking problem of the standard z-score. A modified z-score above 3.5 is commonly used as the outlier threshold.

Modified Z-Score — Robust Against Outlier Masking
M = 0.6745 × (xᵢ − x̃) / MAD
Flag as outlier when |M| > 3.5  |  MAD = Median(|xᵢ − x̃|)
= median of the dataset MAD = median absolute deviation 0.6745 = scaling constant for normal consistency

Machine Learning Approaches to Anomaly Detection

For high-dimensional datasets where traditional methods struggle, machine learning provides more powerful alternatives. These approaches are covered at an introductory level here — each has dedicated academic literature for deeper study.

🌲

Isolation Forest

Randomly partitions data into trees. Outliers are isolated in fewer splits because they are rare and distinctive. Works well on high-dimensional data without distance calculations.

🔵

DBSCAN

Density-based clustering that labels low-density points as outliers. Effective for detecting spatial anomalies and does not require specifying the number of clusters in advance.

🧠

Autoencoders

Neural networks that learn to compress and reconstruct normal patterns. Points with high reconstruction error — those the model struggles to replicate — are flagged as anomalies.

These methods are documented in Liu, Ting, and Zhou's foundational 2008 paper on the Isolation Forest algorithm (IEEE ICDM), and in Breunig et al.'s 2000 paper on LOF (Local Outlier Factor) at ACM SIGMOD — both seminal references in the anomaly detection literature.

Outlier Detection Calculator

🔍 Outlier Detector — IQR & Z-Score Methods

Should Outliers Be Removed?

This is the most consequential question in outlier analysis, and the answer is never automatic. Removing outliers without investigation is a form of data manipulation. Keeping errors that you know are wrong is equally problematic. The decision requires judgment, domain knowledge, and transparency.

When Removal Makes Sense

Removal is justified when the outlier is clearly attributable to a process that is outside the scope of your study. Specific justifications include: confirmed data entry errors (the original source shows a different value), known equipment malfunction (the sensor was malfunctioning during that period), duplicate or corrupted records, and values that are physically impossible (a human age of 312 years).

When Outliers Should Stay

Keep outliers when they represent genuine observations that are relevant to your question. The highest-performing salesperson in a sales dataset is an outlier — but removing them would bias any model of sales performance. In epidemiology, the first patient with an unusual presentation often appears as an outlier. In fraud detection, the outlier is the finding. The National Institutes of Health (NIH) publishes guidance on outlier handling in clinical trials that emphasizes pre-specified criteria for exclusion — decisions made before seeing the data, not after.

Ethical Concerns in Outlier Handling

Removing outliers selectively to improve a p-value or regression fit is a form of scientific misconduct. This practice — sometimes called "cherry-picking" or "data dredging" — inflates false positive rates and undermines the reliability of research. The American Statistical Association's 2022 Ethical Guidelines for Statistical Practice explicitly address this, stating that analysts must document all decisions about data exclusion and must not make those decisions based on the direction they push results (ASA Ethical Guidelines).

🚫
Never Remove Outliers to Improve Your Results

If your reason for removing an outlier is "it made my p-value worse," that is not a statistical justification — it is research bias. Decide outlier criteria before seeing results, document every exclusion, and disclose them in your methods section.

Best Practices for Documentation

Whether you keep or remove an outlier, document your decision completely: record the value, the detection method used, the reason for the decision, and whether a sensitivity analysis was run with and without the outlier. Transparent reporting is both a scientific standard and a requirement of reproducibility.

Real-World Outlier Case Studies

Abstract definitions become meaningful through concrete examples. Each case study below shows a domain where outlier detection has tangible consequences.

Fraud Detection in Banking

Case Study

Credit Card Transaction Anomaly Detection

A cardholder's transaction history shows purchases averaging $45, clustered between $10 and $150. A new transaction for $4,200 at a foreign merchant at 2:47 AM appears as an outlier in both the amount distribution (z-score ≈ 5.8) and the time dimension (3 SDs from the mean transaction hour). Modern fraud systems at institutions like JPMorgan Chase and Visa use FDIC-documented outlier-flagging pipelines that run in real time. The outlier here is not an error — it is the signal the entire system was built to catch.

Medical Research Anomalies

Case Study

Fleming's Penicillin Discovery — An Outlier That Saved Lives

In 1928, Alexander Fleming noticed a bacterial culture dish where contaminating mold had created a clear zone of bacterial inhibition — a striking outlier in his experimental results. A researcher focused only on "cleaning up noisy data" might have discarded this anomalous plate. Instead, Fleming investigated the outlier. The result was the discovery of penicillin. This example, documented in the history of medicine at institutions including the U.S. National Library of Medicine, is the canonical case for why outliers should be investigated, not reflexively deleted.

Manufacturing Quality Control

Case Study

Statistical Process Control — Six Sigma Outlier Detection

In manufacturing, control charts (Shewhart charts) flag data points that fall beyond 3σ from the process mean as "out-of-control" outliers. When a bolt production line begins generating bolts 0.3mm larger than specification, this outlier triggers an investigation. The cause is often tool wear, material batch variation, or miscalibration. Six Sigma quality programs formalized by Motorola and documented by the American Society for Quality (ASQ) rely on this outlier logic to maintain defect rates below 3.4 per million opportunities.

Housing Market Price Spikes

Case Study

Luxury Property Distortion in Median Home Price Reporting

In a neighborhood where homes sell for $350,000–$550,000, a single estate selling for $18 million is a statistical outlier. If included in mean price calculations, it makes the neighborhood appear unaffordable to buyers looking at median-priced homes. This is why the National Association of Realtors (NAR) reports median home prices rather than mean prices — the median is resistant to these outliers and gives a more accurate picture of typical affordability.

Social Media Analytics

Case Study

Viral Content Traffic Anomaly

A website typically receives 1,200 daily visitors with a standard deviation of 180. One day, a post goes viral and drives 340,000 visitors — a z-score of approximately 1,882, far beyond any reasonable threshold. This traffic spike is a genuine outlier in operational terms. Infrastructure teams use outlier detection in real-time monitoring tools to distinguish between viral events (benign outliers) and DDoS attacks (malicious outliers) — two very different responses to the same statistical signal.

Common Mistakes Beginners Make With Outliers

Mistake Wrong Approach Correct Approach
Auto-deleting outliers Remove any point beyond 2 SDs without checking Investigate each outlier's cause before deciding
Using z-scores on skewed data Apply z-score to income or house price distributions Use IQR or modified z-score on non-normal data
Ignoring domain context Flag a 100-year-old's age as impossible in a health dataset Verify the value; centenarians exist
Confusing noise with signal Smooth out all anomalies before fraud detection Preserve anomalies — they may be the most important data
Post-hoc removal Remove outliers after seeing they hurt your p-value Define exclusion criteria before analysis begins
One method fits all Always use the same detection threshold regardless of data shape Choose the method that fits your distribution and sample size

The Signal vs. Noise Outlier Framework

This five-step decision framework gives you a structured, repeatable process for every outlier you encounter. It is designed to be easy to remember, applicable in both research and business settings, and defensible under peer review.

1

Spot the Unusual Value

Use box plots or the IQR method for your initial scan. Flag any value beyond Q₁ − 1.5×IQR or Q₃ + 1.5×IQR. If the data is normally distributed, supplement with z-scores. Record the value, its position, and the method that flagged it.

2

Verify the Data

Trace the value back to its source. Is there a transcription error? A unit mismatch (kilograms vs. pounds)? A decimal point error? A sensor malfunction timestamp? If you cannot verify the original source, the value remains uncertain — treat it with caution.

3

Understand the Context

Ask whether the value is plausible in the real world. A 95-year-old patient in a geriatric study is not an outlier in context, even if flagged statistically. A $0 salary in an employee payroll is suspicious. Domain expertise — not just statistics — must inform this step.

4

Measure the Impact

Run your analysis both with and without the outlier. Compare the mean, standard deviation, regression coefficients, and any hypothesis test results. If the outlier changes your conclusions, it is influential and warrants special attention — either deeper investigation or clearly documented separate analysis.

5

Decide Transparently

Choose one of four responses: (a) Keep — the value is valid and relevant; (b) Remove — it is confirmed as an error; (c) Transform — apply a log or winsorization to reduce influence while keeping the data point; (d) Analyze separately — report results both ways. Whatever you decide, document it in your methods section.

Outlier Detection Method Cheat Sheet

Use this quick-reference table to choose the right detection method for your data situation. No single method is universally best — match the technique to the characteristics of your dataset.

Situation Recommended Method Why
Small dataset (n < 30) IQR Method Robust; no normality assumption needed
Normal distribution confirmed Z-Score (|z| > 3) Precise; exploits known distribution shape
Skewed data (income, prices) Modified Z-Score (MAD-based) Immune to outlier masking; uses median
Visual / exploratory analysis Box Plot Immediate visual summary; easy to communicate
Bivariate / regression data Scatter Plot + Cook's Distance Detects influential observations in context
High-dimensional data Isolation Forest Scales to many variables; no distance matrix needed
Clustered / spatial data DBSCAN Identifies low-density outlier regions naturally
Time series data ARIMA residuals or STL decomposition Separates trend/seasonality before flagging anomalies

Outlier Statistics Glossary

Term Formula / Symbol Plain-English Meaning Common Misunderstanding
Outlier x < Q₁−1.5IQR or x > Q₃+1.5IQR A value far from the main cluster of data All outliers are measurement errors
IQR Q₃ − Q₁ The range of the middle 50% of data IQR works perfectly on all distributions
Z-Score (x − μ) / σ Standard deviations a point is from the mean Always reliable for outlier detection
MAD Median(|xᵢ − x̃|) Median of absolute deviations; robust spread measure Same as standard deviation
Influential Observation High Cook's Distance A point that significantly changes regression results Same as any outlier
Robust Statistic Various (median, MAD) A measure resistant to the influence of outliers Just another name for "mean"
Winsorization Replace extremes with fence values Caps outliers at a threshold rather than removing them Same as deletion
Anomaly Detection ML methods (IF, DBSCAN) Identifying unusual patterns in complex datasets Only used in AI, not statistics

Frequently Asked Questions About Outliers

FAQ

What causes outliers in data?

Outliers arise from four main sources: (1) human errors in data entry or measurement; (2) instrument failure or sensor malfunction; (3) genuine rare events in the real world (extreme weather, fraud, medical anomalies); (4) sampling problems where a small sample captures an unusual observation. Identifying the cause determines the correct response.

FAQ

Can outliers be important information instead of errors?

Absolutely. In fraud detection, the outlier is the fraud. In drug discovery, an anomalous patient response may reveal a new mechanism. In quality control, a manufacturing outlier signals a process defect. Some of the most significant scientific and business discoveries began with someone investigating an outlier rather than deleting it.

FAQ

Which outlier detection method is best for beginners?

The IQR method. It is visual (works directly with box plots), requires no distributional assumptions, is taught in every introductory statistics course, and produces clear, interpretable boundaries. Once you understand IQR, the z-score and modified z-score are straightforward extensions.

FAQ

What is the difference between an anomaly and an outlier?

In classical statistics, "outlier" refers to a single data point extreme relative to the rest of the distribution. "Anomaly" is a broader data science term covering unexpected patterns — which may be a single point, a time-period, a group of points, or a contextual deviation. In practice, the terms are often used interchangeably, though anomaly detection methods typically handle more complex pattern types than traditional outlier tests.

Outliers connect to many other foundational topics in statistics. Understanding them fully requires knowing the measures they affect:

📊

Standard Deviation

Outliers inflate standard deviation. Understand how SD is calculated to see exactly why one extreme value can double a dataset's measured spread.

📐

Interquartile Range

The IQR is the foundation of Tukey's outlier fences. Master quartile calculations to apply the IQR method with full understanding.

⚖️

Variance

Variance squares every deviation, amplifying the impact of outliers far more than the original values suggest.

📉

Z-Score

The z-score standardizes any value in terms of standard deviations from the mean — the basis of z-score outlier detection.

📈

Simple Linear Regression

Influential observations in regression can change slope and intercept dramatically. Learn how regression is built to understand why.

🔔

Normal Distribution

Z-score outlier detection assumes normality. Understanding the normal curve explains why the 3σ threshold corresponds to ~0.3% of data.

Academic References: Tukey, J.W. (1977). Exploratory Data Analysis. Addison-Wesley.  |  Chandola, V., Banerjee, A., & Kumar, V. (2009). Anomaly Detection: A Survey. ACM Computing Surveys, 41(3).  |  Iglewicz, B. & Hoaglin, D. (1993). How to Detect and Handle Outliers. ASQC Quality Press.  |  Liu, F.T., Ting, K.M., & Zhou, Z.H. (2008). Isolation Forest. IEEE ICDM.  |  American Statistical Association. (2022). Ethical Guidelines for Statistical Practice. amstat.org.