What is the difference between sample and population standard deviation?

Population standard deviation (σ) divides the sum of squared deviations by N and is used when you have data on an entire population. Sample standard deviation (s) divides by n-1 (Bessel's correction) and is used when working with a subset of a larger population, which corrects for bias in estimating the true population variability.

How do you calculate standard deviation step by step?

Step 1: Calculate the mean of the dataset. Step 2: Subtract the mean from each value to find deviations. Step 3: Square each deviation. Step 4: Sum all squared deviations. Step 5: Divide by n-1 for sample variance (or N for population variance). Step 6: Take the square root to get standard deviation.

What does the Empirical Rule (68-95-99.7) mean?

The Empirical Rule states that in a normal distribution: approximately 68% of data falls within 1 standard deviation of the mean (μ ± 1σ), 95% falls within 2 standard deviations (μ ± 2σ), and 99.7% falls within 3 standard deviations (μ ± 3σ).

What is the coefficient of variation?

The coefficient of variation (CV) is the ratio of the standard deviation to the mean, expressed as a percentage: CV = (s / x̄) × 100. It allows comparison of variability between datasets with different units or scales. A CV below 15% typically indicates low variability; above 30% indicates high variability.

Standard Deviation Visualizer: Interactive Variance Calculator & Graph

Standard Deviation Visualizer & Variance Calculator

Sample SD s = √[Σ(xᵢ − x̄)² / (n−1)]

Calculation Mode

Decimal Precision

Dataset Values Supports comma-separated, space-separated, or one value per line. Paste CSV data directly.

Enter data in the Enter Data tab and click Calculate & Visualize first, then return here for the full step-by-step breakdown.

Formula Reference

Sample Standard Deviation s = √[Σ(xᵢ−x̄)² / (n−1)]

Population Standard Deviation σ = √[Σ(xᵢ−μ)² / N]

Variance (Sample) s² = Σ(xᵢ−x̄)² / (n−1)

Coefficient of Variation CV = (s / x̄) × 100%

Standard Error of Mean SEM = s / √n

Empirical Rule

μ ± 1σ ≈ 68.27% of data

μ ± 2σ ≈ 95.45% of data

μ ± 3σ ≈ 99.73% of data

Standard Deviation Guide

Complete reference on standard deviation concepts

Related Tools

CV Interpretation Guide

CV < 5% — Ultra-low variability
5%–15% — Moderate, controlled
15%–30% — Elevated variation
CV > 30% — High volatility

Worked Examples — Standard Deviation Across Industries

Click any example to load it directly into the calculator above.

What Is Standard Deviation? A Complete Reference Guide

Standard deviation is a statistical measure that quantifies the amount of variation, spread, or dispersion within a dataset relative to its mean. A low standard deviation indicates that data points cluster tightly around the average value, while a high standard deviation demonstrates that values are scattered across a wider range. It is one of the most widely used measures in descriptive statistics, research, finance, quality control, and data science.

Developed from the concept of variance, standard deviation returns the measure of spread to the original unit of the data by taking the square root of variance — making it far more interpretable in real-world contexts. For students, researchers, and analysts at Statistics Fundamentals, understanding standard deviation is foundational to statistical literacy.

Standard Deviation Visualizer — At a Glance

What It Calculates

Mean (x̄ / μ)
Sample & Population Variance
Standard Deviation (s / σ)
Standard Error (SEM)
Coefficient of Variation (CV)
Min, Max, Range, IQR

Visual Outputs

Interactive bell curve
Histogram with SD bands
Empirical rule shading
Downloadable SVG charts

Common Uses

Descriptive statistics
Research analysis
Quality control / Six Sigma
Financial risk analysis
Academic assignments
Data science / EDA

Standard Deviation Formula Library

The formula you use depends on whether your dataset is a sample (a subset drawn from a larger population) or the entire population. The distinction is critical because using the wrong formula systematically biases the result.

Population Standard Deviation (σ)

Use this when you have data on every member of the target group — for example, the exact test scores for an entire class:

Population Standard Deviation Formula σ = √[ Σ(xᵢ − μ)² / N ]

Where: σ = population standard deviation, xᵢ = each individual value, μ = population mean, N = total number of observations.

Sample Standard Deviation (s)

Use this when working with a sample drawn from a larger population. Dividing by n−1 (Bessel's correction) compensates for the tendency of samples to underestimate the true population variability:

Sample Standard Deviation Formula s = √[ Σ(xᵢ − x̄)² / (n − 1) ]

Where: s = sample standard deviation, xᵢ = each value, x̄ = sample mean, n = number of values in the sample.

Variance Formulas

Sample Variance (s²) s² = Σ(xᵢ − x̄)² / (n − 1)

Population Variance (σ²) σ² = Σ(xᵢ − μ)² / N

Coefficient of Variation (CV) CV = (s / x̄) × 100%

Standard Error (SEM) SEM = s / √n

How to Calculate Standard Deviation — Step-by-Step Guide

The following eight-step process walks through calculating the sample standard deviation manually using the dataset {10, 12, 16, 20, 22}.

Document your dataset. Write out all values clearly. Here: {10, 12, 16, 20, 22} with n = 5 observations.
Calculate the arithmetic mean (x̄). Add all values and divide by n: x̄ = (10+12+16+20+22) / 5 = 80 / 5 = 16.00
Find each deviation from the mean. Subtract the mean from every value: −6, −4, 0, +4, +6
Square each deviation. Squaring removes negative signs and amplifies larger deviations: 36, 16, 0, 16, 36
Sum the squared deviations (SS). SS = 36+16+0+16+36 = 104
Divide by n−1 to get sample variance. s² = 104 / (5−1) = 104/4 = 26.00
Take the positive square root. s = √26 ≈ 5.0990
Interpret the result. The average data point in this dataset sits roughly 5.10 units from the mean. Use the visualizer above to see this distribution plotted as a bell curve and histogram.

Standard Deviation vs Variance — Key Differences

Characteristic	Variance (s² / σ²)	Standard Deviation (s / σ)
Unit of measurement	Squared original units (e.g., cm²)	Same as original data (e.g., cm)
Interpretability	Not directly comparable to raw data	Directly plotted on the data scale
Outlier sensitivity	Very high — squaring amplifies outliers	Moderate — via square root transformation
Primary use cases	ANOVA, regression modeling, theoretical work	Reporting, risk analysis, quality control
Relationship	s² = (standard deviation)²	s = √(variance)

Sample vs Population Standard Deviation

Aspect	Sample (s)	Population (σ)
When to use	You have a subset of a larger group	You have data on the entire group
Denominator	n − 1 (Bessel's correction)	N (total count)
Result	Slightly larger — corrects for bias	Slightly smaller
Excel function	STDEV.S( )	STDEV.P( )
Python / NumPy	np.std(ddof=1)	np.std(ddof=0)

Measures of Dispersion — Comparison

Measure	Formula	What It Tells You	Limitations
Standard Deviation	√[Σ(xᵢ−x̄)²/(n−1)]	Average distance of each point from the mean; preserves original units	Sensitive to outliers due to squaring; assumes normality for the Empirical Rule
Variance	Σ(xᵢ−x̄)²/(n−1)	Squared average deviation; used in ANOVA and regression	Unit is squared, making direct interpretation harder
Range	Max − Min	Total spread of the dataset	Ignores all intermediate values; highly sensitive to outliers
IQR	Q3 − Q1	Spread of the middle 50% of data; robust to outliers	Ignores data outside the middle 50%
Mean Absolute Deviation	Σ\|xᵢ−x̄\| / n	Average absolute distance from the mean	Not as mathematically tractable as SD for advanced statistics

The Empirical Rule (68–95–99.7 Rule)

For data that follows a normal distribution, the Empirical Rule describes how observations distribute across standard deviation bands. This rule is foundational to quality control, risk assessment, and educational grading systems.

Range	% of Data	Plain English	Applications
μ ± 1σ	68.27%	About 2 in 3 observations fall within one SD of the mean	Typical operating range; standard reporting window
μ ± 2σ	95.45%	About 1 in 20 falls outside this band	95% confidence intervals; upper control limits
μ ± 3σ	99.73%	Only 1 in 370 falls outside — statistically rare	Six Sigma quality control; outlier thresholds
μ ± 1.96σ	95.00%	Exactly 95% (standard z-critical value)	Z-test critical value; 95% confidence intervals
μ ± 2.576σ	99.00%	Exactly 99% of all observations	99% confidence intervals; strict quality standards

Real-World Applications of Standard Deviation

Finance & Investment Risk: In portfolio management, standard deviation is the primary proxy for volatility. A stock returning an average of 8% annually with a standard deviation of 15% may return anywhere from −7% to +23% in a given year. Higher standard deviation means higher risk and higher potential reward. The Black-Scholes option pricing model relies directly on standard deviation of returns.

Manufacturing & Six Sigma Quality Control: Six Sigma processes aim to keep defects within ±6σ of the process mean, achieving fewer than 3.4 defects per million opportunities. Control charts use ±3σ limits as action thresholds. A shift in standard deviation signals the need for tool recalibration or process review.

Healthcare & Clinical Research: Standard deviation defines the reference ranges for clinical laboratory values (e.g., normal blood pressure, cholesterol levels). Values beyond 2 SD from the population mean are flagged as clinically abnormal. Clinical trials use SD to calculate statistical power and required sample sizes.

Education & Psychometrics: IQ scores are standardized to μ=100, σ=15. SAT sections use μ=500, σ=100. Standard deviation enables percentile rankings and fair grade-curve adjustments across different exam versions.

Data Science & Machine Learning: Feature normalization (z-score standardization) uses mean and standard deviation to scale all variables to the same range, preventing large-magnitude features from dominating distance-based models like k-nearest neighbors and support vector machines.

8 Common Mistakes in Standard Deviation Analysis

1. Using population formula on sample data. Dividing by N instead of n−1 for a sample artificially deflates the standard deviation, underestimating true variability and leading to overconfident conclusions.

2. Confusing standard deviation with standard error. Standard deviation describes variability within your dataset. Standard error (SEM = s/√n) describes how precisely your sample mean estimates the true population mean — these are two very different concepts.

3. Adding standard deviations directly. You cannot combine the spread of two datasets by adding their standard deviations. Variances add under independence (not standard deviations): if combining two groups, use pooled variance formulas.

4. Applying the Empirical Rule to non-normal distributions. The 68-95-99.7 rule only holds for approximately normal distributions. Applying it to skewed or bimodal data produces incorrect probability estimates. Always check normality first.

5. Ignoring outlier influence. Because squaring amplifies deviations, a single extreme outlier can dramatically inflate standard deviation. Report alongside median and IQR for skewed datasets.

6. Interpreting standard deviation without context. There is no universal "good" or "bad" standard deviation. A σ of 5 is excellent for manufacturing piston diameters in mm, but unacceptably high for a chemical compound's purity measured in parts per million.

7. Using too small a sample. With n < 30, standard deviation estimates are unstable. A sample of 5 values can yield wildly different standard deviations from the same population. Larger samples produce more reliable estimates.

8. Mixing grouped and raw data formulas. Grouped data (data presented in frequency tables) requires a modified formula that accounts for class midpoints and frequencies. The standard raw-data formula applied to grouped data produces incorrect results.

Statistical Glossary — Key Terms & Definitions

Term	Formula	Definition	Use Case
Standard Deviation	s = √(s²)	Square root of variance; measures average distance of data points from the mean in original units	Risk reporting, quality control, research summaries
Variance	s² = Σ(xᵢ−x̄)²/(n−1)	Average of the squared deviations from the mean; unit is squared	ANOVA, regression, theoretical statistics
Mean (x̄)	x̄ = Σxᵢ / n	Arithmetic average; the balancing point of the distribution	Central tendency reporting across all domains
Z-Score	z = (x − μ) / σ	Number of standard deviations a value sits above or below the mean	Standardizing test scores; identifying outliers
Coefficient of Variation	CV = (s / x̄) × 100	Relative dispersion expressed as a percentage of the mean; unitless	Comparing variability across datasets with different scales
Standard Error (SEM)	SEM = s / √n	Precision of the sample mean as an estimate of the population mean; decreases as n grows	Confidence intervals; hypothesis testing
Empirical Rule	68–95–99.7%	For normal distributions: 68%, 95%, 99.7% of data falls within 1, 2, 3 SDs of the mean	Probability estimation; outlier detection; quality control
Normal Distribution	f(x) = e^(−z²/2) / σ√2π	Symmetric, bell-shaped probability distribution defined by mean and SD; foundation of inferential statistics	Hypothesis testing; confidence intervals; natural phenomena
IQR	IQR = Q3 − Q1	Range of the middle 50% of data; robust measure of spread unaffected by extreme outliers	Box plots; outlier detection; skewed data reporting

Sources & further reading:

NIST Engineering Statistics Handbook — Measures of Scale (Standard Deviation)
Freedman, D., Pisani, R., & Purves, R. (2007). Statistics (4th ed.). W.W. Norton. [Standard reference textbook on statistical dispersion]
Khan Academy — Standard Deviation & Variance Review
Wikipedia — Standard Deviation [overview and mathematical derivation]

Frequently Asked Questions

Standard deviation is a statistical measure that quantifies the amount of variation or dispersion within a dataset. A low standard deviation means data points cluster tightly around the mean; a high standard deviation means they are spread widely. It is calculated as the square root of variance, which returns the measure to the original unit of the data — making it directly interpretable alongside the raw values.

Population standard deviation (σ) is used when your dataset represents every member of the group you are studying, and divides by N. Sample standard deviation (s) is used when your data is a subset of a larger population, and divides by n−1 (Bessel's correction). The sample version produces a slightly larger result, which corrects for the statistical bias of small samples tending to underestimate the true population variability. In practice, most research situations call for sample standard deviation.

Interpretation depends on context. In a normal distribution, approximately 68% of data falls within ±1 SD of the mean, 95% within ±2 SD, and 99.7% within ±3 SD (the Empirical Rule). For comparing datasets, the coefficient of variation (CV = SD/mean × 100%) is more useful because it is unitless. A CV below 15% typically signals controlled, low variability; above 30% indicates high volatility. Always interpret standard deviation relative to the scale and domain of the data.

Yes. This Standard Deviation Visualizer is completely free, browser-based, and requires no sign-up. Paste or type your values in the Enter Data tab, select sample or population mode, and click Calculate to get instant results including mean, variance, standard deviation, coefficient of variation, IQR, median, skewness, and more — alongside an interactive bell curve and histogram showing your data's distribution.

There is no universal "good" standard deviation — it depends entirely on the domain and scale of measurement. In manufacturing quality control (e.g., piston diameters in mm), a near-zero standard deviation signals excellent process consistency. In a financial portfolio, a higher standard deviation reflects higher risk but also higher potential returns. In education, a standard deviation of 10–15 points on a 100-point exam is typical. Use the coefficient of variation (CV) to compare variability across datasets with different scales.

In finance, standard deviation is the primary measure of investment risk and market volatility. It quantifies how much an asset's returns deviate from its historical average return. A stock with a high standard deviation of returns is considered volatile and risky; a low-standard-deviation stock or bond behaves more predictably. Standard deviation also underpins the Sharpe Ratio (risk-adjusted return = (return − risk-free rate) / SD), portfolio optimization models, and Value at Risk (VaR) calculations used in risk management.

Standard deviation directly determines the width of confidence intervals. A 95% confidence interval for a population mean is calculated as x̄ ± 1.96 × (s/√n), where s/√n is the standard error. Higher data variability (larger s) produces wider confidence intervals, indicating less precision in estimating the true population mean. You can calculate and visualize confidence intervals using the Confidence Interval Calculator.

The Empirical Rule states that for data following a normal distribution: approximately 68% of values fall within ±1 standard deviation of the mean, 95% fall within ±2 standard deviations, and 99.7% fall within ±3 standard deviations. This rule is used in quality control (Six Sigma uses ±6σ limits), outlier detection (values beyond ±3σ are statistically rare), and educational grading curves. The rule only applies to approximately normal distributions — for skewed data, use box plots and IQR instead.

Standard Deviation Visualizer

Standard Deviation Visualizer & Variance Calculator

Central Tendency

Dispersion

Range & Bounds

Distribution

Bell Curve — Normal Distribution

Histogram with Standard Deviation Bands

Empirical Rule (68–95–99.7)