What Is a Sampling Distribution?
Imagine drawing 1,000 random samples of size 50 from the same population. Each sample produces a slightly different mean. Plot all 1,000 of those means on a histogram, and what you get is the sampling distribution of the sample mean. That distribution — not the original population, not any single sample — is what inferential statistics runs on.
Three distinct distributions are easy to confuse, and the confusion leads to real errors in calculations:
| Distribution | What it describes | Shape | Spread parameter |
|---|---|---|---|
| Population distribution | Individual values in the full population | Any shape (skewed, bimodal, uniform…) | σ (population SD) |
| Sample distribution | Individual values in one sample | Approximates population shape | s (sample SD) |
| Sampling distribution | Values of a statistic across many samples | Approaches normal (CLT) | SE = σ/√n |
A sampling distribution is the probability distribution of a sample statistic (such as the mean or proportion) computed from all possible samples of a fixed size n drawn from the same population. Its mean equals the population parameter (unbiasedness), and its spread is measured by the standard error, which shrinks as n increases.
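To see this concretely, here is a minimal simulation sketch in Python (assuming NumPy is available) that draws 1,000 samples of size 50 from a skewed population and summarizes the resulting distribution of sample means:

```python
import numpy as np

rng = np.random.default_rng(42)

# Skewed population: exponential with mean 10 (so σ = 10 as well)
sample_means = np.array([
    rng.exponential(scale=10.0, size=50).mean()  # one sample of n = 50
    for _ in range(1_000)
])

# The 1,000 means form the (empirical) sampling distribution of x̄
print(f"Mean of sample means: {sample_means.mean():.2f}")       # ≈ μ = 10
print(f"SD of sample means:   {sample_means.std(ddof=1):.2f}")  # ≈ σ/√n = 10/√50 ≈ 1.41
```

Despite the strongly skewed population, a histogram of `sample_means` is close to a bell curve — the table's three distributions in action.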
🔑 Key Takeaways
The most important ideas to carry into every sampling distribution problem.
The sampling distribution is not the population distribution. It is the distribution of a statistic, not of individual data values.
The mean of the sampling distribution equals μ. Sample means are unbiased estimators of the population mean.
Standard error = σ/√n. Larger samples produce narrower sampling distributions — meaning more precise estimates.
The Central Limit Theorem is why normal-based inference works. Even non-normal populations produce a roughly normal sampling distribution when n ≥ 30.
When σ is unknown and n is small, use t — not Z. The t-distribution with df = n − 1 accounts for extra estimation uncertainty.
Bootstrap sampling distributions need no parametric assumptions. They construct empirical distributions from your observed data using resampling.
Types of Sampling Distributions
The three most commonly used sampling distributions each answer a different question about a different kind of data.
Sampling Distribution of the Sample Mean (x̄)
When the variable of interest is quantitative — test scores, reaction times, incomes — and you want to estimate the population mean μ, the relevant sampling distribution is that of x̄. If random samples of size n are drawn from a population with mean μ and standard deviation σ, then the sampling distribution of x̄ has:
- Mean: μ_x̄ = μ (x̄ is an unbiased estimator of μ)
- Standard error: SE = σ/√n
- Shape: approximately normal for large n (by the CLT), and exactly normal if the population itself is normal
Sampling Distribution of the Sample Proportion (p̂)
When data are categorical — yes/no, pass/fail, clicked/did not click — and you want to estimate a population proportion p, the sampling distribution of p̂ applies. Three conditions must hold before the normal approximation applies:
np ≥ 10
The expected number of successes must be at least 10. Some textbooks use np ≥ 5 as the threshold; the stricter rule is more conservative and widely accepted.
n(1−p) ≥ 10
The expected number of failures must also be at least 10. Both conditions must hold simultaneously.
Independence
Observations are independent, or the sample is less than 10% of the population (the 10% condition for sampling without replacement).
When the conditions hold, the sampling distribution of p̂ is approximately normal with mean p and standard error SE = √(p(1−p)/n).
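A short sketch of the condition check and SE calculation (the function name is illustrative, not from any particular library):

```python
import math

def proportion_se(p: float, n: int) -> float:
    """Standard error of p̂ under simple random sampling."""
    if n * p < 10 or n * (1 - p) < 10:
        raise ValueError("np ≥ 10 and n(1−p) ≥ 10 must both hold for the normal approximation")
    return math.sqrt(p * (1 - p) / n)

print(f"{proportion_se(0.30, 100):.4f}")  # 0.0458 — matches the worked example below
```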
Sampling Distribution of the Difference Between Two Means
When comparing two groups — treatment vs. control, Group A vs. Group B — the quantity of interest is x̄₁ − x̄₂. Under independence, this difference follows a sampling distribution with mean μ₁ − μ₂ and standard error √(σ₁²/n₁ + σ₂²/n₂). This is the foundation of the two-sample t-test and the two-sample z-test covered in the hypothesis testing guide at Statistics Fundamentals.
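A hypothetical two-group calculation (the σ and n values here are made up for illustration):

```python
import math

def diff_means_se(sigma1: float, n1: int, sigma2: float, n2: int) -> float:
    """Standard error of x̄₁ − x̄₂ for two independent samples."""
    return math.sqrt(sigma1**2 / n1 + sigma2**2 / n2)

# Hypothetical: treatment group σ₁ = 12, n₁ = 40; control group σ₂ = 15, n₂ = 50
print(f"{diff_means_se(12, 40, 15, 50):.2f}")  # 2.85
```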
Standard Error: Formula, Meaning & Effect of Sample Size
Standard error is the standard deviation of the sampling distribution. It measures how much the statistic fluctuates across repeated samples — not how much individual data points fluctuate within a sample. That distinction is what many students miss.
The formula is SE = σ/√n, where:
- σ = population standard deviation
- n = sample size
- SE = spread of the sampling distribution
A concrete example shows how dramatically sample size changes precision. Suppose a population has σ = 20:
| Sample size (n) | SE = 20/√n | Interpretation |
|---|---|---|
| n = 4 | SE = 10.0 | Sample mean varies by ±10 points on average |
| n = 16 | SE = 5.0 | Precision doubles from n = 4 |
| n = 100 | SE = 2.0 | Now within ±2 points of μ on average |
| n = 400 | SE = 1.0 | Quadrupling n halves SE again |
Standard deviation (s or σ) describes spread within a dataset. Standard error (SE = σ/√n) describes spread of a statistic across samples. Reporting SE in place of SD makes results look more precise than they are — a frequent error in published research. The two quantities shrink at different rates: SE shrinks with 1/√n; SD stays roughly constant regardless of n.
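A quick simulation makes the different shrink rates visible (a sketch assuming NumPy; the normal population with μ = 100 is chosen purely for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
sigma = 20  # population SD, as in the table above

for n in (4, 16, 100, 400):
    sample = rng.normal(loc=100, scale=sigma, size=n)
    print(f"n = {n:>3}: sample SD ≈ {sample.std(ddof=1):5.1f} (hovers near σ = 20), "
          f"SE = {sigma / np.sqrt(n):4.1f} (shrinks with 1/√n)")
```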
Finite Population Correction Factor
The formula SE = σ/√n assumes the population is either infinite or that sampling occurs with replacement. When sampling without replacement from a finite population of size N, and the sample makes up more than about 5% of that population (n/N > 0.05), the standard SE formula overstates true variability. Apply the finite population correction (FPC):
Corrected SE = (σ/√n) × √((N−n)/(N−1)), where:
- N = population size
- n = sample size
- √((N−n)/(N−1)) = FPC factor (always < 1)
Example: A company has 200 employees (N = 200). You survey 40 of them (n = 40). Since 40/200 = 20% > 5%, the FPC applies. With σ = 10: unadjusted SE = 10/√40 = 1.58. FPC factor = √((200−40)/(200−1)) = √(160/199) = 0.897. Corrected SE = 1.58 × 0.897 = 1.42. The correction reduces SE by about 10% in this case.
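The same calculation as a small helper function (a sketch; the function name is ours):

```python
import math

def corrected_se(sigma: float, n: int, N: int) -> float:
    """Standard error with the finite population correction applied."""
    fpc = math.sqrt((N - n) / (N - 1))
    return (sigma / math.sqrt(n)) * fpc

# The employee survey example: N = 200, n = 40, σ = 10
print(f"{corrected_se(10, 40, 200):.2f}")  # 1.42 vs. uncorrected 1.58
```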
The Central Limit Theorem
The Central Limit Theorem (CLT) is the reason normal-based inference works on data that are decidedly not normal. It states that as sample size n increases, the sampling distribution of the sample mean approaches a normal distribution — regardless of the shape of the population distribution, provided the population has a finite mean and variance.
If X₁, X₂, …, Xₙ are independent, identically distributed random variables with mean μ and variance σ², then the sampling distribution of x̄ = (X₁ + X₂ + … + Xₙ)/n is approximately N(μ, σ²/n) for large n — formally, √n(x̄ − μ)/σ converges in distribution to N(0, 1) as n → ∞. In practice, n ≥ 30 is a common guideline for the approximation to be reliable.
Three Conditions for the CLT to Apply
Independence
The sampled values must be independent. For sampling without replacement, independence holds when the sample is less than 10% of the population (the 10% condition).
Sample Size (n ≥ 30)
The general guideline is n ≥ 30. For highly skewed populations, a larger n may be needed. For populations already close to normal, n can be smaller — even n = 10 may suffice.
Finite Variance
The population must have a finite mean and finite variance. Heavy-tailed distributions like the Cauchy, which has no finite mean or variance, violate this condition, and the CLT does not apply to them.
How Skewness Affects the Required Sample Size
The n ≥ 30 rule is a guideline, not a law. A roughly symmetric population achieves a near-normal sampling distribution at n = 10 or 15. A right-skewed income distribution or an exponential distribution may need n = 50 or more for the approximation to be accurate. The more extreme the skewness, the more samples you need to average before the CLT smooths out the population's shape.
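One way to see this numerically: the skewness of the sampling distribution of the mean of n exponential draws is 2/√n, so it decays slowly. A simulation sketch (assuming NumPy):

```python
import numpy as np

rng = np.random.default_rng(1)

# Exponential population: right-skewed, population skewness = 2
for n in (5, 15, 50):
    means = rng.exponential(scale=1.0, size=(10_000, n)).mean(axis=1)
    skew = ((means - means.mean()) ** 3).mean() / means.std() ** 3
    print(f"n = {n:>2}: skewness of sampling distribution ≈ {skew:.2f}")  # theory: 2/√n
```

At n = 50 the residual skewness (≈ 0.28) is small but not gone — which is why skewed populations need larger samples than the n ≥ 30 rule suggests.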
The Central Limit Theorem (Wikipedia) provides a rigorous treatment of the convergence conditions, including Lindeberg's more general version that relaxes the identical distribution requirement.
Real-World Example
Election Polling and the CLT
National poll samples are rarely drawn from normally distributed populations — voter preferences are binary (support or oppose). But because samples of n = 800 to 1,500 are used, the CLT guarantees the sampling distribution of the sample proportion is approximately normal. That is why every poll reports a margin of error: it is approximately 2 × SE = 2 × √(p̂(1−p̂)/n). For n = 1,000 and p̂ = 0.50, the margin is 2 × √(0.25/1000) ≈ ±3.2 percentage points — a number you will see cited in virtually every major media poll.
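The margin-of-error arithmetic in a few lines (the helper name is illustrative):

```python
import math

def margin_of_error(p_hat: float, n: int) -> float:
    """Approximate 95% margin of error for a sample proportion (2 × SE)."""
    return 2 * math.sqrt(p_hat * (1 - p_hat) / n)

print(f"±{100 * margin_of_error(0.50, 1000):.1f} points")  # ±3.2 percentage points
```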
When σ Is Unknown: The t-Distribution
The Z-based sampling distribution (x̄ ~ N(μ, σ²/n)) assumes σ is known. In practice, σ is almost never known — you estimate it from the sample using s. That estimation introduces additional uncertainty, and the t-distribution accounts for it.
When σ is unknown — especially when n is small (typically n < 30) and the population is approximately normal — the standardized sample mean follows a t-distribution rather than a Z-distribution:
t = (x̄ − μ) / (s/√n), where:
- x̄ = sample mean
- μ = hypothesized population mean
- s = sample standard deviation
- n = sample size
- df = n − 1 = degrees of freedom
t-Distribution vs. Z-Distribution: When to Use Each
| Situation | Use Z | Use t |
|---|---|---|
| σ known | Yes | No |
| σ unknown, n ≥ 30 | Acceptable (t ≈ Z) | Preferred (technically correct) |
| σ unknown, n < 30, population normal | No | Yes (df = n − 1) |
| σ unknown, n < 30, population non-normal | No | Problematic — consider bootstrap |
The t-distribution is wider and heavier-tailed than Z when df is small. As df → ∞ (i.e., as n → ∞), the t-distribution converges to the standard normal. At df = 30, t and Z are already very close. Critical values for the t-distribution appear in the t-distribution table — look up the correct df and significance level before computing a test or interval.
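You can watch the convergence directly with SciPy (assuming scipy is installed):

```python
from scipy import stats

# 95% two-tailed critical values: t* approaches Z* as df grows
z_crit = stats.norm.ppf(0.975)  # ≈ 1.960
for df in (5, 15, 30, 100):
    print(f"df = {df:>3}: t* = {stats.t.ppf(0.975, df):.3f}  (Z* = {z_crit:.3f})")
```

Output runs from t* ≈ 2.571 at df = 5 down to t* ≈ 1.984 at df = 100 — wider tails for small samples, near-Z behavior by df = 30.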
Step-by-Step Worked Examples
Work through each example in order: identify parameters, compute the standard error, standardize, and look up the probability. That four-step structure works for every sampling distribution problem.
Example 1: Exam scores at a university have μ = 72 and σ = 15. A random sample of n = 36 students is selected. What is the probability that the sample mean exceeds 76?
Identify parameters: μ = 72, σ = 15, n = 36. CLT check: n = 36 ≥ 30 ✓. Sampling distribution: x̄ is approximately normal with mean 72 and SE = 15/√36 = 2.5.
Compute SE: SE = σ/√n = 15/√36 = 15/6 = 2.5
Standardize: Z = (x̄ − μ) / SE = (76 − 72) / 2.5 = 4 / 2.5 = 1.60
Find probability: P(x̄ > 76) = P(Z > 1.60) = 1 − 0.9452 = 0.0548
Answer: There is a 5.48% probability that a sample of 36 students has a mean exam score above 76. Use the Z-table to verify P(Z ≤ 1.60) = 0.9452.
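The same four steps in code (assuming SciPy for the normal CDF):

```python
import math
from scipy import stats

mu, sigma, n = 72, 15, 36
se = sigma / math.sqrt(n)   # Step 2: SE = 2.5
z = (76 - mu) / se          # Step 3: Z = 1.60
p = stats.norm.sf(z)        # Step 4: P(Z > 1.60), survival function = 1 − CDF
print(f"SE = {se}, Z = {z:.2f}, P(x̄ > 76) = {p:.4f}")  # 0.0548
```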
Example 2: A product has a 30% defect rate. A quality inspector samples n = 100 items. What is the probability that fewer than 25% of the sampled items are defective?
Identify parameters: p = 0.30, n = 100. Conditions: np = 30 ≥ 10 ✓, n(1−p) = 70 ≥ 10 ✓. Sampling distribution of p̂ is approximately normal.
Compute SE of proportion: SE = √(p(1−p)/n) = √(0.30 × 0.70 / 100) = √(0.0021) = 0.0458
Standardize: Z = (p̂ − p) / SE = (0.25 − 0.30) / 0.0458 = −0.05 / 0.0458 = −1.09
Find probability: P(p̂ < 0.25) = P(Z < −1.09) = 0.1379
Answer: There is a 13.8% chance the inspector observes fewer than 25% defective items in a sample of 100, even though the true defect rate is 30%.
Example 3: A lab measures reaction times for n = 16 participants. The sample gives x̄ = 248 ms and s = 40 ms. The population mean is hypothesized to be 260 ms. Does the sample provide evidence that the true mean differs from 260 ms (two-tailed, α = 0.05)?
Identify parameters: x̄ = 248, μ₀ = 260, s = 40, n = 16. Since σ is unknown and n < 30, use t with df = 16 − 1 = 15.
Compute SE: SE = s/√n = 40/√16 = 40/4 = 10 ms
Compute t-statistic: t = (248 − 260) / 10 = −12/10 = −1.20
Compare to critical value: For two-tailed test with df = 15 and α = 0.05, critical t = ±2.131 (from the t-table). Since |−1.20| = 1.20 < 2.131, fail to reject H₀.
Answer: The sample does not provide sufficient evidence (at α = 0.05) that the true mean reaction time differs from 260 ms. The observed difference of 12 ms is within normal sampling variability for n = 16.
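Because only summary statistics are given (not raw data), the t-test is computed by hand rather than with scipy.stats.ttest_1samp, which expects the raw observations:

```python
from scipy import stats

x_bar, mu0, s, n = 248, 260, 40, 16
se = s / n ** 0.5                      # 10.0
t_stat = (x_bar - mu0) / se            # −1.20
df = n - 1                             # 15
p_value = 2 * stats.t.cdf(t_stat, df)  # two-tailed (t_stat is negative)
t_crit = stats.t.ppf(0.975, df)        # 2.131
print(f"t = {t_stat:.2f}, |t| < {t_crit:.3f}, p = {p_value:.3f}")  # p ≈ 0.249 → fail to reject
```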
Bootstrap Sampling Distributions
Classical sampling distribution theory requires assumptions: normality of the population, or n large enough for the CLT. Bootstrap resampling sidesteps these requirements by constructing an empirical sampling distribution directly from your observed data.
How Bootstrapping Works
1. Start with your original sample of n observations.
2. Draw a new sample of size n with replacement from the original — this is one bootstrap sample.
3. Compute the statistic of interest (mean, median, correlation, etc.) on the bootstrap sample.
4. Repeat steps 2–3 a large number of times — typically B = 1,000 to 10,000.
5. Plot the B bootstrap statistics. The resulting histogram is the bootstrap sampling distribution.
6. Use the 2.5th and 97.5th percentiles of this distribution as a 95% confidence interval.
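Here is a compact percentile-bootstrap sketch of the procedure above (assuming NumPy; the data values are made up):

```python
import numpy as np

def bootstrap_ci(data, stat=np.mean, B=10_000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval: steps 1–6 above."""
    rng = np.random.default_rng(seed)
    n = len(data)
    boot_stats = np.array([
        stat(rng.choice(data, size=n, replace=True))  # one bootstrap sample (steps 2–3)
        for _ in range(B)
    ])
    return np.percentile(boot_stats, [100 * alpha / 2, 100 * (1 - alpha / 2)])

# Hypothetical small, skewed sample
data = np.array([2.1, 3.4, 1.8, 7.9, 2.6, 4.2, 12.5, 3.1, 2.9, 5.0])
print(bootstrap_ci(data))  # 95% CI for the mean, no normality assumption
```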
Bootstrap distributions work well when the population is non-normal and n is too small for the CLT, when you are estimating a complex statistic like a correlation or ratio, or when no theoretical sampling distribution formula exists for your statistic. The bootstrap method (Wikipedia) was introduced by Bradley Efron in 1979 and is now a standard tool in computational statistics and machine learning.
How Sampling Method Affects the Distribution
The standard error formula SE = σ/√n assumes simple random sampling (SRS) — every possible sample of size n has an equal chance of selection. Other sampling designs change the standard error, sometimes substantially.
Simple Random Sampling (SRS)
Every member of the population has an equal probability of selection. SE = σ/√n applies directly. Independence holds (or approximately holds via the 10% condition).
Stratified Sampling
The population is divided into strata; separate random samples are drawn from each. Within-stratum homogeneity reduces the effective SE — often producing a narrower sampling distribution than SRS of the same n. The design effect (DEFF) quantifies the gain.
Cluster Sampling
Groups (clusters) are randomly selected, then all members within a cluster are surveyed. Observations within the same cluster tend to be correlated, which inflates the effective SE relative to SRS. The intraclass correlation coefficient (ICC) captures this inflation.
Real-World Applications
Sampling distributions are not abstract theory. They drive decisions in every field that draws inferences from data.
Political Polling
Polls report a margin of error — approximately 2 × SE of the sample proportion (1.96 × SE at 95% confidence). A poll of n = 1,000 with p̂ = 0.50 has a margin of ±3.2 percentage points at 95% confidence.
Clinical Trials
The sampling distribution of the difference between treatment and control group means determines whether observed differences are statistically significant — or merely sampling variability. Every drug approval relies on this logic.
Quality Control
Control charts plot sample means over time and use the sampling distribution (specifically, 3 × SE control limits) to flag when a manufacturing process has shifted out of statistical control.
A/B Testing
In digital product experiments, the sampling distribution of the difference in conversion rates between two variants determines the minimum detectable effect and required sample size for a valid test.
Financial Risk
Portfolio managers use the sampling distribution of average returns across historical periods to construct confidence intervals for expected future performance — a direct application of the CLT to asset return data.
Sampling Distributions and Inferential Statistics
Every procedure in inferential statistics — confidence intervals, Z-tests, t-tests, proportion tests — is built directly on the sampling distribution of some statistic. Understanding this connection removes the mystery from these procedures.
Confidence Intervals
A 95% confidence interval for the population mean is: x̄ ± z* × SE, where z* = 1.96 for 95% confidence. The width of the interval is 2 × 1.96 × SE. Narrower sampling distributions (larger n, smaller σ) produce narrower intervals — which is why increasing sample size improves estimation precision. See the hypothesis testing guide on Statistics Fundamentals for confidence interval construction with t-critical values.
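Reusing the exam score example from earlier (treating x̄ = 72 as the sample mean, with σ = 15 and n = 36), a sketch with SciPy:

```python
import math
from scipy import stats

x_bar, sigma, n = 72, 15, 36
se = sigma / math.sqrt(n)               # 2.5
z_star = stats.norm.ppf(0.975)          # 1.96 for 95% confidence
lo, hi = x_bar - z_star * se, x_bar + z_star * se
print(f"95% CI: ({lo:.2f}, {hi:.2f})")  # (67.10, 76.90)
```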
Hypothesis Testing
A hypothesis test computes the probability of observing a test statistic as extreme as the one obtained, assuming H₀ is true. That probability — the p-value — is an area under the sampling distribution curve. Reject H₀ when the p-value falls below the chosen significance level α. The normal distribution guide explains how to read Z-scores from these distributions, and the Z-table and t-table provide the critical values.
Common Mistakes and Misconceptions
Treating the sampling distribution as the sample distribution
The sampling distribution is the distribution of a statistic across repeated samples — it is not the distribution of data values within a single sample. Using the sample's standard deviation where the standard error is required produces completely wrong probability calculations.
Treating n ≥ 30 as a hard threshold
The n ≥ 30 guideline is a rule of thumb, not a mathematical threshold. A strongly skewed or heavy-tailed population may need n = 100 or more. A near-normal population may be fine at n = 10. Checking the shape of your data matters more than hitting a magic number.
Using Z when t is required
When σ is unknown and n is small, plugging in s and using a Z-critical value underestimates the true uncertainty. The t-distribution is wider, which correctly reflects the fact that using an estimated standard deviation introduces additional error.
Assuming the population must be normally distributed
The CLT does not require a normal population. It only requires that the population has a finite mean and variance, and that n is large enough. The sampling distribution of x̄ becomes normal with sufficient n, regardless of the population's shape.
Forgetting the finite population correction when sampling large fractions
The standard formula SE = σ/√n assumes sampling from a large population. When the sample exceeds 5–10% of the population, ignoring the FPC overstates the standard error and makes confidence intervals unnecessarily wide.
Quick Reference: Sampling Distributions Summary
| Statistic | Mean of sampling dist. | Standard Error (SE) | Shape (large n) | Use t when |
|---|---|---|---|---|
| Sample mean (x̄) | μ | σ/√n | N(μ, σ²/n) | σ unknown, n small |
| Sample proportion (p̂) | p | √(p(1−p)/n) | N(p, p(1−p)/n) when np ≥ 10 and n(1−p) ≥ 10 | Rarely (use Z for proportions) |
| Difference x̄₁ − x̄₂ | μ₁ − μ₂ | √(σ₁²/n₁ + σ₂²/n₂) | Normal (by CLT) | σ₁, σ₂ unknown |
| With FPC | μ | (σ/√n) × √((N−n)/(N−1)) | Normal | Same as x̄ rules |
| Bootstrap | Empirical | SD of bootstrap statistics | Empirical (no assumption) | Not applicable |
Related Topics on Statistics Fundamentals
Sampling distributions sit at the center of inferential statistics. Every topic below connects directly to concepts covered on this page. See the random variables guide for the formal probability theory, and the descriptive statistics section for the foundational summary measures that sampling distributions are built around.
Normal Distribution
- The target shape of sampling distributions
- Governs Z-score calculations
- Defines confidence interval critical values
- Connected via CLT
Hypothesis Testing
- Uses sampling distributions for p-values
- Z-tests and t-tests are direct applications
- Type I/II errors defined in terms of sampling dist.
- Power depends on SE
Binomial Distribution
- Sampling distribution of p̂ derives from binomial
- Normal approximation via CLT
- np ≥ 10 condition parallels CLT requirement
- Foundation for proportion tests
t, Chi-Square & F Distributions
- All are sampling distributions of specific statistics
- t = ratio involving normal and chi-square
- F = ratio of two chi-square variables
- Used in ANOVA and regression inference
Frequently Asked Questions
What is a sampling distribution in simple terms?
Imagine taking many random samples of the same size from a population and computing the mean of each one. The distribution of all those sample means is the sampling distribution of the mean. It tells you how much sample means vary from sample to sample — and because that variation is predictable and quantifiable, you can use it to make probability statements about where the true population mean lies.
What is the difference between a sample distribution and a sampling distribution?
A sample distribution is the distribution of the data values in one sample — it approximates the population distribution and has standard deviation s. A sampling distribution is the distribution of a statistic (like the sample mean) across all possible samples of a given size — it is much narrower than the population distribution and has standard deviation SE = σ/√n. These two quantities serve completely different purposes and should never be substituted for each other.
How do you calculate the standard error?
The standard error of the sample mean is SE = σ/√n, where σ is the population standard deviation and n is the sample size. If σ is unknown, replace it with the sample standard deviation s. Standard error decreases as n increases — quadrupling the sample size halves the standard error. For proportions, SE = √(p(1−p)/n). When sampling more than 5% of a finite population, apply the finite population correction: multiply SE by √((N−n)/(N−1)).
Does the population need to be normally distributed for the Central Limit Theorem to apply?
No. The Central Limit Theorem applies to any population with a finite mean and variance, regardless of its shape. For a roughly symmetric population, the sampling distribution becomes approximately normal at n = 15 or less. For a heavily skewed population, n = 50 or more may be needed. The population itself does not need to be normal — only the sampling distribution of the mean needs to be approximately normal, and the CLT guarantees that happens with sufficient n.
When should you use the t-distribution instead of the Z-distribution?
Use the t-distribution when the population standard deviation σ is unknown and you substitute the sample standard deviation s. With small samples (n < 30), this substitution introduces meaningful additional uncertainty that the t-distribution captures through its heavier tails. Specifically, use t when: (1) σ is unknown, (2) the population is approximately normal or n is large enough for CLT, and (3) you need the degrees of freedom to be df = n − 1. When n ≥ 30, Z and t give nearly identical answers.
What is the finite population correction and when does it apply?
When a sample is drawn without replacement from a finite population, and the sample size n exceeds about 5% of the population size N, the standard error formula overestimates the true variability. The finite population correction (FPC) adjusts for this: corrected SE = (σ/√n) × √((N−n)/(N−1)). The FPC factor is always less than 1, so it reduces the standard error. When n/N is small — as in most large-scale surveys — the FPC is negligible and can be ignored.
How does bootstrap resampling work?
Bootstrap resampling treats your observed sample as a stand-in for the population. You draw thousands of new samples of size n from your original sample, with replacement, and compute the statistic of interest on each bootstrap sample. The resulting collection of bootstrap statistics forms the bootstrap sampling distribution. Its spread estimates the standard error; its 2.5th and 97.5th percentiles give a 95% bootstrap confidence interval. No assumptions about population shape are needed, making bootstrap especially useful for small samples or complex statistics.
How do sampling distributions apply to A/B testing?
In an A/B test, you observe conversion rates p̂_A and p̂_B from two groups of users. The quantity of interest is p̂_A − p̂_B. Under the null hypothesis that the true rates are equal, the sampling distribution of this difference is approximately normal with a calculable standard error. The test computes a Z-score for the observed difference and checks whether it falls in the rejection region. The required sample size per group is determined by specifying a minimum detectable effect, significance level α, and desired power (1 − β) — all quantities that depend on the sampling distribution's shape and spread.