When can you use the normal approximation to the binomial?

The normal approximation to the binomial is valid when both np >= 5 and n(1-p) >= 5. Some textbooks and advanced curricula require np >= 10 and n(1-p) >= 10 for a more conservative standard.

Why is continuity correction needed?

The binomial distribution is discrete and the normal distribution is continuous. A continuity correction adjusts each discrete integer boundary by +/- 0.5, which improves the accuracy of the approximation by matching the area under the continuous curve to the probability of the discrete value.

What is the formula for normal approximation?

For a binomial distribution with parameters n and p, the mean is mu = np and the standard deviation is sigma = sqrt(np(1-p)). After applying the continuity correction, the Z-score is Z = (x_corrected - mu) / sigma, which is then used with a standard normal table to find the probability.

What is the normal approximation to the Poisson distribution?

When the Poisson rate parameter lambda is large (lambda >= 10 by standard criteria, lambda >= 20 by conservative criteria), the Poisson distribution can be approximated by a normal distribution with mean mu = lambda and variance sigma^2 = lambda. A continuity correction of +/- 0.5 also applies.

Normal Approximation: Binomial Guide, Formulas & Continuity Correction

What Is Normal Approximation?

Definition — Normal Approximation

Normal approximation is a statistical method that uses a continuous normal distribution to estimate probabilities for other distributions — most often the discrete binomial distribution. The method is valid when the sample size is large enough that the discrete histogram is nearly symmetric, allowing a smooth bell curve to model it accurately.

Z = (x_corrected − μ) / σ where μ = np, σ = √(np(1−p))

The binomial distribution gives the exact probability of getting exactly k successes in n independent trials when the probability of success on each trial is p. The formula involves factorials that grow rapidly with n, making hand calculation impractical for large samples. The normal approximation sidesteps this by treating the binomial as if it were continuous, converting the target value to a Z-score and using the standard normal distribution table.

The theoretical justification comes from the Central Limit Theorem: the sum of many independent, identically distributed random variables converges to a normal distribution as the number of terms grows. A binomial random variable is the sum of n independent Bernoulli trials, so it follows that the binomial converges to normal as n increases. This is why the approximation improves with larger samples.

📌

Featured Snippet — Normal Approximation in One Paragraph

Normal approximation substitutes a continuous normal distribution for a discrete one (usually binomial or Poisson) when the sample size is large enough. For a binomial with parameters n and p, set μ = np and σ = √(np(1−p)), apply a continuity correction of ±0.5 to the discrete boundary, convert to a Z-score, and read the probability from a standard normal table. The conditions np ≥ 5 and n(1−p) ≥ 5 determine whether the approximation is valid.

μ = np

Mean of the approximation

σ² = np(1−p)

Variance of the approximation

±0.5

Continuity correction

np ≥ 5

Minimum condition

Conditions for Normal Approximation

The approximation is only trustworthy when the underlying distribution is reasonably symmetric. For a binomial distribution, that happens when both tails have enough expected events to prevent severe skew. The standard check, used in most introductory statistics courses and on the AP Statistics exam, is called the Rule of 5.

✓ Validity Conditions — Two Things to Check

np ≥ 5 — the expected number of successes must be at least 5
n(1−p) ≥ 5 — the expected number of failures must be at least 5
If either condition fails, compute the exact binomial probability instead
Advanced courses (econometrics, engineering statistics) often require np ≥ 10 and n(1−p) ≥ 10 for tighter accuracy
For Poisson approximation, the rate parameter λ must satisfy λ ≥ 10 (standard) or λ ≥ 20 (conservative)

Conditions Check: Is Normal Approximation Valid?

Step 1 — Identify Distribution Type

Binomial Distribution: extract n and p

→

Check: np ≥ 5 AND n(1−p) ≥ 5?

✓ Both ≥ 5 → Normal approximation is valid

✗ Either < 5 → Use exact binomial

Poisson Distribution: extract λ

✓ λ ≥ 10 → Normal approximation is valid

✗ λ < 10 → Use exact Poisson calculation

Why the Conditions Matter

When p is very small or very large, the binomial distribution is skewed: most probability mass sits near zero or near n. In those cases, a symmetric bell curve is a poor fit. For example, with n = 20 and p = 0.05, np = 1 — well below 5. The distribution is strongly right-skewed. Forcing a normal approximation there produces meaningless probability estimates.

As p approaches 0.5, the binomial becomes symmetric, and the approximation improves even at smaller n. The Rule of 5 catches the worst cases; the Rule of 10 adds a margin for applied work where getting the right probability matters more than a quick calculation. Full coverage of this skew behavior is in the binomial distribution guide.

Normal Approximation Formulas

Three quantities are needed: the mean, the standard deviation, and the Z-score after applying the continuity correction.

Mean of the Normal Approximation

μ = n · p

n = number of trials p = probability of success μ = expected number of successes

Standard Deviation of the Normal Approximation

σ = √(n · p · (1 − p))

σ² = n·p·(1−p) is the variance σ = standard deviation (take the square root) Never plug in variance where σ belongs in the Z formula

Z-Score After Continuity Correction

Z = (x_corrected − μ) / σ

x_corrected = discrete value ± 0.5 μ = np σ = √(np(1−p))

Once you have the Z-score, look it up in the Z-table or use a calculator's normal cumulative distribution function. The table gives the area to the left of the Z-score. For probabilities in the right tail, subtract the table value from 1. For the area between two Z-scores, subtract the smaller table value from the larger one.

Continuity Correction

The binomial counts integers: 0, 1, 2, 3, and so on. The normal distribution covers every real number along a continuous line. When you map the integer k onto the continuous curve, you are treating a single point as if it had width. The continuity correction expands each integer to an interval of width 1 centered on it — specifically from k − 0.5 to k + 0.5 — which is the range of real numbers that would round to k.

Without the correction, you are reading the probability at the single point k, which has measure zero on a continuous distribution. With it, you read the area of a thin strip, which matches the discrete bar in the histogram. For large n, the difference is small. For moderate n near the condition threshold, it is noticeable enough to matter.

Target Inequality	Continuity-Corrected Region	Plain English
P(X ≤ k)	P(Y ≤ k + 0.5)	Include k, so extend boundary rightward
P(X < k)	P(Y ≤ k − 0.5)	Exclude k, so stop just before it
P(X ≥ k)	P(Y ≥ k − 0.5)	Include k, so extend boundary leftward
P(X > k)	P(Y ≥ k + 0.5)	Exclude k, so start just after it
P(X = k)	P(k − 0.5 ≤ Y ≤ k + 0.5)	Exact value: compute area of the strip

⚠️

Common Mistake — Strict vs. Non-Strict Inequalities

P(X < 50) and P(X ≤ 50) give different corrected boundaries: 49.5 vs. 50.5. For continuous distributions the two are identical; for discrete ones they differ by one whole integer. Always determine whether the original inequality is strict (< or >) or non-strict (≤ or ≥) before applying the correction.

How to Apply Normal Approximation (8 Steps)

📋

8-Step Summary

1: Identify distribution and parameters. 2: Check np ≥ 5 and n(1−p) ≥ 5. 3: Compute μ = np. 4: Compute σ = √(np(1−p)). 5: Apply continuity correction to the boundary. 6: Compute Z = (x_corrected − μ) / σ. 7: Look up the probability from the normal table. 8: State the conclusion.

Identify the Distribution and Extract Parameters

For a binomial problem, record n (number of trials) and p (probability of success per trial). For a Poisson problem, record λ (the rate or mean count). Both determine whether the approximation is appropriate.

Check the Validity Conditions

Compute np and n(1−p). Both must be at least 5 (or at least 10 if your course uses the stricter rule). If either is below the threshold, use exact binomial probability tables or a calculator's binomcdf function instead of the approximation.

Calculate the Mean (μ)

Set μ = n × p. This is the expected number of successes and becomes the center of the approximating normal distribution. For Poisson, μ = λ.

Calculate the Standard Deviation (σ)

Set σ = √(n × p × (1 − p)). This is the square root of the variance. Do not skip the square root — plugging in the variance instead of the standard deviation is the single most common arithmetic error in normal approximation problems.

Apply the Continuity Correction

Use the table above to determine the corrected boundary. For P(X ≤ k), compute Z using k + 0.5. For P(X < k), use k − 0.5. For P(X = k), you need two Z-scores, one for k − 0.5 and one for k + 0.5.

Compute the Z-Score

Z = (x_corrected − μ) / σ. A positive Z means the target value is above the mean; negative means below. Carry at least three decimal places to avoid rounding errors when looking up the table.

Look Up the Probability

Use the standard normal (Z) table to find the area to the left of your Z-score. For right-tail probabilities, subtract from 1. For between-two-values problems, subtract the smaller area from the larger. Alternatively, use normalcdf on a TI-84 or stats.norm.cdf() in Python.

Interpret the Result

Restate the answer in the language of the original problem. Avoid reporting more than four significant figures — the approximation itself introduces error, so false precision is misleading. Compare to the exact binomial result when accuracy is critical.

Worked Examples

Each example follows the 8-step procedure above. Arithmetic is shown in full so you can follow each number. All Z-score lookups use the standard four-decimal-place table distributed by NIST.

Example 1 — Airline No-Show Problem (P(X < 15))

Worked Example 1 — Binomial to Normal, Left Tail

An airline finds that 10% of booked passengers do not show up. The plane holds 200 passengers. What is the probability that fewer than 15 passengers fail to show up?

Parameters

n = 200 p = 0.10 k = 15 (strict: P(X < 15))

Distribution: Binomial, n = 200, p = 0.10

Check conditions: np = 200 × 0.10 = 20 ≥ 5 ✓ n(1−p) = 200 × 0.90 = 180 ≥ 5 ✓ — approximation is valid

Mean: μ = np = 200 × 0.10 = 20

Standard deviation: σ = √(200 × 0.10 × 0.90) = √18 ≈ 4.243

Continuity correction: P(X < 15) means k = 15 is excluded. Corrected boundary = 15 − 0.5 = 14.5

Z-score: Z = (14.5 − 20) / 4.243 = −5.5 / 4.243 ≈ −1.30

Probability: Table area for Z = −1.30 → P ≈ 0.0968

Interpretation: There is approximately a 9.68% probability that fewer than 15 passengers will fail to show up.

✅ Answer: P(X < 15) ≈ 0.0968 (9.68%)

Z-score lookup via the NIST Standard Normal Probability Table. Exact binomial result via R's pbinom(14, 200, 0.1) = 0.0867; the approximation without continuity correction gives 0.0968, confirming the correction adds accuracy.

Example 2 — Exact Probability: P(X = 45) from 100 Coin Flips

Worked Example 2 — Exact Single Value, Continuity Correction Both Sides

A fair coin is flipped 100 times. What is the probability of getting exactly 45 heads?

Distribution: Binomial, n = 100, p = 0.50

Check conditions: np = 50 ≥ 5 ✓ n(1−p) = 50 ≥ 5 ✓

Mean: μ = 100 × 0.50 = 50

Standard deviation: σ = √(100 × 0.50 × 0.50) = √25 = 5

Continuity correction: P(X = 45) expands to P(44.5 ≤ X ≤ 45.5)

Two Z-scores:
Z_lower = (44.5 − 50) / 5 = −1.10
Z_upper = (45.5 − 50) / 5 = −0.90

Probability:
Table area for Z = −0.90 → 0.1841
Table area for Z = −1.10 → 0.1357
P = 0.1841 − 0.1357 = 0.0484

Interpretation: The probability of getting exactly 45 heads in 100 flips is approximately 4.84%. The exact binomial answer via dbinom(45, 100, 0.5) is 0.0485, confirming the approximation is accurate to four decimal places here.

✅ Answer: P(X = 45) ≈ 0.0484 (4.84%). Exact: 0.0485 — error < 0.01%.

Example 3 — Poisson to Normal Approximation

Worked Example 3 — Poisson Approximation

A hospital emergency room receives an average of 18 patients per hour. What is the probability that more than 22 patients arrive in a given hour?

Distribution: Poisson, λ = 18

Check condition: λ = 18 ≥ 10 ✓ — approximation is valid

Mean: μ = λ = 18

Standard deviation: σ = √λ = √18 ≈ 4.243

Continuity correction: P(X > 22) excludes 22. Corrected boundary = 22 + 0.5 = 22.5

Z-score: Z = (22.5 − 18) / 4.243 = 4.5 / 4.243 ≈ 1.06

Probability: Table area for Z = 1.06 → 0.8554. Right-tail: 1 − 0.8554 = 0.1446

Interpretation: There is approximately a 14.5% probability that more than 22 patients arrive in a given hour.

✅ Answer: P(X > 22) ≈ 0.1446 (14.46%)

Normal Approximation Calculator

Enter the binomial parameters, select the inequality type, and click Calculate. The calculator shows every intermediate step: conditions check, mean, standard deviation, continuity-corrected boundary, Z-score, and the final probability.

Normal Approximation Calculator (Binomial)

Sample Size (n)

Success Probability (p)

Value (k)

Inequality

Method Comparison

Choosing between exact binomial computation and the normal approximation depends on sample size, the accuracy you need, and the tools available. The Poisson distribution is a third option when n is large and p is very small.

Feature	Exact Binomial	Normal Approximation	Poisson Method
Distribution type	Discrete	Continuous	Discrete
Formula complexity	High (factorials grow fast)	Low (Z-score only)	Moderate
Best for	Small n (< 30) or extreme p	Large n, moderate p (np ≥ 5)	Large n, very small p (λ = np)
Requires continuity correction?	Not applicable	Yes, ±0.5	Yes, ±0.5
Accuracy	Exact	Good for large n	Good when p is tiny
Hand-calculation friendly?	Only for small n	Yes	Moderate

For a deeper comparison of the normal and t-distributions, see the T-distribution vs. normal distribution guide. Coverage of the empirical rule — a related application of normal distribution properties — is in the empirical rule article.

Common Mistakes to Avoid

⛔

Mistake 1 — Skipping the Condition Check

Applying the normal approximation when np < 5 or n(1−p) < 5 produces unreliable results. Always check both quantities before proceeding. If either fails, use the exact binomial formula or a calculator's binomcdf function.

⛔

Mistake 2 — Using Variance Instead of Standard Deviation

The Z-score denominator requires σ = √(np(1−p)), not the variance np(1−p) itself. Forgetting the square root inflates the denominator and dramatically understates the Z-score magnitude.

⛔

Mistake 3 — Wrong Continuity Correction Direction

P(X < 50) corrects to k − 0.5 = 49.5. P(X ≤ 50) corrects to k + 0.5 = 50.5. These give different Z-scores and different final probabilities. Determine whether the inequality is strict or non-strict before adjusting.

⚠️

Mistake 4 — Omitting the Continuity Correction Entirely

For large n (say, n > 1000), the correction makes almost no difference. For n closer to 30 or when p is not near 0.5, omitting it introduces error. The correction is cheap to apply and always improves the result — there is no reason to skip it.

Python and R Implementation

Software handles the calculation with far less risk of arithmetic error. Both examples below compute the approximated probability and the exact binomial result side by side so you can verify the approximation's accuracy.

Python (SciPy)

Python — Normal Approximation with Continuity Correction

import numpy as np
import scipy.stats as stats

n, p, k = 100, 0.4, 35

# Normal approximation parameters
mu = n * p
sigma = np.sqrt(n * p * (1 - p))

# Continuity correction for P(X <= k)
k_corrected = k + 0.5
prob_approx = stats.norm.cdf(k_corrected, mu, sigma)

# Exact binomial for comparison
prob_exact = stats.binom.cdf(k, n, p)

print(f"mu = {mu}, sigma = {sigma:.4f}")
print(f"Approximated P(X ≤ {k}) = {prob_approx:.4f}")
print(f"Exact P(X ≤ {k})        = {prob_exact:.4f}")

R — Normal Approximation with Continuity Correction

n <- 100
p <- 0.4
k <- 35

mu <- n * p
sigma <- sqrt(n * p * (1 - p))

# Continuity correction for P(X <= k)
k_corrected <- k + 0.5
prob_approx <- pnorm(k_corrected, mean = mu, sd = sigma)

# Exact binomial for comparison
prob_exact <- pbinom(k, size = n, prob = p)

cat("mu =", mu, " sigma =", round(sigma, 4), "\n")
cat("Approximated:", round(prob_approx, 4), "\n")
cat("Exact:", round(prob_exact, 4), "\n")

For more on working with distributions in R, Khan Academy's normal approximation to binomial article provides a solid complementary reference. Penn State's STAT 414 course materials on approximating the binomial distribution cover the theoretical derivation in more depth.

📊

Binomial Distribution

The exact distribution that normal approximation replaces. Covers the PMF formula, mean, variance, and cumulative probability tables.

🔔

Normal Distribution

Properties of the bell curve, the standard normal (Z) distribution, and how to use the Z-table — foundational to applying the approximation.

📐

Z-Score

How to standardize values and interpret their position on the normal distribution — the final step in every normal approximation problem.

🎲

Central Limit Theorem

The theoretical result that explains why the normal approximation works: sums of independent random variables converge to normal as n grows.

📏

Sample Proportions

The sampling distribution of p̂ also uses normal approximation, with conditions np ≥ 10 and n(1−p) ≥ 10 in most applied courses.

🔍

Hypothesis Testing

Z-tests for proportions rely directly on normal approximation to establish the test statistic's distribution under the null hypothesis.

FAQs

Normal approximation is a method that uses the continuous normal distribution to estimate probabilities for other distributions, especially the discrete binomial distribution. Instead of calculating exact probabilities, you calculate the mean and standard deviation, apply a continuity correction, convert the value to a Z-score, and use the standard normal distribution to find probabilities.

A binomial distribution can be approximated by a normal distribution when both np ≥ 5 and n(1−p) ≥ 5. These conditions ensure the distribution is not heavily skewed and has enough observations in both success and failure categories. Some applications use the stricter rule np ≥ 10 and n(1−p) ≥ 10 for better accuracy.

The continuity correction is used because the binomial distribution is discrete while the normal distribution is continuous. It adjusts integer values by 0.5 so that the continuous normal curve better matches the probability areas of the discrete distribution. This improves approximation accuracy, especially for smaller sample sizes.

For a binomial distribution with parameters n and p, calculate the mean as μ = np and the standard deviation as σ = √(np(1−p)). After applying the continuity correction, convert the value into a Z-score using Z = (x_corrected − μ) / σ. The resulting Z-score is then used with the standard normal distribution to find probabilities.

When the Poisson rate parameter λ is sufficiently large, the Poisson distribution can be approximated by a normal distribution with mean μ = λ and standard deviation σ = √λ. A continuity correction is also applied. The approximation becomes more accurate as λ increases because the Poisson distribution becomes more symmetric.

The accuracy of normal approximation improves as sample size increases and as p gets closer to 0.5. Large samples usually produce very accurate results, while smaller samples near the minimum conditions may have noticeable errors. Applying the continuity correction generally improves accuracy compared with using the uncorrected approximation.

The general steps are: identify the distribution and parameters, check that the normal approximation conditions are satisfied, calculate the mean and standard deviation, apply the continuity correction, convert the value into a Z-score, use the standard normal table or software to find the probability, and interpret the result in the context of the problem.

Yes. Larger sample sizes make the normal approximation more accurate because sampling distributions become closer to normal. The approximation error generally decreases as sample size increases. Small samples, especially when success and failure probabilities are unbalanced, can produce less reliable results.

Yes. Normal approximation is widely used in quality control, polling, medical research, and business statistics. It helps estimate probabilities, calculate confidence intervals, and perform hypothesis tests when exact calculations are difficult. Tests for proportions often rely on normal approximation to justify the use of Z statistics.