What Is Normal Approximation?
The binomial distribution gives the exact probability of getting exactly k successes in n independent trials when the probability of success on each trial is p. The formula involves factorials that grow rapidly with n, making hand calculation impractical for large samples. The normal approximation sidesteps this by treating the binomial as if it were continuous, converting the target value to a Z-score and using the standard normal distribution table.
The theoretical justification comes from the Central Limit Theorem: the sum of many independent, identically distributed random variables converges to a normal distribution as the number of terms grows. A binomial random variable is the sum of n independent Bernoulli trials, so it follows that the binomial converges to normal as n increases. This is why the approximation improves with larger samples.
Normal approximation substitutes a continuous normal distribution for a discrete one (usually binomial or Poisson) when the sample size is large enough. For a binomial with parameters n and p, set μ = np and σ = √(np(1−p)), apply a continuity correction of ±0.5 to the discrete boundary, convert to a Z-score, and read the probability from a standard normal table. The conditions np ≥ 5 and n(1−p) ≥ 5 determine whether the approximation is valid.
Conditions for Normal Approximation
The approximation is only trustworthy when the underlying distribution is reasonably symmetric. For a binomial distribution, that happens when both tails have enough expected events to prevent severe skew. The standard check, used in most introductory statistics courses and on the AP Statistics exam, is called the Rule of 5.
- np ≥ 5 — the expected number of successes must be at least 5
- n(1−p) ≥ 5 — the expected number of failures must be at least 5
- If either condition fails, compute the exact binomial probability instead
- Advanced courses (econometrics, engineering statistics) often require np ≥ 10 and n(1−p) ≥ 10 for tighter accuracy
- For Poisson approximation, the rate parameter λ must satisfy λ ≥ 10 (standard) or λ ≥ 20 (conservative)
Conditions Check: Is Normal Approximation Valid?
Why the Conditions Matter
When p is very small or very large, the binomial distribution is skewed: most probability mass sits near zero or near n. In those cases, a symmetric bell curve is a poor fit. For example, with n = 20 and p = 0.05, np = 1 — well below 5. The distribution is strongly right-skewed. Forcing a normal approximation there produces meaningless probability estimates.
As p approaches 0.5, the binomial becomes symmetric, and the approximation improves even at smaller n. The Rule of 5 catches the worst cases; the Rule of 10 adds a margin for applied work where getting the right probability matters more than a quick calculation. Full coverage of this skew behavior is in the binomial distribution guide.
Normal Approximation Formulas
Three quantities are needed: the mean, the standard deviation, and the Z-score after applying the continuity correction.
n = number of trials
p = probability of success
μ = expected number of successes
σ² = n·p·(1−p) is the variance
σ = standard deviation (take the square root)
Never plug in variance where σ belongs in the Z formula
xcorrected = discrete value ± 0.5
μ = np
σ = √(np(1−p))
Once you have the Z-score, look it up in the Z-table or use a calculator's normal cumulative distribution function. The table gives the area to the left of the Z-score. For probabilities in the right tail, subtract the table value from 1. For the area between two Z-scores, subtract the smaller table value from the larger one.
Continuity Correction
The binomial counts integers: 0, 1, 2, 3, and so on. The normal distribution covers every real number along a continuous line. When you map the integer k onto the continuous curve, you are treating a single point as if it had width. The continuity correction expands each integer to an interval of width 1 centered on it — specifically from k − 0.5 to k + 0.5 — which is the range of real numbers that would round to k.
Without the correction, you are reading the probability at the single point k, which has measure zero on a continuous distribution. With it, you read the area of a thin strip, which matches the discrete bar in the histogram. For large n, the difference is small. For moderate n near the condition threshold, it is noticeable enough to matter.
| Target Inequality | Continuity-Corrected Region | Plain English |
|---|---|---|
| P(X ≤ k) | P(Y ≤ k + 0.5) | Include k, so extend boundary rightward |
| P(X < k) | P(Y ≤ k − 0.5) | Exclude k, so stop just before it |
| P(X ≥ k) | P(Y ≥ k − 0.5) | Include k, so extend boundary leftward |
| P(X > k) | P(Y ≥ k + 0.5) | Exclude k, so start just after it |
| P(X = k) | P(k − 0.5 ≤ Y ≤ k + 0.5) | Exact value: compute area of the strip |
P(X < 50) and P(X ≤ 50) give different corrected boundaries: 49.5 vs. 50.5. For continuous distributions the two are identical; for discrete ones they differ by one whole integer. Always determine whether the original inequality is strict (< or >) or non-strict (≤ or ≥) before applying the correction.
How to Apply Normal Approximation (8 Steps)
1: Identify distribution and parameters. 2: Check np ≥ 5 and n(1−p) ≥ 5. 3: Compute μ = np. 4: Compute σ = √(np(1−p)). 5: Apply continuity correction to the boundary. 6: Compute Z = (xcorrected − μ) / σ. 7: Look up the probability from the normal table. 8: State the conclusion.
Identify the Distribution and Extract Parameters
For a binomial problem, record n (number of trials) and p (probability of success per trial). For a Poisson problem, record λ (the rate or mean count). Both determine whether the approximation is appropriate.
Check the Validity Conditions
Compute np and n(1−p). Both must be at least 5 (or at least 10 if your course uses the stricter rule). If either is below the threshold, use exact binomial probability tables or a calculator's binomcdf function instead of the approximation.
Calculate the Mean (μ)
Set μ = n × p. This is the expected number of successes and becomes the center of the approximating normal distribution. For Poisson, μ = λ.
Calculate the Standard Deviation (σ)
Set σ = √(n × p × (1 − p)). This is the square root of the variance. Do not skip the square root — plugging in the variance instead of the standard deviation is the single most common arithmetic error in normal approximation problems.
Apply the Continuity Correction
Use the table above to determine the corrected boundary. For P(X ≤ k), compute Z using k + 0.5. For P(X < k), use k − 0.5. For P(X = k), you need two Z-scores, one for k − 0.5 and one for k + 0.5.
Compute the Z-Score
Z = (xcorrected − μ) / σ. A positive Z means the target value is above the mean; negative means below. Carry at least three decimal places to avoid rounding errors when looking up the table.
Look Up the Probability
Use the standard normal (Z) table to find the area to the left of your Z-score. For right-tail probabilities, subtract from 1. For between-two-values problems, subtract the smaller area from the larger. Alternatively, use normalcdf on a TI-84 or stats.norm.cdf() in Python.
Interpret the Result
Restate the answer in the language of the original problem. Avoid reporting more than four significant figures — the approximation itself introduces error, so false precision is misleading. Compare to the exact binomial result when accuracy is critical.
Worked Examples
Each example follows the 8-step procedure above. Arithmetic is shown in full so you can follow each number. All Z-score lookups use the standard four-decimal-place table distributed by NIST.
Example 1 — Airline No-Show Problem (P(X < 15))
An airline finds that 10% of booked passengers do not show up. The plane holds 200 passengers. What is the probability that fewer than 15 passengers fail to show up?
Distribution: Binomial, n = 200, p = 0.10
Check conditions: np = 200 × 0.10 = 20 ≥ 5 ✓ n(1−p) = 200 × 0.90 = 180 ≥ 5 ✓ — approximation is valid
Mean: μ = np = 200 × 0.10 = 20
Standard deviation: σ = √(200 × 0.10 × 0.90) = √18 ≈ 4.243
Continuity correction: P(X < 15) means k = 15 is excluded. Corrected boundary = 15 − 0.5 = 14.5
Z-score: Z = (14.5 − 20) / 4.243 = −5.5 / 4.243 ≈ −1.30
Probability: Table area for Z = −1.30 → P ≈ 0.0968
Interpretation: There is approximately a 9.68% probability that fewer than 15 passengers will fail to show up.
✅ Answer: P(X < 15) ≈ 0.0968 (9.68%)
Example 2 — Exact Probability: P(X = 45) from 100 Coin Flips
A fair coin is flipped 100 times. What is the probability of getting exactly 45 heads?
Distribution: Binomial, n = 100, p = 0.50
Check conditions: np = 50 ≥ 5 ✓ n(1−p) = 50 ≥ 5 ✓
Mean: μ = 100 × 0.50 = 50
Standard deviation: σ = √(100 × 0.50 × 0.50) = √25 = 5
Continuity correction: P(X = 45) expands to P(44.5 ≤ X ≤ 45.5)
Two Z-scores:
Zlower = (44.5 − 50) / 5 = −1.10
Zupper = (45.5 − 50) / 5 = −0.90
Probability:
Table area for Z = −0.90 → 0.1841
Table area for Z = −1.10 → 0.1357
P = 0.1841 − 0.1357 = 0.0484
Interpretation: The probability of getting exactly 45 heads in 100 flips is approximately 4.84%. The exact binomial answer via dbinom(45, 100, 0.5) is 0.0485, confirming the approximation is accurate to four decimal places here.
✅ Answer: P(X = 45) ≈ 0.0484 (4.84%). Exact: 0.0485 — error < 0.01%.
Example 3 — Poisson to Normal Approximation
A hospital emergency room receives an average of 18 patients per hour. What is the probability that more than 22 patients arrive in a given hour?
Distribution: Poisson, λ = 18
Check condition: λ = 18 ≥ 10 ✓ — approximation is valid
Mean: μ = λ = 18
Standard deviation: σ = √λ = √18 ≈ 4.243
Continuity correction: P(X > 22) excludes 22. Corrected boundary = 22 + 0.5 = 22.5
Z-score: Z = (22.5 − 18) / 4.243 = 4.5 / 4.243 ≈ 1.06
Probability: Table area for Z = 1.06 → 0.8554. Right-tail: 1 − 0.8554 = 0.1446
Interpretation: There is approximately a 14.5% probability that more than 22 patients arrive in a given hour.
✅ Answer: P(X > 22) ≈ 0.1446 (14.46%)
Normal Approximation Calculator
Enter the binomial parameters, select the inequality type, and click Calculate. The calculator shows every intermediate step: conditions check, mean, standard deviation, continuity-corrected boundary, Z-score, and the final probability.
Normal Approximation Calculator (Binomial)
Method Comparison
Choosing between exact binomial computation and the normal approximation depends on sample size, the accuracy you need, and the tools available. The Poisson distribution is a third option when n is large and p is very small.
| Feature | Exact Binomial | Normal Approximation | Poisson Method |
|---|---|---|---|
| Distribution type | Discrete | Continuous | Discrete |
| Formula complexity | High (factorials grow fast) | Low (Z-score only) | Moderate |
| Best for | Small n (< 30) or extreme p | Large n, moderate p (np ≥ 5) | Large n, very small p (λ = np) |
| Requires continuity correction? | Not applicable | Yes, ±0.5 | Yes, ±0.5 |
| Accuracy | Exact | Good for large n | Good when p is tiny |
| Hand-calculation friendly? | Only for small n | Yes | Moderate |
For a deeper comparison of the normal and t-distributions, see the T-distribution vs. normal distribution guide. Coverage of the empirical rule — a related application of normal distribution properties — is in the empirical rule article.
Common Mistakes to Avoid
Applying the normal approximation when np < 5 or n(1−p) < 5 produces unreliable results. Always check both quantities before proceeding. If either fails, use the exact binomial formula or a calculator's binomcdf function.
The Z-score denominator requires σ = √(np(1−p)), not the variance np(1−p) itself. Forgetting the square root inflates the denominator and dramatically understates the Z-score magnitude.
P(X < 50) corrects to k − 0.5 = 49.5. P(X ≤ 50) corrects to k + 0.5 = 50.5. These give different Z-scores and different final probabilities. Determine whether the inequality is strict or non-strict before adjusting.
For large n (say, n > 1000), the correction makes almost no difference. For n closer to 30 or when p is not near 0.5, omitting it introduces error. The correction is cheap to apply and always improves the result — there is no reason to skip it.
Python and R Implementation
Software handles the calculation with far less risk of arithmetic error. Both examples below compute the approximated probability and the exact binomial result side by side so you can verify the approximation's accuracy.
Python (SciPy)
import numpy as np import scipy.stats as stats n, p, k = 100, 0.4, 35 # Normal approximation parameters mu = n * p sigma = np.sqrt(n * p * (1 - p)) # Continuity correction for P(X <= k) k_corrected = k + 0.5 prob_approx = stats.norm.cdf(k_corrected, mu, sigma) # Exact binomial for comparison prob_exact = stats.binom.cdf(k, n, p) print(f"mu = {mu}, sigma = {sigma:.4f}") print(f"Approximated P(X ≤ {k}) = {prob_approx:.4f}") print(f"Exact P(X ≤ {k}) = {prob_exact:.4f}")
n <- 100 p <- 0.4 k <- 35 mu <- n * p sigma <- sqrt(n * p * (1 - p)) # Continuity correction for P(X <= k) k_corrected <- k + 0.5 prob_approx <- pnorm(k_corrected, mean = mu, sd = sigma) # Exact binomial for comparison prob_exact <- pbinom(k, size = n, prob = p) cat("mu =", mu, " sigma =", round(sigma, 4), "\n") cat("Approximated:", round(prob_approx, 4), "\n") cat("Exact:", round(prob_exact, 4), "\n")
For more on working with distributions in R, Khan Academy's normal approximation to binomial article provides a solid complementary reference. Penn State's STAT 414 course materials on approximating the binomial distribution cover the theoretical derivation in more depth.
Related Concepts on Statistics Fundamentals
Binomial Distribution
The exact distribution that normal approximation replaces. Covers the PMF formula, mean, variance, and cumulative probability tables.
Normal Distribution
Properties of the bell curve, the standard normal (Z) distribution, and how to use the Z-table — foundational to applying the approximation.
Z-Score
How to standardize values and interpret their position on the normal distribution — the final step in every normal approximation problem.
Central Limit Theorem
The theoretical result that explains why the normal approximation works: sums of independent random variables converge to normal as n grows.
Sample Proportions
The sampling distribution of p̂ also uses normal approximation, with conditions np ≥ 10 and n(1−p) ≥ 10 in most applied courses.
Hypothesis Testing
Z-tests for proportions rely directly on normal approximation to establish the test statistic's distribution under the null hypothesis.
FAQs
Normal approximation is a method that uses the continuous normal distribution to estimate probabilities for other distributions, especially the discrete binomial distribution. Instead of calculating exact probabilities, you calculate the mean and standard deviation, apply a continuity correction, convert the value to a Z-score, and use the standard normal distribution to find probabilities.
A binomial distribution can be approximated by a normal distribution when both np ≥ 5 and n(1−p) ≥ 5. These conditions ensure the distribution is not heavily skewed and has enough observations in both success and failure categories. Some applications use the stricter rule np ≥ 10 and n(1−p) ≥ 10 for better accuracy.
The continuity correction is used because the binomial distribution is discrete while the normal distribution is continuous. It adjusts integer values by 0.5 so that the continuous normal curve better matches the probability areas of the discrete distribution. This improves approximation accuracy, especially for smaller sample sizes.
For a binomial distribution with parameters n and p, calculate the mean as μ = np and the standard deviation as σ = √(np(1−p)). After applying the continuity correction, convert the value into a Z-score using Z = (x_corrected − μ) / σ. The resulting Z-score is then used with the standard normal distribution to find probabilities.
When the Poisson rate parameter λ is sufficiently large, the Poisson distribution can be approximated by a normal distribution with mean μ = λ and standard deviation σ = √λ. A continuity correction is also applied. The approximation becomes more accurate as λ increases because the Poisson distribution becomes more symmetric.
The accuracy of normal approximation improves as sample size increases and as p gets closer to 0.5. Large samples usually produce very accurate results, while smaller samples near the minimum conditions may have noticeable errors. Applying the continuity correction generally improves accuracy compared with using the uncorrected approximation.
The general steps are: identify the distribution and parameters, check that the normal approximation conditions are satisfied, calculate the mean and standard deviation, apply the continuity correction, convert the value into a Z-score, use the standard normal table or software to find the probability, and interpret the result in the context of the problem.
Yes. Larger sample sizes make the normal approximation more accurate because sampling distributions become closer to normal. The approximation error generally decreases as sample size increases. Small samples, especially when success and failure probabilities are unbalanced, can produce less reliable results.
Yes. Normal approximation is widely used in quality control, polling, medical research, and business statistics. It helps estimate probabilities, calculate confidence intervals, and perform hypothesis tests when exact calculations are difficult. Tests for proportions often rely on normal approximation to justify the use of Z statistics.