🔑 Key Takeaways
The most important ideas from this guide — keep these in mind as you work through each section.
Probability quantifies uncertainty. It assigns a value between 0 and 1 to all possible outcomes of a random experiment.
Statistics uses probability to make inferences. Sample data combined with probability models allows conclusions about populations.
There are four types of probability. Theoretical, experimental, subjective, and axiomatic — each used in different contexts.
Probability distributions describe random variables. Normal, binomial, and Poisson distributions are most commonly used in practice.
The Central Limit Theorem connects everything. It explains why normal distributions appear so often in statistical inference.
Bayes’ Theorem is a powerful update rule. It lets you revise probabilities when new evidence becomes available.
Probability tells you what should happen in theory. Statistics tells you what did happen in practice — and uses that to estimate the underlying reality.
Branches of Statistics: Descriptive vs. Inferential
Before diving into probability, it helps to understand the two major branches of statistics that probability supports.
Descriptive statistics summarize and describe the data you actually have using measures of central tendency (mean, median, mode) and spread (variance, standard deviation, IQR). Inferential statistics use sample data and probability models to make predictions and test claims about a larger population.
| Feature | Descriptive Statistics | Inferential Statistics |
|---|---|---|
| Goal | Summarize the data you have | Draw conclusions about a population |
| Focus | Observed dataset | Sample → Population generalization |
| Typical tools | Mean, SD, IQR, charts | Hypothesis tests, confidence intervals |
| Role of probability | Minimal | Central — all inference is probabilistic |
| Output | Descriptive numbers & visuals | Decisions, estimates, p-values |
A third approach — Bayesian statistics — treats probability as a degree of belief and updates it as new data arrives. It offers an alternative route to inference and is foundational to modern machine learning.
Key Measures in Descriptive Statistics
Before applying probability, you need to describe your data. These are the core measures every analyst uses daily.
Mean, Median, and Mode
These three measures describe the center of a dataset.
Example: Exam scores: 65, 72, 75, 78, 80, 82, 85, 88, 90, 95 → Sum = 810 → Mean = 810/10 = 81
The median is the middle value when sorted. For 10 values: Median = (80 + 82) / 2 = 81. Add an outlier score of 200 as an 11th value and the mean jumps to about 91.8 — but the median barely moves, to 82. That robustness makes the median the right choice for skewed data.
The mode is the most frequent value. Shoe sizes: 7, 8, 8, 8, 9, 9, 10 → Mode = 8. Mode is the only measure that works with categorical data (e.g., "most popular color").
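To make these calculations concrete, here is a minimal sketch using Python's built-in statistics module; the numbers match the worked examples above.

```python
from statistics import mean, median, mode

scores = [65, 72, 75, 78, 80, 82, 85, 88, 90, 95]
print(mean(scores))            # 81
print(median(scores))          # 81.0 — average of the two middle values, 80 and 82

print(mean(scores + [200]))    # ~91.8 — one outlier drags the mean up sharply
print(median(scores + [200]))  # 82 — but barely moves the median

shoe_sizes = [7, 8, 8, 8, 9, 9, 10]
print(mode(shoe_sizes))        # 8 — the most frequent value
```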
Variance and Standard Deviation
These measure how spread out data points are from the mean. For a sample: s² = Σ(xᵢ − x̄)² / (n − 1) and s = √s².
For the exam scores above, the squared deviations from the mean of 81 sum to 726, giving a sample variance of 726/9 ≈ 80.7 and a standard deviation of about 9.0 — most scores fall within about 9 points of the average. Variance is in squared units (points²); standard deviation returns to the original units (points), making it easier to interpret.
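A quick check of these spread calculations, again with the standard library (note the sample vs. population versions):

```python
from statistics import pstdev, pvariance, stdev, variance

scores = [65, 72, 75, 78, 80, 82, 85, 88, 90, 95]
print(variance(scores))   # ~80.67 — sample variance, divides by n − 1
print(stdev(scores))      # ~8.98  — sample standard deviation
print(pvariance(scores))  # 72.6   — population variance, divides by N
print(pstdev(scores))     # ~8.52  — population standard deviation
```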
Z-Score
Z-scores standardize data to a common scale, enabling comparison across different datasets: z = (x − x̄) / s. For the exam scores, the top score of 95 has z = (95 − 81) / 8.98 ≈ 1.56, meaning it sits about 1.6 standard deviations above average.
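The same calculation in code, using the sample statistics from above:

```python
from statistics import mean, stdev

scores = [65, 72, 75, 78, 80, 82, 85, 88, 90, 95]
z = (95 - mean(scores)) / stdev(scores)
print(round(z, 2))  # ~1.56 — about 1.6 standard deviations above the mean
```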
Introduction to Probability — Definition and Types
Probability is the numerical measure of the likelihood that an event will occur. It is always a value between 0 (impossible) and 1 (certain); when all outcomes are equally likely, it equals the ratio of favorable outcomes to total possible outcomes.
Key Terminology
| Term | Definition | Example |
|---|---|---|
| Sample Space (S) | All possible outcomes of an experiment | Rolling a die: S = {1,2,3,4,5,6} |
| Event (A) | A subset of the sample space | A = rolling an even number = {2,4,6} |
| Complement (A') | All outcomes NOT in event A | A' = rolling an odd number = {1,3,5} |
| Union (A∪B) | Outcomes in A or B or both | A∪B = rolling even OR >4 = {2,4,5,6} |
| Intersection (A∩B) | Outcomes in both A and B | A∩B = rolling even AND >4 = {6} |
| Mutually Exclusive | Events that cannot both occur | Rolling a 1 AND rolling a 6 simultaneously |
| Independent Events | One event does not affect the other | Two separate coin flips |
Type 1: Theoretical Probability
Based on mathematical reasoning assuming all outcomes are equally likely. Example: The probability of drawing an Ace from a standard deck = 4/52 = 1/13 ≈ 0.077. No experiment needed — it follows from the structure of the deck.
Type 2: Experimental (Empirical) Probability
Based on actual observed frequencies from repeated trials: P(A) ≈ (number of times A occurred) / (total number of trials). Example: if a coin lands heads 520 times in 1,000 flips, the experimental probability of heads is 0.52.
Type 3: Subjective Probability
Based on personal judgment, expert opinion, or experience rather than calculation or experiments. Examples: A weather forecaster says "70% chance of rain tomorrow." A surgeon estimates a "90% chance of successful recovery." These are not calculated — they are expert estimates.
Type 4: Axiomatic Probability (Kolmogorov's Axioms)
The mathematical foundation of all probability theory. Andrei Kolmogorov (1933) defined three axioms:
- Axiom 1: P(A) ≥ 0 for any event A (probability is non-negative)
- Axiom 2: P(S) = 1 (the probability of the entire sample space is 1)
- Axiom 3: For mutually exclusive events A and B: P(A∪B) = P(A) + P(B)
All other probability rules are derived from these three axioms.
Probability Rules and Formulas
These five rules are the toolkit for solving almost every probability problem. The quick reference table below summarizes all of them.
| Rule | Formula | When to Use |
|---|---|---|
| Complement Rule | P(A') = 1 − P(A) | When it's easier to find P(not A) |
| Addition Rule (General) | P(A∪B) = P(A) + P(B) − P(A∩B) | Any two events |
| Addition Rule (Mutually Exclusive) | P(A∪B) = P(A) + P(B) | Events that can't both happen |
| Multiplication Rule (General) | P(A∩B) = P(A) × P(B\|A) | Any two events |
| Multiplication Rule (Independent) | P(A∩B) = P(A) × P(B) | Independent events only |
| Conditional Probability | P(A\|B) = P(A∩B) / P(B) | Probability of A given B occurred |
| Bayes' Theorem | P(A\|B) = P(B\|A)·P(A) / P(B) | Updating belief with new evidence |
Complement Rule
The probability that event A does not occur equals 1 minus the probability it does: P(A') = 1 − P(A).
Addition Rule
For the probability that event A or event B (or both) occur: P(A∪B) = P(A) + P(B) − P(A∩B). When A and B are mutually exclusive, the intersection term is zero and the rule reduces to P(A∪B) = P(A) + P(B).
Multiplication Rule
For the probability that both events A and B occur: P(A∩B) = P(A) × P(B|A). For independent events this simplifies to P(A∩B) = P(A) × P(B).
Conditional Probability
The probability of A occurring given that B has already occurred: P(A|B) = P(A∩B) / P(B), provided P(B) > 0.
Worked Example: In a class of 30 students: 18 study Math (M), 12 study Science (S), 6 study both. What is P(M|S) — probability a student studies Math given they study Science?
P(M∩S) = 6/30 = 0.2 | P(S) = 12/30 = 0.4 → P(M|S) = 0.2 / 0.4 = 0.5
So half of Science students also study Math.
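The same computation as a quick sanity check in Python:

```python
total, science, both = 30, 12, 6

p_m_and_s = both / total   # P(M∩S) = 0.2
p_s = science / total      # P(S) = 0.4
print(p_m_and_s / p_s)     # P(M|S) = 0.5 — half of Science students study Math
```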
Bayes' Theorem
Bayes' Theorem lets you reverse a conditional probability — updating the probability of a hypothesis given new evidence. It is the foundation of Bayesian statistics and underlies spam filters, medical diagnosis, and recommendation engines.
Classic Example — Medical Test: A disease affects 1% of the population. A test is 95% accurate (sensitivity = 95%, false positive rate = 5%). You test positive. What is the probability you actually have the disease?
- P(Disease) = 0.01 (prior)
- P(Positive | Disease) = 0.95 (sensitivity)
- P(Positive | No Disease) = 0.05 (false positive rate)
- P(Positive) = (0.95 × 0.01) + (0.05 × 0.99) = 0.0095 + 0.0495 = 0.059
- P(Disease | Positive) = (0.95 × 0.01) / 0.059 = 0.0095 / 0.059 ≈ 16.1%
Most people intuitively assume a 95% accurate test means a ~95% chance of being sick. Bayes' Theorem shows the true probability is only ~16% because the disease is rare. This counterintuitive result is why Bayes' Theorem is critical in medical diagnosis and AI systems.
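A small helper makes it easy to see how the posterior changes with the prior. This is a sketch of the calculation above, not a library API:

```python
def posterior(prior: float, sensitivity: float, false_positive_rate: float) -> float:
    """P(Disease | Positive) via Bayes' Theorem."""
    p_positive = sensitivity * prior + false_positive_rate * (1 - prior)
    return sensitivity * prior / p_positive

print(round(posterior(0.01, 0.95, 0.05), 3))  # 0.161 — the example above
print(round(posterior(0.10, 0.95, 0.05), 3))  # 0.679 — a higher prior changes everything
```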
Random Variables and Probability Distributions
A random variable is a numerical value assigned to the outcome of a random experiment. It connects probability theory to measurable data.
| Type | Description | Examples |
|---|---|---|
| Discrete | Countable, specific values | Number of heads in 5 flips, defective items in a batch |
| Continuous | Any value within a range | Height, temperature, time to complete a task |
Expected Value and Variance of a Random Variable
For a discrete random variable X: E(X) = Σ xᵢ·P(xᵢ) and Var(X) = Σ (xᵢ − E(X))²·P(xᵢ) = E(X²) − [E(X)]². Example: for a fair die, E(X) = (1+2+3+4+5+6)/6 = 3.5.
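Applying these definitions directly to the fair-die example:

```python
outcomes = [1, 2, 3, 4, 5, 6]
p = 1 / 6                                        # each face equally likely

ev = sum(x * p for x in outcomes)                # E(X) = 3.5
var = sum((x - ev) ** 2 * p for x in outcomes)   # Var(X) ≈ 2.92
print(ev, round(var, 2))
```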
Binomial Distribution
Models the number of successes in n independent trials, each with probability p of success: P(X = k) = C(n,k) · pᵏ · (1−p)^(n−k).
Conditions: Fixed n trials, binary outcome (success/failure), constant p, independent trials.
Example: Probability of exactly 3 heads in 5 fair coin flips (p = 0.5, n = 5, k = 3):
P(X=3) = C(5,3) × (0.5)³ × (0.5)² = 10 × 0.125 × 0.25 = 0.3125
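The PMF translates directly into code with math.comb:

```python
from math import comb

def binomial_pmf(k: int, n: int, p: float) -> float:
    return comb(n, k) * p**k * (1 - p)**(n - k)

print(binomial_pmf(3, 5, 0.5))  # 0.3125 — exactly 3 heads in 5 fair flips
```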
Poisson Distribution
Models the number of events occurring in a fixed time or space interval when events occur independently at a constant average rate λ: P(X = k) = (λᵏ · e^(−λ)) / k!.
Example: A call center receives 4 calls per minute on average (λ = 4). P(exactly 2 calls in one minute):
P(X=2) = (4² × e⁻⁴) / 2! = (16 × 0.0183) / 2 = 0.1465
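And the Poisson PMF, straight from the formula:

```python
from math import exp, factorial

def poisson_pmf(k: int, lam: float) -> float:
    return lam**k * exp(-lam) / factorial(k)

print(round(poisson_pmf(2, 4.0), 4))  # 0.1465 — exactly 2 calls in one minute
```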
Normal (Gaussian) Distribution
The most important distribution in statistics. It is symmetric, bell-shaped, and defined by its mean (μ) and standard deviation (σ).
The Empirical Rule (68-95-99.7 Rule):
| Range | % of data covered | Example (heights, μ=170cm, σ=10cm) |
|---|---|---|
| μ ± 1σ (160–180 cm) | ~68% | About 68 in 100 people |
| μ ± 2σ (150–190 cm) | ~95% | About 95 in 100 people |
| μ ± 3σ (140–200 cm) | ~99.7% | Almost everyone |
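You can verify the empirical rule by simulation — here with 100,000 normally distributed "heights" (μ = 170 cm, σ = 10 cm):

```python
import random

random.seed(42)
mu, sigma, n = 170, 10, 100_000
heights = [random.gauss(mu, sigma) for _ in range(n)]

for k in (1, 2, 3):
    share = sum(mu - k * sigma <= h <= mu + k * sigma for h in heights) / n
    print(f"within ±{k}σ: {share:.1%}")  # ≈ 68.3%, 95.4%, 99.7%
```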
Other Key Distributions (Quick Reference)
| Distribution | Type | Mean | Variance | Best Used For |
|---|---|---|---|---|
| Binomial | Discrete | np | np(1−p) | Fixed trials, binary outcome |
| Poisson | Discrete | λ | λ | Events per time/space unit |
| Normal | Continuous | μ | σ² | Natural phenomena, CLT applications |
| Exponential | Continuous | 1/λ | 1/λ² | Time between Poisson events |
| Uniform | Continuous | (a+b)/2 | (b−a)²/12 | Equal likelihood over a range |
| Geometric | Discrete | 1/p | (1−p)/p² | Trials until first success |
Inferential Statistics — Hypothesis Testing and Confidence Intervals
Inferential statistics uses probability distributions to make decisions about populations from sample data. This is where probability and statistics merge most powerfully.
What is Hypothesis Testing?
Hypothesis testing is a formal 5-step process to determine whether sample data provides enough evidence to reject a claim about a population.
- State hypotheses: H₀ (null) and H₁ (alternative)
- Choose significance level: Usually α = 0.05
- Select and compute test statistic: t, z, F, χ², etc.
- Calculate the p-value
- Decision: If p < α → reject H₀; If p ≥ α → fail to reject H₀
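As a sketch of the full five-step flow, here is a one-sample t-test in Python. It assumes SciPy is installed, and the battery-life data is invented for illustration:

```python
from scipy import stats

# Step 1 — H0: mean battery life = 10 h, H1: mean ≠ 10 h
sample = [9.8, 10.2, 9.5, 9.9, 10.1, 9.7, 9.6, 10.0, 9.4, 9.8]
alpha = 0.05                                   # Step 2 — significance level

t_stat, p_value = stats.ttest_1samp(sample, popmean=10.0)  # Steps 3–4
print(f"t = {t_stat:.2f}, p = {p_value:.3f}")  # t ≈ -2.45, p ≈ 0.037

if p_value < alpha:                            # Step 5 — decision
    print("Reject H0")
else:
    print("Fail to reject H0")
```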
| Error Type | What It Means | Probability |
|---|---|---|
| Type I Error (α) | Rejecting H₀ when it is actually true (false positive) | Controlled by significance level α |
| Type II Error (β) | Failing to reject H₀ when it is actually false (false negative) | Related to statistical power (1−β) |
P-Value Explained
The p-value is the probability of observing results as extreme as — or more extreme than — your data, assuming the null hypothesis is true. A small p-value indicates the observed result is unlikely under H₀.
A p-value of 0.03 does NOT mean "there is a 3% chance the null hypothesis is true." It means "if H₀ were true, there's only a 3% chance of seeing data this extreme." The distinction matters enormously in practice.
Confidence Intervals
A confidence interval is a range of plausible values for a population parameter, calculated from sample data at a specified confidence level. For a mean with unknown σ: x̄ ± t* · s/√n, where t* is the critical value for the chosen confidence level.
A 95% CI does NOT mean "there's a 95% probability the true mean is in this interval." It means: if you repeated the sampling process 100 times, about 95 of the resulting intervals would contain the true population mean.
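A sketch of a 95% t-based confidence interval for the battery-life sample from the hypothesis-testing example (SciPy assumed, for the critical value):

```python
from math import sqrt
from statistics import mean, stdev
from scipy import stats

sample = [9.8, 10.2, 9.5, 9.9, 10.1, 9.7, 9.6, 10.0, 9.4, 9.8]
n, x_bar, s = len(sample), mean(sample), stdev(sample)

t_crit = stats.t.ppf(0.975, df=n - 1)  # two-sided 95% critical value
margin = t_crit * s / sqrt(n)
print(f"95% CI: ({x_bar - margin:.2f}, {x_bar + margin:.2f})")  # ≈ (9.62, 9.98)
```

Note that the interval excludes 10, which is consistent with rejecting H₀ at α = 0.05 in the earlier test.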
Common Statistical Tests — Quick Reference
| Test | Used When | Key Assumption |
|---|---|---|
| One-sample t-test | Compare sample mean to known value | Approximately normal data, unknown σ |
| Two-sample t-test | Compare means of two independent groups | Independent samples, approximately normal |
| Paired t-test | Compare two related measurements (pre/post) | Differences are approximately normal |
| Chi-square test | Test independence of categorical variables | Expected frequency ≥ 5 in each cell |
| ANOVA | Compare means of 3+ groups | Normal data, equal variances |
| Pearson Correlation | Measure linear relationship between two continuous variables | Linear relationship, bivariate normal |
The Central Limit Theorem
The Central Limit Theorem (CLT) is arguably the most important theorem in statistics: as sample size increases, the sampling distribution of the sample mean approaches a normal distribution — regardless of the original population's shape.
In practice, n ≥ 30 is typically sufficient. This is why t-tests, z-tests, and confidence intervals work even when your data isn't perfectly normal — the sample mean will be approximately normally distributed anyway.
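A quick simulation shows the CLT at work — means of samples drawn from a heavily skewed exponential population still cluster into a near-normal shape:

```python
import random
from statistics import mean, stdev

random.seed(0)
sample_means = [
    mean(random.expovariate(1.0) for _ in range(30))  # n = 30 draws per sample
    for _ in range(10_000)
]

# Exponential(rate=1) has mean 1 and sd 1, so the sample means should
# center near 1.0 with sd ≈ 1/sqrt(30) ≈ 0.18 — and look roughly normal.
print(round(mean(sample_means), 3), round(stdev(sample_means), 3))
```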
Probability Laws and Theorems
Law of Large Numbers
As the number of trials increases, the experimental probability converges to the theoretical probability. After 10 flips, you might see 40% heads — but after 10,000 flips, you'll be very close to 50%.
The Law of Large Numbers justifies using historical data to estimate probabilities — and explains why insurance companies, casinos, and epidemiologists can make reliable predictions despite individual uncertainty.
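Simulating coin flips makes the convergence visible:

```python
import random

random.seed(1)
heads = 0
for i in range(1, 100_001):
    heads += random.random() < 0.5            # one fair flip
    if i in (10, 100, 1_000, 10_000, 100_000):
        print(f"{i:>7,} flips: {heads / i:.3f}")  # drifts toward 0.500
```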
Law of Total Probability
If events B₁, B₂, ..., Bₙ form a partition of the sample space (mutually exclusive and exhaustive), then: P(A) = P(A|B₁)·P(B₁) + P(A|B₂)·P(B₂) + ... + P(A|Bₙ)·P(Bₙ). This is exactly how P(Positive) was computed in the Bayes' Theorem medical test example above.
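For example, the overall defect rate of a factory with two machines (hypothetical numbers for illustration):

```python
shares = {"machine A": 0.6, "machine B": 0.4}          # P(B_i): share of output
defect_rates = {"machine A": 0.02, "machine B": 0.05}  # P(defect | B_i)

p_defect = sum(defect_rates[m] * shares[m] for m in shares)
print(round(p_defect, 3))  # 0.012 + 0.020 = 0.032
```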
Permutations vs. Combinations
| | Permutations (order matters) | Combinations (order does not matter) |
|---|---|---|
| Formula | nPr = n! / (n−r)! | nCr = n! / [r!(n−r)!] |
| Example | 4-digit PIN from digits 1–9, no repeats: 9P4 = 3,024 | Lottery: choose 6 from 49: C(49,6) = 13,983,816 |
| Use when | Arranging, ordering, passwords, rankings | Selecting a group, combinations, committees |
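Both counts come straight from the standard library:

```python
from math import comb, perm

print(perm(9, 4))   # 3024 — ordered 4-digit codes from digits 1–9, no repeats
print(comb(49, 6))  # 13983816 — unordered lottery picks
```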
Common Probability Misconceptions
Gambler's Fallacy: The belief that past independent events affect future ones. After 10 consecutive heads, many people think tails is "due" — but each flip is still 50/50. Past flips have zero influence on independent future flips.
Base Rate Neglect: Ignoring the prior probability of an event (as in the Bayes' theorem medical test example above). A highly accurate test can still produce mostly false positives if the disease is rare.
Hot Hand Fallacy: Believing a player is "on a streak" and more likely to succeed next time. Research shows most streaks in sports are consistent with random chance rather than genuine hot hands.
7 Real-World Applications of Statistics and Probability
Statistics and probability are not abstract — they drive decisions in virtually every field.
1. Medicine and Clinical Trials — Hypothesis testing determines whether a new drug outperforms a placebo. Bayes' Theorem improves diagnostic accuracy. Confidence intervals define the range of effective doses. The p-value threshold α = 0.05 governs drug approval in most regulatory systems.
2. Finance and Insurance — Actuaries use probability distributions to price insurance policies. Portfolio managers measure investment risk using standard deviation and Value at Risk (VaR). Options pricing (Black-Scholes) relies on normal distribution assumptions.
3. Weather Forecasting — Meteorologists apply Bayesian methods to update precipitation probabilities as new atmospheric data arrives. "70% chance of rain" is a direct application of probability to decision-making under uncertainty.
4. Machine Learning and AI — Naive Bayes classifiers, Gaussian mixture models, probabilistic neural networks, and model validation metrics (precision, recall, AUC) all depend on probability theory. Every ML model is fundamentally a statistical model.
5. Sports Analytics — Win probability models update in real time using conditional probability. Player performance distributions guide contract decisions. Regression to the mean explains why breakout seasons are often followed by normal ones.
6. Quality Control in Manufacturing — Statistical Process Control (SPC) uses control charts to detect when a production process drifts outside acceptable limits. Six Sigma programs reduce defects to fewer than 3.4 per million opportunities using normal distribution principles.
7. Social Sciences and Polling — Political polls use random sampling and report margins of error (confidence intervals). Chi-square tests analyze relationships in survey data. Sampling theory ensures results from 1,000 respondents can reliably represent millions.
Statistics vs. Probability — Key Differences
Statistics starts with observed data and works backward to infer the underlying model. Probability starts with a known model and works forward to predict outcomes. They are complementary — not competing — disciplines.
| Aspect | Statistics | Probability |
|---|---|---|
| Definition | Science of collecting and analyzing data | Mathematical measure of likelihood of outcomes |
| Direction of reasoning | Data → Model (inductive) | Model → Data (deductive) |
| Starting point | Observed sample data | Known probability model / sample space |
| Output | Estimates, decisions, p-values | Probabilities of specific outcomes |
| Key tools | Regression, hypothesis tests, confidence intervals | Distributions, Bayes' theorem, combinatorics |
| Example question | "What does this data tell us about the population?" | "What is the chance of rolling at least one 6?" |
Which is Harder?
Probability can feel abstract early on — especially conditional probability, Bayes' theorem, and combinatorics. The paradoxes (Monty Hall, base rate neglect) challenge intuition deeply. Statistics involves more computational work, more assumptions to verify, and more interpretation judgment. Both require mathematical maturity. Most students find probability conceptually challenging initially, then statistics computationally demanding. The good news: mastering one makes the other much easier.
Key Formulas Quick Reference Sheet
Descriptive Statistics Formulas
| Measure | Formula | Notes |
|---|---|---|
| Mean | x̄ = Σxᵢ / n | Arithmetic average |
| Population Variance | σ² = Σ(xᵢ − μ)² / N | Divide by N (entire population) |
| Sample Variance | s² = Σ(xᵢ − x̄)² / (n−1) | Divide by n−1 (Bessel's correction) |
| Standard Deviation | s = √s² | Same units as data |
| Z-Score | z = (x − μ) / σ | Standard deviations from mean |
| IQR | IQR = Q3 − Q1 | Spread of middle 50%, outlier-resistant |
Probability Rules Summary
| Rule | Formula | Key Condition |
|---|---|---|
| Complement | P(A') = 1 − P(A) | Always valid |
| Addition (General) | P(A∪B) = P(A)+P(B)−P(A∩B) | Any two events |
| Addition (Exclusive) | P(A∪B) = P(A)+P(B) | A and B mutually exclusive |
| Multiplication (General) | P(A∩B) = P(A) × P(B\|A) | Any two events |
| Multiplication (Independent) | P(A∩B) = P(A) × P(B) | A and B independent |
| Conditional Probability | P(A\|B) = P(A∩B) / P(B) | P(B) > 0 |
| Bayes' Theorem | P(A\|B) = [P(B\|A)·P(A)] / P(B) | Reversing conditional probability |
Key Distributions Reference
| Distribution | PMF / PDF | Mean | Variance |
|---|---|---|---|
| Binomial | C(n,k)·pᵏ·(1−p)^(n−k) | np | np(1−p) |
| Poisson | (λᵏ·e^−λ) / k! | λ | λ |
| Normal | (1/σ√2π)·e^[−(x−μ)²/2σ²] | μ | σ² |
| Exponential | λ·e^(−λx) for x≥0 | 1/λ | 1/λ² |
| Uniform (continuous) | 1/(b−a) for a≤x≤b | (a+b)/2 | (b−a)²/12 |
Conclusion
Statistics and probability are inseparable tools for understanding uncertainty and making data-driven decisions. Probability gives you the mathematical language to describe randomness. Statistics gives you the methods to learn from it.
Start with the basics — sample space, the four probability types, and the core rules. Then build toward distributions and inferential statistics. Every advanced topic in data science, machine learning, and research rests on the foundation covered in this guide.
Frequently Asked Questions
What is the relationship between probability and statistics?
Probability provides the mathematical framework for quantifying uncertainty, while statistics uses that framework to draw conclusions from real data. Statistical inference — including hypothesis testing and confidence intervals — is built on probability theory.
What are the four types of probability?
The four types are: theoretical probability (based on equally likely outcomes), experimental probability (based on observed data), subjective probability (based on judgment), and axiomatic probability (based on formal mathematical rules).
What is the difference between the mean and the expected value?
The mean is calculated from observed data, while the expected value is a theoretical average based on probabilities. With large data, the sample mean approaches the expected value.
How do you calculate conditional probability?
Use the formula P(A|B) = P(A∩B) / P(B), where P(B) is not zero. It represents the probability of event A occurring given that event B has already occurred.
Why is the normal distribution so important?
It models many natural phenomena like heights, test scores, and measurement errors, and forms the basis of many statistical methods such as hypothesis testing and confidence intervals.
Does p < 0.05 always mean a result is significant?
Not always. The 0.05 threshold is a common guideline, not a strict rule. Significance depends on context, field of study, and practical importance.
What does the Central Limit Theorem say?
With large enough sample sizes, the distribution of sample means becomes approximately normal, regardless of the original data distribution.
What are the most common probability distributions?
Common distributions include normal, binomial, Poisson, exponential, and uniform. Each is used for different types of data and scenarios.
Where is probability used in real life?
It is used in weather forecasts, medical testing, opinion polls, finance, sports analytics, and recommendation systems.
What is the difference between a population and a sample?
A population includes all members of a group, while a sample is a subset used to make inferences about that population.