Understanding Probability
Distributions

An interactive journey through the fundamental patterns that govern randomness in data science. Visualize, experiment, and master the distributions that shape our world.

Discrete

Countable outcomes like coin flips, dice rolls, or success counts.

Uniform Binomial

Continuous

Measurements across a range like height, temperature, or time.

Normal Continuous Uniform

Why Distributions Matter

Model Reality

Capture patterns in sales, weather, and user behavior

Make Predictions

Forecast customer conversions and risk probabilities

Statistical Testing

Choose correct methods for hypothesis testing

DISTRIBUTION 01

Uniform Distribution

The simplest case: every outcome is equally likely. Perfect fairness in probability.

Discrete: Fair Die Simulator

Total Rolls: 0 Expected: 16.67% each

Continuous: Random Range

0
10
f(x) = 1/(b-a) = 0.10

Discrete Formula

P(X = x) = 1/n

Where n = number of possible outcomes. Each outcome has exactly the same probability.

Continuous Formula

f(x) = 1/(b-a) for a ≤ x ≤ b

Probability density is constant across the entire interval [a, b].

Real-World Applications

Random Number Generation Lottery Systems Monte Carlo Simulations Cryptography
DISTRIBUTION 02

Binomial Distribution

Counting successes in fixed trials. The probability of getting exactly k successes in n independent yes/no experiments.

1 10 50
0% 0.5 100%
Expected Value
5.0
μ = n × p

Coin Flip Example

Flip a fair coin 10 times. Probability of exactly 6 heads:

C(10,6) × 0.5⁶ × 0.5⁴ = 210 × 0.015625 × 0.0625 ≈ 20.5%
Requirements: Independent trials, two outcomes, constant probability

Use Cases

  • Quality Control: Defective items in a batch
  • A/B Testing: Click-through rates
  • Medical Trials: Treatment success rates
  • Survey Analysis: Yes/no responses
DISTRIBUTION 03

Normal Distribution

The famous Bell Curve. Nature's favorite pattern for continuous measurements like height, IQ, and measurement errors.

0
1
68% within: ±1σ
95% within: ±2σ
99.7% within: ±3σ

The 68-95-99.7 Rule

68%
Within 1σ of mean
95%
Within 2σ of mean
99.7%
Within 3σ of mean

Probability Density Function

f(x) = (1/σ√2π) × e^-(x-μ)²/(2σ²)

Where μ is the mean and σ is the standard deviation. The curve is perfectly symmetric around the mean.

Properties

  • • Symmetric around the mean (skewness = 0)
  • • Mean = Median = Mode
  • • Follows the Empirical Rule
  • • Area under curve = 1 (100% probability)
  • • Extends infinitely in both directions
FUNDAMENTAL THEOREM

Central Limit Theorem

Why the normal distribution appears everywhere. The mathematical miracle that makes statistics work.

The Big Idea

Take many random samples from any population, calculate their means, and those means will form a normal distribution—regardless of the original population's shape!

1

Sample

Take n random samples from population (n ≥ 30)

2

Calculate Mean

Compute the average of each sample

3

Observe Distribution

Plot all sample means → Bell curve!

30
1000
Original Population
Distribution of Sample Means
Population Mean
3.5
Sample Means Avg
3.5
Standard Error
0.53
σ/√n
0.53

Why CLT Matters

Statistical Inference

We can make conclusions about populations using sample data, even without knowing the population's distribution.

Hypothesis Testing

Most statistical tests assume normality. CLT justifies using these tests with large samples.

Confidence Intervals

We can estimate how precise our sample statistics are using the normal distribution.

Quality Control

Manufacturing processes use CLT to monitor product consistency and detect anomalies.

Quick Reference Guide

Distribution Type When to Use Key Parameters
Uniform Discrete/Continuous Equal probability outcomes a (min), b (max)
Binomial Discrete Counting successes in n trials n (trials), p (prob)
Normal Continuous Natural measurements, errors μ (mean), σ (std dev)
CLT Theorem Sample means from any population n ≥ 30