Binomial distribution

The binomial distribution is a probability distribution that describes the likelihood of obtaining a certain number of successes in a series of independent trials.

 

Each trial has only two possible outcomes: success or failure. Examples include heads or tails, hit or miss, pass or fail.

 

 

Assumptions

  • A fixed number of trials, \( \large n \).
  • Each trial has two outcomes: success or failure.
  • The probability of success is the same in each trial, \( \large p \).
  • The trials are independent of each other.

 

 

The formula

The probability of obtaining exactly \( \large r \) successes in \( \large n \) trials is:

 

$$ \large P(X=r) = \binom{n}{r} p^r (1-p)^{n-r} $$

 

Here \(\large \binom{n}{r}\) is a combination, indicating how many ways the \( \large r \) successes can be placed among the \( \large n \) trials.

 

 

Example 1: Coin toss

A coin is tossed 10 times. The probability of heads is \( \large p = 0.5 \).

 

What is the probability of obtaining exactly 6 heads?

 

$$ \large P(X=6) = \binom{10}{6} (0.5)^6 (0.5)^{4} $$

$$ \large P(X=6) = 210 \cdot (0.5)^{10} $$

$$ \large P(X=6) \approx 0.205 $$

 

Thus, there is about a 20.5% probability of obtaining 6 heads.

 

Binomial distribution

The binomial distribution for coin tosses

 

 

Example 2: Dice

A die is rolled 5 times. We define success as rolling a six. Here \( \large p = \tfrac{1}{6} \).

 

The probability of obtaining exactly 2 sixes is:

 

$$ \large P(X=2) = \binom{5}{2} \left(\tfrac{1}{6}\right)^2 \left(\tfrac{5}{6}\right)^3 $$

$$ \large P(X=2) = 10 \cdot \tfrac{1}{36} \cdot \tfrac{125}{216} $$

$$ \large P(X=2) = \tfrac{1250}{7776} $$

$$ \large P(X=2) \approx 0.161 $$

 

To get an overview, we can calculate the probabilities for all possible outcomes:

 

\( \large r \) Probability \( \large P(X=r) \)
0 \( \large 0.401 \)
1 \( \large 0.402 \)
2 \( \large 0.161 \)
3 \( \large 0.032 \)
4 \( \large 0.003 \)
5 \( \large 0.00013 \)

 

We see that it is most likely to obtain 0 or 1 six in 5 rolls, and that the probability quickly becomes very small for higher numbers.

 

 

Binomial distribution

The binomial distribution for dice rolls

 

 

Example 3: Quality control

At a factory, 5% of products are defective. We randomly select 20 products. The probability of finding exactly 3 defective ones is:

 

$$ \large P(X=3) = \binom{20}{3} (0.05)^3 (0.95)^{17} $$

$$ \large P(X=3) = 1140 \cdot (0.000125) \cdot (0.419) $$

$$ \large P(X=3) \approx 0.059 $$

 

To get an overview, we can calculate the probabilities for 0 to 5 defective products:

 

\( \large r \) Probability \( \large P(X=r) \)
0 \( \large 0.358 \)
1 \( \large 0.377 \)
2 \( \large 0.188 \)
3 \( \large 0.059 \)
4 \( \large 0.014 \)
5 \( \large 0.003 \)

 

We see that it is most likely to find 0 or 1 defective product, but there is still a real probability of finding 2 or 3 defective in a sample.

 

 

Binomial distribution

The binomial distribution in quality control

 

 

Properties

  • Mean: \( \large \mu = n \cdot p \)
  • Variance: \( \large \sigma^2 = n \cdot p \cdot (1-p) \)
  • Standard deviation: \( \large \sigma = \sqrt{n \cdot p \cdot (1-p)} \)

 

 

Binomial and normal distribution

When \( \large n \) is large, and \( \large p \) is not too close to 0 or 1, the binomial distribution can be approximated by a normal distribution:

 

$$ \large N(\mu, \sigma^2) = N(n \cdot p, n \cdot p \cdot (1-p)) $$

 

This is useful because the normal distribution is easier to work with for large \( \large n \).

 

 

Applications

  • Statistics: modeling experiments with two outcomes.
  • Biology: the probability that a certain number of plants germinate.
  • Medicine: the probability that a certain number of patients respond to treatment.
  • Quality control: how many defective products are found in a sample.
  • Games and simulations: e.g. coin tosses, dice rolls or other repeated trials.

 

 

Summary

  • The binomial distribution describes the probability of a certain number of successes in \( \large n \) independent trials.
  • For large \( \large n \), the binomial distribution can be approximated by a normal distribution.
  • It has many applications in statistics, probability, biology, medicine, games and quality control.

 

The binomial distribution is one of the most fundamental distributions in probability theory and connects combinatorics with statistics.

 

 

 

Formulas

Binomial distribution

$$ P(X = r) = \binom{n}{r} \, p^r \, (1-p)^{\,n-r} $$

Mean

$$ \mu = n \cdot p $$

Variance

$$ \sigma^2 = n \cdot p \cdot (1-p) $$

Standard deviation

$$ \sigma = \sqrt{n \cdot p \cdot (1-p)} $$