Geometric Distribution: Trials Until First Success


The geometric distribution is based on a sequence of independent Bernoulli trials that continues until the first success occurs. The number of trials is not fixed in advance; instead, the experiment stops as soon as success is observed. The random variable represents how long it takes to achieve the first success, making the timing of success — not the total count — the central focus.



The Probabilistic Experiment Behind the Geometric Distribution


The geometric distribution counts the number of trials required until the first success occurs. Unlike the binomial distribution, the number of trials is not fixed in advance. Instead, trials continue until success happens for the first time.

Each trial is a Bernoulli experiment: two outcomes, constant success probability, and independence between trials. The random variable counts when success occurs, not how many successes occur in total. This makes the geometric distribution fundamentally about waiting time rather than accumulation.

A defining characteristic of the geometric distribution is the memoryless property: the probability that success occurs in future trials does not depend on how many failures have already occurred. The process effectively “restarts” after each failure.



Example:

Flipping a coin repeatedly until the first heads appears. If X = 4, this means the first three flips were tails and the fourth flip was heads. The experiment stops as soon as success occurs.
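
To make the experiment concrete, here is a minimal Python sketch of this process (the function name simulate_first_success and the choice of p = 0.5 are illustrative, not part of the original example):

```python
import random

def simulate_first_success(p):
    """Run Bernoulli(p) trials until the first success; return the trial number."""
    trial = 0
    while True:
        trial += 1
        if random.random() < p:  # success with probability p
            return trial

# Flip a fair coin (p = 0.5) until the first heads appears.
print(simulate_first_success(0.5))  # e.g. 4 means: tails, tails, tails, heads
```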


Notation Used


X \sim \text{Geom}(p) or X \sim \text{Geometric}(p) — distribution of the random variable.

\text{Geom}(p) — used to denote the distribution itself (not the random variable).

G(p) — less common shorthand in some texts or software contexts.

P(X = k) = (1 - p)^{k - 1} p, \quad \text{for } k = 1, 2, 3, \ldots — probability mass function (PMF), where:

p — probability of success on each trial

k — number of trials until the first success

Alternative PMF formulation:

P(X = k) = (1 - p)^k p, \quad \text{for } k = 0, 1, 2, \ldots — number of failures before the first success

Alternative notations:

q = 1 - p — probability of failure, so the PMF can be written as P(X = k) = q^{k-1} p

Key properties:

E[X] = \frac{1}{p} — expected value (mean)

\text{Var}(X) = \frac{1-p}{p^2} — variance

Memoryless property:

P(X > m + n \mid X > m) = P(X > n) — the distribution has no memory
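
A quick numerical illustration of the memoryless property, as a sketch using the tail probability P(X > k) = (1 - p)^k (the values p = 0.3, m = 4, n = 2 are chosen arbitrarily):

```python
p, m, n = 0.3, 4, 2

def tail(k, p):
    """P(X > k) = (1 - p)**k for the geometric distribution on 1, 2, 3, ..."""
    return (1 - p) ** k

# P(X > m + n | X > m) = P(X > m + n) / P(X > m)
conditional = tail(m + n, p) / tail(m, p)
print(conditional)  # approximately 0.49
print(tail(n, p))   # approximately 0.49, equal to P(X > n), as the property states
```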

See All Probability Symbols and Notations

Parameters


p: probability of success on a single trial, with 0 < p ≤ 1

The geometric distribution models the number of trials needed to get the first success in a sequence of independent Bernoulli trials.

There's only one parameter — p, the chance of success each time — which completely determines the shape of the distribution.

The outcomes are positive integers: 1, 2, 3, \ldots, where each value represents the trial number on which success first occurs.

Probability Mass Function (PMF) and Support (Range)


The probability mass function (PMF) of a geometric distribution is given by:

P(X = k) = (1-p)^{k-1} p, \quad k = 1, 2, 3, \ldots



* First Success: The geometric distribution models the number of trials needed to get the first success in a sequence of independent Bernoulli trials.

* Support (Range of the Random Variable):
* The random variable X can take on values 1, 2, 3, \ldots (all positive integers).
* X = k means the first success occurs on the k-th trial.
* The support is thus a countably infinite set.

* Logic Behind the Formula:
* (1-p)^{k-1}: The probability of getting k-1 failures before the first success
* p: The probability of success on the k-th trial
* The total probability sums to 1 (a numerical check follows this list):

\sum_{k=1}^{\infty} P(X = k) = \sum_{k=1}^{\infty} (1-p)^{k-1} p = p \sum_{k=1}^{\infty} (1-p)^{k-1} = p \cdot \frac{1}{1-(1-p)} = p \cdot \frac{1}{p} = 1

* This uses the geometric series formula: \sum_{k=0}^{\infty} r^k = \frac{1}{1-r} for |r| < 1
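
As a sketch of that normalization check, the infinite series can be truncated at a large cutoff (the value p = 0.25 and the cutoff of 1000 terms are arbitrary choices):

```python
p = 0.25
# Truncated version of the infinite sum of (1 - p)^(k-1) * p over k = 1, 2, 3, ...
total = sum((1 - p) ** (k - 1) * p for k in range(1, 1001))
print(total)  # approximately 1.0; the truncation error (1 - p)**1000 is negligible
```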

Geometric Distribution

Trials until first success (probability p)

Explanation

The geometric distribution models the number of trials needed to get the first success in repeated independent trials. The probability mass function is P(X = k) = (1-p)^{k-1} p. The expected value is E[X] = \frac{1}{p} and the variance is \text{Var}(X) = \frac{1-p}{p^2}. Common applications include counting coin flips until the first heads, attempts until the first sale is made, or trials until equipment failure occurs.


Cumulative Distribution Function (CDF)


The cumulative distribution function (CDF) of a geometric distribution is given by:

F_X(k) = P(X \leq k) = 1 - (1-p)^k


Where:
p = probability of success on each trial
k = number of trials up to and including the first success (where k \geq 1)
(1-p) = probability of failure on each trial

Intuition Behind the Formula


Definition: The CDF gives the probability that the first success occurs on or before trial k.

Complement Approach:
Instead of summing probabilities from 1 to k, it's easier to use the complement. The event "first success on or before trial k" is the complement of "first success after trial k", which means all of the first k trials are failures:

P(X \leq k) = 1 - P(\text{all first } k \text{ trials fail}) = 1 - (1-p)^k


Verification by Summation:
We can verify this by summing the PMF:

P(X \leq k) = \sum_{i=1}^{k} (1-p)^{i-1} p = p \sum_{i=0}^{k-1} (1-p)^i = p \cdot \frac{1-(1-p)^k}{1-(1-p)} = 1-(1-p)^k


Monotonic Property: As k increases, (1-p)^k decreases toward 0, so F_X(k) increases toward 1, which reflects the increasing certainty that success will eventually occur.
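
The closed form and the summation can be compared directly; here is a minimal sketch (the values p = 0.2 and k = 5 are arbitrary illustrations):

```python
p, k = 0.2, 5

# Closed-form CDF: 1 - (1 - p)^k
cdf_closed_form = 1 - (1 - p) ** k

# Same probability obtained by summing the PMF over i = 1, ..., k
cdf_by_summation = sum((1 - p) ** (i - 1) * p for i in range(1, k + 1))

print(cdf_closed_form)   # 0.67232
print(cdf_by_summation)  # 0.67232, the same value for P(X <= 5)
```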

Geometric Distribution CDF

CDF for waiting time until first success

CDF Explanation

The geometric CDF has a closed form: F(k) = P(X \leq k) = 1 - (1-p)^k for k \geq 1. This represents the probability that the first success occurs on or before trial k. The CDF starts at F(1) = p (success on the first trial) and asymptotically approaches 1.0 as k increases. The rate of increase depends on p: larger values of p lead to faster convergence to 1.0, while smaller values result in a more gradual increase, reflecting the longer expected waiting time.

Expected Value (Mean)


As explained in the general case for calculating expected value, the expected value of a discrete random variable is computed as a weighted sum where each possible value is multiplied by its probability:

E[X] = \sum_{x} x \cdot P(X = x)


For the geometric distribution, we apply this general formula to the specific probability mass function of this distribution.

Formula


E[X] = \frac{1}{p}


Where:
p = probability of success on each trial

Derivation and Intuition


Starting from the general definition and substituting the PMF P(X = k) = (1-p)^{k-1} p for k = 1, 2, 3, \ldots:

E[X] = \sum_{k=1}^{\infty} k \cdot (1-p)^{k-1} p = p \sum_{k=1}^{\infty} k \cdot (1-p)^{k-1}


Using the formula for the derivative of a geometric series, we recognize that:

\sum_{k=1}^{\infty} k \cdot r^{k-1} = \frac{1}{(1-r)^2}


Substituting r = 1-p:

E[X] = p \cdot \frac{1}{(1-(1-p))^2} = p \cdot \frac{1}{p^2} = \frac{1}{p}


The result E[X] = \frac{1}{p} captures an intuitive relationship: if the probability of success on each trial is p, then on average you need \frac{1}{p} trials to achieve the first success. The smaller the probability of success, the more trials you expect to need.

Example


Consider rolling a die until you get a 6, where p = \frac{1}{6}:

E[X] = \frac{1}{1/6} = 6


On average, you expect to need 6 rolls to see the first 6. This makes intuitive sense: with a 1-in-6 chance per roll, the average wait is 6 rolls.
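
A Monte Carlo sketch of this example, assuming a fair die so that each roll succeeds with probability 1/6 (the function name and the sample size of 100,000 are arbitrary illustrative choices):

```python
import random

def trials_until_six():
    """Roll a fair die until a 6 appears; return how many rolls it took."""
    rolls = 0
    while True:
        rolls += 1
        if random.randint(1, 6) == 6:
            return rolls

samples = [trials_until_six() for _ in range(100_000)]
print(sum(samples) / len(samples))  # close to 6, the theoretical E[X] = 1/p
```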

Variance and Standard Deviation


The variance of a discrete random variable measures how spread out the values are around the expected value. It is computed as:

\mathrm{Var}(X) = \mathbb{E}[(X - \mu)^2] = \sum_{x} (x - \mu)^2 P(X = x)


Or using the shortcut formula:

\mathrm{Var}(X) = \mathbb{E}[X^2] - \mu^2


For the geometric distribution, we apply this formula to derive the variance.

Formula


\mathrm{Var}(X) = \frac{1-p}{p^2}


Where:
p = probability of success on each trial
(1-p) = q = probability of failure on each trial

Derivation and Intuition


Starting with the shortcut formula, we need to calculate \mathbb{E}[X^2].

We know from the expected value section that \mu = \frac{1}{p}.

Using the PMF P(X = k) = (1-p)^{k-1} p and applying summation techniques involving derivatives of geometric series:

\mathbb{E}[X^2] = \sum_{k=1}^{\infty} k^2 (1-p)^{k-1} p = \frac{2-p}{p^2}


Applying the shortcut formula:

\mathrm{Var}(X) = \frac{2-p}{p^2} - \left(\frac{1}{p}\right)^2 = \frac{2-p}{p^2} - \frac{1}{p^2} = \frac{1-p}{p^2}


The result \mathrm{Var}(X) = \frac{1-p}{p^2} shows that variance increases as p decreases. When success is rare (small p), the waiting time becomes highly variable — sometimes you succeed quickly, sometimes you wait a very long time. The quadratic relationship with p in the denominator means variance grows rapidly as p approaches zero.

Standard Deviation


\sigma = \sqrt{\frac{1-p}{p^2}} = \frac{\sqrt{1-p}}{p}


Example


Consider rolling a die until you get a 6, where p = \frac{1}{6}:

\mathrm{Var}(X) = \frac{1 - \frac{1}{6}}{\left(\frac{1}{6}\right)^2} = \frac{\frac{5}{6}}{\frac{1}{36}} = \frac{5}{6} \times 36 = 30


\sigma = \sqrt{30} \approx 5.477


The variance of 30 and standard deviation of about 5.5 indicate substantial variability around the expected wait time of 6 rolls. You might get lucky and succeed on roll 2, or unlucky and wait 15+ rolls.
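
The same die-rolling simulation used for the mean can also check these values empirically; a sketch (the sample size is an arbitrary choice, and larger samples give tighter agreement):

```python
import random
from statistics import pvariance, pstdev

def trials_until_six():
    """Roll a fair die until a 6 appears; return how many rolls it took."""
    rolls = 0
    while True:
        rolls += 1
        if random.randint(1, 6) == 6:
            return rolls

samples = [trials_until_six() for _ in range(100_000)]
print(pvariance(samples))  # close to 30, the theoretical (1 - p) / p**2
print(pstdev(samples))     # close to 5.477
```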

Applications and Examples


Practical Example


Suppose you're rolling a fair six-sided die until you get a 6. The probability of rolling a 6 is p = \frac{1}{6}. The probability that you need exactly k = 4 rolls to get your first 6 is:

P(X = 4) = \left(\frac{5}{6}\right)^{4-1} \cdot \frac{1}{6} = \left(\frac{5}{6}\right)^{3} \cdot \frac{1}{6} \approx 0.096

This means there's about a 9.6% chance that you'll need exactly 4 rolls to get your first 6.
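
This probability can be checked directly from the PMF; a minimal sketch (the SciPy cross-check is optional and assumes the library is installed; scipy.stats.geom uses the same convention of counting trials up to and including the first success):

```python
p, k = 1 / 6, 4

# Direct evaluation of the PMF: (1 - p)^(k-1) * p
prob = (1 - p) ** (k - 1) * p
print(prob)  # approximately 0.0965, about a 9.6% chance

# Optional cross-check with SciPy, if it is installed:
try:
    from scipy.stats import geom
    print(geom.pmf(k, p))  # agrees with the direct computation
except ImportError:
    pass
```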