Probability Terms and Definitions


57 terms

Discrete Distributions

(7 items)

Negative Binomial Distribution

A discrete distribution counting the number of trials needed to achieve a fixed number $r$ of successes in a sequence of independent Bernoulli trials with success probability $p$.

The negative binomial distribution answers "how many trials until $r$ successes?" It generalizes the geometric distribution, which is the special case $r = 1$.

Bernoulli Distribution

A discrete distribution for a single trial with two outcomes: $P(X = 1) = p$ and $P(X = 0) = 1 - p$.

The Bernoulli distribution models a single yes/no trial. It is the simplest discrete distribution and the building block for the binomial, geometric, and negative binomial distributions.

Binomial Distribution

A discrete distribution counting the number of successes in $n$ independent Bernoulli trials, each with success probability $p$.

The binomial distribution answers "how many successes?" in a fixed number of independent trials. It requires a fixed number of trials, constant success probability, and independence between trials.
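The binomial PMF, $P(X = k) = \binom{n}{k} p^k (1-p)^{n-k}$, can be computed directly with Python's standard library. A minimal sketch (the helper name `binom_pmf` is ours, not from any package):

```python
from math import comb

def binom_pmf(k, n, p):
    """P(X = k): probability of k successes in n independent Bernoulli(p) trials."""
    return comb(n, k) * p**k * (1 - p)**(n - k)

# Fair coin, 4 flips: P(exactly 2 heads) = C(4,2) * 0.5^4 = 6/16 = 0.375
print(binom_pmf(2, 4, 0.5))  # 0.375
# The PMF sums to 1 over k = 0..n
print(sum(binom_pmf(k, 10, 0.3) for k in range(11)))  # ≈ 1.0
```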

Poisson Distribution

A discrete distribution modelling the number of events occurring in a fixed interval, where events happen independently at a constant average rate $\lambda$.

The Poisson distribution counts rare events in a fixed window of time or space. It applies when events occur independently and the average rate is known.
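As an illustration, the Poisson PMF $P(X = k) = \lambda^k e^{-\lambda} / k!$ is easy to evaluate from scratch (the function name `poisson_pmf` is our own):

```python
from math import exp, factorial

def poisson_pmf(k, lam):
    """P(X = k) = lam^k * e^(-lam) / k! for k = 0, 1, 2, ..."""
    return lam**k * exp(-lam) / factorial(k)

# With an average of lam = 2 events per interval:
print(poisson_pmf(0, 2.0))  # P(no events) = e^(-2) ≈ 0.1353
print(sum(poisson_pmf(k, 2.0) for k in range(50)))  # ≈ 1.0
```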

Discrete Uniform Distribution

A discrete distribution where each of $n$ possible values has equal probability $1/n$.

The discrete uniform distribution applies when every outcome is equally likely — the probability counterpart of "fair." A fair die is the classic example.

Geometric Distribution

A discrete distribution counting the number of trials needed to obtain the first success in a sequence of independent Bernoulli trials with success probability $p$.

The geometric distribution answers "how long until the first success?" It is the discrete analogue of the exponential distribution and shares its memoryless property.
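The memoryless property mentioned above can be checked numerically: with $P(X > n) = (1-p)^n$, the conditional survival $P(X > m + n \mid X > m)$ equals $P(X > n)$. A small sketch, with helper names of our choosing:

```python
def geom_pmf(k, p):
    """P(first success on trial k) = (1-p)^(k-1) * p, for k = 1, 2, ..."""
    return (1 - p)**(k - 1) * p

def geom_sf(n, p):
    """Survival function P(X > n): no success in the first n trials."""
    return (1 - p)**n

p = 0.3
# Memoryless property: P(X > m + n | X > m) = P(X > n)
lhs = geom_sf(5 + 2, p) / geom_sf(5, p)
print(abs(lhs - geom_sf(2, p)) < 1e-12)  # True
```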

Hypergeometric Distribution

A discrete distribution describing the number of successes in $n$ draws without replacement from a finite population containing $K$ successes and $N - K$ failures.

The hypergeometric distribution is used when sampling without replacement. Unlike the binomial, the probability of success changes from draw to draw because the population is finite.
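The hypergeometric PMF is a ratio of binomial coefficients, $P(X = k) = \binom{K}{k}\binom{N-K}{n-k} / \binom{N}{n}$, which the standard library handles directly (the function name is ours):

```python
from math import comb

def hypergeom_pmf(k, N, K, n):
    """P(k successes in n draws without replacement
    from a population of size N containing K successes)."""
    return comb(K, k) * comb(N - K, n - k) / comb(N, n)

# Urn with 10 balls, 4 of them red; draw 3 without replacement.
print(hypergeom_pmf(1, 10, 4, 3))  # C(4,1)*C(6,2)/C(10,3) = 60/120 = 0.5
```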

Foundations

(8 items)

Probability

A function $P$ that assigns to each event $A$ in a sample space a real number $P(A) \in [0, 1]$ satisfying the probability axioms.

Probability quantifies how likely an event is to occur. A value of 0 means impossible, 1 means certain, and values in between reflect degrees of likelihood.

Random Experiment

A process or action whose outcome cannot be predicted with certainty before it is performed.

A random experiment is any procedure that produces an unpredictable result. Rolling a die, drawing a card, or measuring a physical quantity are all random experiments.

Sample Space

$\Omega = \{\omega_1, \omega_2, \ldots\}$ — the set of all possible outcomes of a random experiment.

The sample space is the complete list of everything that can happen. Every probability question begins by identifying this set, because events and probabilities are defined relative to it.

Event

$A \subseteq \Omega$ — a subset of the sample space.

An event is a collection of outcomes we are interested in. It can contain one outcome, several outcomes, or even all outcomes. Probabilities are assigned to events, not to individual outcomes directly.

Elementary Event

An event consisting of exactly one outcome: $\{\omega\}$ where $\omega \in \Omega$.

An elementary event is the simplest possible event — a single outcome from the sample space that cannot be broken down further.

Relative Frequency

$f_n(A) = \frac{\text{number of times } A \text{ occurs}}{n}$ where $n$ is the total number of trials.

Relative frequency is the proportion of times an event occurs in repeated experiments. As the number of trials grows, relative frequency tends to stabilize near the true probability of the event.
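The stabilization of relative frequency is easy to see in simulation. A small sketch with a seeded random number generator for reproducibility:

```python
import random

random.seed(42)  # fixed seed so the run is reproducible
n = 100_000
rolls = [random.randint(1, 6) for _ in range(n)]
rel_freq = rolls.count(6) / n  # relative frequency of rolling a six
print(rel_freq)  # close to the true probability 1/6 ≈ 0.1667
```

With 100,000 rolls the relative frequency typically lands within a few thousandths of $1/6$; smaller samples fluctuate more.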

Probability Measure

A function $P: \mathcal{F} \to [0,1]$ defined on a collection of events, satisfying non-negativity, normalization ($P(\Omega) = 1$), and countable additivity for disjoint events.

A probability measure is the formal rule that assigns a number between 0 and 1 to every event in a way that is internally consistent. It is the mathematical object that makes probability rigorous.

Equally Likely Events

Events $A_1, A_2, \ldots, A_n$ are equally likely when $P(A_1) = P(A_2) = \cdots = P(A_n)$.

When all outcomes in a finite sample space have the same probability, they are equally likely. In this case probability reduces to counting: $P(A) = |A| / |\Omega|$.

Conditional Probability & Independence

(3 items)

Conditional Probability

$P(A \mid B) = \frac{P(A \cap B)}{P(B)}$, defined when $P(B) > 0$.

Conditional probability measures the likelihood of an event given that another event has already occurred. It restricts attention to a smaller part of the sample space.
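The definition can be applied by brute-force enumeration of a finite sample space. A sketch using two fair dice (the event choices are ours, for illustration):

```python
from fractions import Fraction

# Two fair dice: condition on B = "first die shows an even number".
omega = [(i, j) for i in range(1, 7) for j in range(1, 7)]
B = [w for w in omega if w[0] % 2 == 0]           # first die even
A_and_B = [w for w in B if w[0] + w[1] == 8]      # ...and the sum is 8

P_B = Fraction(len(B), len(omega))
P_A_and_B = Fraction(len(A_and_B), len(omega))
print(P_A_and_B / P_B)  # P(A | B) = (3/36) / (18/36) = 1/6
```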

Independent Events

Events $A$ and $B$ are independent if and only if $P(A \cap B) = P(A) \cdot P(B)$.

Two events are independent when the occurrence of one provides no information about the other. Knowing that $B$ happened does not change the probability of $A$.

Mutual Exclusiveness

Events $A$ and $B$ are mutually exclusive if $A \cap B = \emptyset$.

Mutually exclusive events cannot happen at the same time. If one occurs, the other is automatically ruled out.

Random Variables

(5 items)

Bernoulli Experiment

A random experiment with exactly two possible outcomes, conventionally called success ($S$) and failure ($F$).

A Bernoulli experiment is the simplest random experiment: something either happens or it does not. It is the building block for more complex models like the binomial distribution.

Sequence of Bernoulli Trials

A sequence of independent Bernoulli experiments, each with the same success probability $p$.

A sequence of Bernoulli trials repeats the same yes/no experiment independently under identical conditions. The number of successes, the trial of first success, and similar quantities each give rise to a named distribution.

Random Variable

$X: \Omega \to \mathbb{R}$ — a function that assigns a real number to each outcome in the sample space.

A random variable translates outcomes of a random experiment into numbers. This numerical representation makes it possible to compute averages, measure spread, and define distributions.

Discrete Random Variable

A random variable whose set of possible values is finite or countably infinite.

A discrete random variable takes on isolated, separated values that can be listed — even if the list is infinite. Its probability distribution is described by a probability mass function.

Continuous Random Variable

A random variable whose set of possible values forms an interval or union of intervals on the real line.

A continuous random variable can take any value within a range. Probability is spread smoothly rather than concentrated at individual points, and is described by a probability density function.

Distribution Functions

(3 items)

Cumulative Distribution Function

$F_X(x) = P(X \le x)$ for all $x \in \mathbb{R}$.

The CDF tracks how much probability has accumulated up to each value. It answers "how likely is the random variable to be at most $x$?" and works for any type of distribution.
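For a discrete variable the CDF is a step function. A minimal sketch for a fair six-sided die (the function name `die_cdf` is ours):

```python
def die_cdf(x):
    """F_X(x) = P(X <= x) for a fair six-sided die."""
    return sum(1 for face in range(1, 7) if face <= x) / 6

print(die_cdf(3))    # 0.5: three faces (1, 2, 3) lie at or below 3
print(die_cdf(3.5))  # still 0.5: the CDF is flat between faces
print(die_cdf(6))    # 1.0: all probability has accumulated
```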

Probability Mass Function

$p_X(x) = P(X = x)$ — the probability that a discrete random variable $X$ takes the value $x$.

The PMF assigns a probability to each individual value a discrete random variable can take. All values are non-negative and their sum equals 1.

Probability Density Function

A function $f_X(x) \ge 0$ such that $P(a \le X \le b) = \int_a^b f_X(x)\,dx$ and $\int_{-\infty}^{\infty} f_X(x)\,dx = 1$.

The PDF describes how densely probability is spread across values of a continuous random variable. Its value at a point is not a probability — probabilities come from integrating the PDF over intervals.

Measures

(8 items)

Expected Value

$E[X] = \sum_x x \cdot p_X(x)$ (discrete) or $E[X] = \int_{-\infty}^{\infty} x \cdot f_X(x)\,dx$ (continuous).

The expected value is the long-run average of a random variable over many repetitions. It represents the center of mass of the distribution.

Variance

$\operatorname{Var}(X) = E[(X - \mu)^2]$ where $\mu = E[X]$.

Variance measures how spread out a random variable's values are around its expected value. A larger variance means outcomes are more dispersed.
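Expected value and variance can be computed exactly from a PMF. A sketch for a fair die, using exact rational arithmetic:

```python
from fractions import Fraction

# Fair six-sided die: the PMF assigns 1/6 to each face.
pmf = {x: Fraction(1, 6) for x in range(1, 7)}

mean = sum(x * p for x, p in pmf.items())              # E[X]
var = sum((x - mean)**2 * p for x, p in pmf.items())   # E[(X - mu)^2]
print(mean, var)  # 7/2 and 35/12
```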

Standard Deviation

$\sigma_X = \sqrt{\operatorname{Var}(X)}$

Standard deviation is the square root of variance. It measures spread in the same units as the random variable, making it easier to interpret than variance.

Covariance

$\operatorname{Cov}(X, Y) = E[(X - E[X])(Y - E[Y])]$

Covariance measures how two random variables move together. Positive covariance means they tend to increase together; negative means one tends to increase when the other decreases.

Correlation Coefficient

$\rho_{XY} = \frac{\operatorname{Cov}(X, Y)}{\sigma_X \cdot \sigma_Y}$, where $\sigma_X, \sigma_Y > 0$.

The correlation coefficient normalizes covariance to a scale between $-1$ and $1$. It measures the strength and direction of the linear relationship between two random variables.
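A perfectly linear relationship gives correlation $\pm 1$. A sketch over a small joint distribution of our own choosing, where $Y = 2X + 1$:

```python
from math import sqrt

# Joint distribution of (X, Y) with Y = 2X + 1, each point equally likely.
points = [(1, 3), (2, 5), (3, 7)]
p = 1 / 3

ex = sum(x * p for x, _ in points)
ey = sum(y * p for _, y in points)
cov = sum((x - ex) * (y - ey) * p for x, y in points)
var_x = sum((x - ex)**2 * p for x, _ in points)
var_y = sum((y - ey)**2 * p for _, y in points)
rho = cov / (sqrt(var_x) * sqrt(var_y))
print(rho)  # 1.0 up to rounding: a perfect positive linear relationship
```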

Conditional Expectation

$E[X \mid Y = y] = \sum_x x \cdot P(X = x \mid Y = y)$ (discrete) or $E[X \mid Y = y] = \int x \cdot f_{X|Y}(x \mid y)\,dx$ (continuous).

Conditional expectation is the expected value of one random variable given that another takes a specific value. It adjusts the average to account for known information.

Conditional Variance

$\operatorname{Var}(X \mid Y = y) = E[(X - E[X \mid Y = y])^2 \mid Y = y]$

Conditional variance measures the spread of one random variable around its conditional mean, given a specific value of another variable.

Moment of a Random Variable

The $k$-th moment of $X$ about the origin is $E[X^k]$. The $k$-th central moment is $E[(X - \mu)^k]$.

Moments are numerical summaries of a distribution's shape. The first moment is the mean, the second central moment is the variance, and higher moments capture skewness and tail behavior.

Continuous Distributions

(2 items)

Exponential Distribution

A continuous distribution describing the time between events in a process where events occur independently at a constant rate $\lambda$.

The exponential distribution models waiting time — how long until the next event. Its defining feature is the memoryless property: the probability of waiting longer does not depend on how long you have already waited.
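The memoryless property can be verified from the survival function $P(X > t) = e^{-\lambda t}$: the ratio $P(X > s + t)/P(X > s)$ equals $P(X > t)$. A small numeric sketch:

```python
from math import exp

lam = 0.5  # rate parameter (our choice, for illustration)

def survival(t):
    """P(X > t) = e^(-lam * t) for an exponential waiting time."""
    return exp(-lam * t)

# Memoryless property: P(X > s + t | X > s) = P(X > t)
s, t = 3.0, 2.0
print(survival(s + t) / survival(s), survival(t))  # both ≈ e^(-1) ≈ 0.3679
```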

Normal Distribution

A continuous distribution characterized by a symmetric, bell-shaped curve, fully determined by its mean $\mu$ and variance $\sigma^2$.

The normal distribution appears when many small independent effects combine. Its bell curve is symmetric about the mean, with probability decreasing smoothly in both directions.

Multivariate Probability

(11 items)

Bivariate Random Variable

A pair of random variables $(X, Y)$ defined on the same sample space, considered jointly.

A bivariate random variable treats two measurements taken from the same experiment as a single object. Their joint behavior — how they relate, co-occur, or depend on each other — is captured by a joint distribution.

N-Variate Random Variables

A vector $(X_1, X_2, \ldots, X_n)$ of $n$ random variables defined on the same sample space.

An extension of bivariate random variables to any number of components. The joint distribution describes how all $n$ variables behave together.

Independent Random Variables

Random variables $X$ and $Y$ are independent if $P(X \le x, Y \le y) = P(X \le x) \cdot P(Y \le y)$ for all $x, y$.

Independent random variables carry no information about each other. Knowing the value of one does not change the distribution of the other.

Orthogonal Random Variables

Random variables $X$ and $Y$ are orthogonal if $E[XY] = 0$.

Orthogonality is an algebraic condition on the product of two random variables. It is weaker than independence and does not imply zero covariance unless the means are zero.

Uncorrelated Random Variables

Random variables $X$ and $Y$ are uncorrelated if $\operatorname{Cov}(X, Y) = 0$, equivalently $E[XY] = E[X]E[Y]$.

Uncorrelated means no linear relationship between two variables. Independent random variables are always uncorrelated, but uncorrelated variables can still be dependent through nonlinear relationships.
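The classic counterexample of uncorrelated-but-dependent variables is $X$ uniform on $\{-1, 0, 1\}$ with $Y = X^2$. A sketch verifying zero covariance exactly:

```python
from fractions import Fraction

# X uniform on {-1, 0, 1}; Y = X^2. Dependent, yet uncorrelated.
pmf = {x: Fraction(1, 3) for x in (-1, 0, 1)}

ex = sum(x * p for x, p in pmf.items())           # E[X] = 0
ey = sum(x**2 * p for x, p in pmf.items())        # E[Y] = 2/3
exy = sum(x * x**2 * p for x, p in pmf.items())   # E[XY] = E[X^3] = 0
cov = exy - ex * ey
print(cov)  # 0: uncorrelated
# Yet knowing X = 1 forces Y = 1, while unconditionally P(Y = 1) = 2/3,
# so the variables are clearly dependent.
```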

Marginal Distribution

The distribution of one random variable obtained from a joint distribution by summing (discrete) or integrating (continuous) over all values of the other variable(s).

A marginal distribution extracts the standalone behavior of one variable from a joint distribution, ignoring the other variables. In a contingency table, marginals appear as row or column totals.
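Marginalizing a discrete joint PMF is just summing over the other variable. A sketch with a small hypothetical joint table (the numbers are our own):

```python
from fractions import Fraction

# Hypothetical joint pmf of (X, Y) as a small table.
joint = {
    (0, 0): Fraction(1, 8), (0, 1): Fraction(3, 8),
    (1, 0): Fraction(2, 8), (1, 1): Fraction(2, 8),
}

# Marginal of X: sum the joint pmf over all values of Y.
marginal_x = {}
for (x, y), p in joint.items():
    marginal_x[x] = marginal_x.get(x, Fraction(0)) + p

print(marginal_x)  # {0: 1/2, 1: 1/2}; marginals always sum to 1
```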

Joint Cumulative Distribution Function

$F_{X,Y}(x, y) = P(X \le x, Y \le y)$ for all $x, y \in \mathbb{R}$.

The joint CDF extends the cumulative distribution function to multiple random variables. It gives the probability that all variables simultaneously fall below their respective thresholds.

Joint Probability Mass Function

$p_{X,Y}(x, y) = P(X = x, Y = y)$ for discrete random variables $X$ and $Y$.

The joint PMF assigns a probability to each specific combination of values two discrete random variables can take simultaneously.

Joint Probability Density Function

A function $f_{X,Y}(x,y) \ge 0$ such that $P((X,Y) \in A) = \iint_A f_{X,Y}(x,y)\,dx\,dy$ for any region $A$.

The joint PDF describes how probability density is spread over the plane for two continuous random variables. Probabilities are obtained by integrating over regions, not by evaluating at points.

Conditional Probability Mass Function

$p_{X|Y}(x \mid y) = \frac{p_{X,Y}(x, y)}{p_Y(y)}$, defined when $p_Y(y) > 0$.

The conditional PMF gives the probability distribution of one discrete random variable when another discrete random variable is known to take a specific value.

Conditional Probability Density Function

$f_{X|Y}(x \mid y) = \frac{f_{X,Y}(x, y)}{f_Y(y)}$, defined when $f_Y(y) > 0$.

The conditional PDF describes the density of one continuous random variable when another continuous random variable is known to take a specific value.

Transformations

(3 items)

Function of a Random Variable

Given a random variable $X$ and a function $g$, $Y = g(X)$ defines a new random variable whose distribution is determined by the distribution of $X$ and the function $g$.

Applying a function to a random variable produces a new random variable. The challenge is determining the distribution of the result from the distribution of the original.

PDF of a Transformed Variable

If $Y = g(X)$ where $g$ is monotone and differentiable, then $f_Y(y) = f_X(g^{-1}(y)) \cdot \left|\frac{d}{dy} g^{-1}(y)\right|$.

When a continuous random variable is transformed, its density changes. The change-of-variables formula accounts for both the mapping of values and the stretching or compression of the density.
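As a worked example of the formula, take $X \sim \text{Exponential}(1)$ and $Y = g(X) = e^{-X}$, which is monotone decreasing with $g^{-1}(y) = -\ln y$ and $|\frac{d}{dy}g^{-1}(y)| = 1/y$. The formula then gives $f_Y(y) = e^{-(-\ln y)} \cdot \frac{1}{y} = 1$ on $(0,1)$, i.e. $Y$ is uniform:

```python
from math import exp, log

def f_X(x):
    """Density of an Exponential(1) random variable."""
    return exp(-x) if x >= 0 else 0.0

# Y = g(X) = e^(-X): monotone decreasing on [0, inf), with
# g_inverse(y) = -ln(y) and |d/dy g_inverse(y)| = 1/y for 0 < y < 1.
def f_Y(y):
    return f_X(-log(y)) * (1 / y)

# The change-of-variables formula says Y is uniform on (0, 1):
print([round(f_Y(y), 12) for y in (0.1, 0.5, 0.9)])  # [1.0, 1.0, 1.0]
```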

Moment Generating Function

$M_X(t) = E[e^{tX}]$, defined for real values of $t$ where the expectation exists.

The moment generating function encodes all moments of a random variable in a single function. The $k$-th moment is obtained by differentiating $k$ times and evaluating at $t = 0$.
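For a Bernoulli($p$) variable the MGF is $M_X(t) = (1-p) + p e^t$, and its derivative at $t = 0$ recovers the first moment $E[X] = p$. A sketch approximating that derivative with a central finite difference:

```python
from math import exp

p = 0.3  # Bernoulli(p), our illustrative choice

def mgf(t):
    """M_X(t) = E[e^(tX)] = (1 - p) + p * e^t for a Bernoulli(p) variable."""
    return (1 - p) + p * exp(t)

# First moment via a central finite difference of the MGF at t = 0:
h = 1e-6
first_moment = (mgf(h) - mgf(-h)) / (2 * h)
print(first_moment)  # ≈ E[X] = p = 0.3
```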

Set Operations

(6 items)

Venn Diagram

A graphical representation using overlapping circles to depict sets (events) and their relationships within a sample space.

Venn diagrams visualize how events overlap, combine, or exclude each other. They make set operations like union, intersection, and complement immediately visible.

Null Set

$\emptyset$ — the set containing no elements, representing an impossible event in probability.

The null set is the event that can never occur. Its probability is always zero: $P(\emptyset) = 0$.

Union of Sets

$A \cup B = \{\omega : \omega \in A \text{ or } \omega \in B\}$ — the event that at least one of $A$ or $B$ occurs.

The union combines two events into one that occurs whenever either event (or both) occurs.

Intersection of Sets

$A \cap B = \{\omega : \omega \in A \text{ and } \omega \in B\}$ — the event that both $A$ and $B$ occur simultaneously.

The intersection is the event where both conditions are met at the same time.

Disjoint Sets

Sets $A$ and $B$ are disjoint if $A \cap B = \emptyset$ — they share no common elements.

Disjoint sets have no overlap. In probability, disjoint events are mutually exclusive — they cannot both occur in the same trial.

Complement of a Set

$A^c = \{\omega \in \Omega : \omega \notin A\}$ — all outcomes in the sample space that are not in $A$.

The complement of an event is everything that can happen except that event. $P(A^c) = 1 - P(A)$.

Visual Tools

(1 item)

Probability Tree

A branching diagram where each node represents a stage of a sequential random process, branches represent possible outcomes, and branch labels show conditional probabilities.

A probability tree lays out a multi-stage experiment visually. Multiplying along a path gives joint probabilities; summing across paths reaching the same outcome gives marginal probabilities.
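The multiply-along-paths, sum-across-paths rule can be encoded directly. A sketch for a two-stage urn experiment of our own devising (3 red and 2 blue balls, two draws without replacement):

```python
from fractions import Fraction

# Two draws without replacement from an urn with 3 red and 2 blue balls.
# Stage-1 branches: red (3/5) or blue (2/5); stage-2 labels are conditional
# probabilities given the first draw. Each path probability is the product
# of the branch labels along that path.
paths = {
    ("R", "R"): Fraction(3, 5) * Fraction(2, 4),
    ("R", "B"): Fraction(3, 5) * Fraction(2, 4),
    ("B", "R"): Fraction(2, 5) * Fraction(3, 4),
    ("B", "B"): Fraction(2, 5) * Fraction(1, 4),
}

# Summing the paths whose second draw is red gives a marginal probability:
p_second_red = sum(p for (first, second), p in paths.items() if second == "R")
print(p_second_red)  # 3/5, the same as for the first draw
```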