A matrix is one of the most versatile objects in mathematics. It encodes systems of equations, represents linear transformations, stores data in structured form, and serves as the computational backbone of nearly every topic in linear algebra. Understanding what matrices are, how to read them, and how their parts relate to one another is the starting point for everything that follows.
What a Matrix Is
A matrix is a rectangular array of numbers arranged in rows and columns. The standard notation uses a capital letter for the matrix and a lowercase letter with two subscripts for its entries: the matrix A has entry aij in row i and column j. The shorthand A=(aij) means "the matrix whose (i,j) entry is aij."
The entries can be real numbers, complex numbers, or elements of any algebraic field. Throughout this site, entries are real unless explicitly stated otherwise.
Dimensions, Rows, and Columns
The size of a matrix is described by two numbers: m rows and n columns, written m×n. The notation A∈Rm×n states that A is an m×n matrix with real entries. The total number of entries is m⋅n. Order matters: a 3×5 matrix and a 5×3 matrix have different shapes and are never equal, regardless of their entries.
Row i of A is the horizontal slice (ai1,ai2,…,ain), a 1×n vector. Column j is the vertical slice (a1j,a2j,…,amj)T, an m×1 vector. The main diagonal consists of the entries where the row index equals the column index: a11,a22,…,akk with k=min(m,n). The diagonal is defined for any matrix, not just square ones, though it is most prominent in the square case.
A matrix with m=n is called square, and square matrices occupy a special position. Only square matrices can have a determinant, an inverse, eigenvalues, or a trace. A column vector in Rn is simply an n×1 matrix, a row vector is a 1×n matrix, and a scalar is a 1×1 matrix. Matrices unify all of these objects under a single framework.
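The conventions above can be checked concretely. Here is a short NumPy sketch (the use of NumPy is our choice for illustration, not part of the text):

```python
import numpy as np

# A 3x5 matrix: 3 rows, 5 columns, 15 entries in total
A = np.arange(15).reshape(3, 5)

m, n = A.shape           # (3, 5)
assert A.size == m * n   # total number of entries is m*n

# A 3x5 matrix and a 5x3 matrix have different shapes
assert A.shape != A.reshape(5, 3).shape

# Column vector (3x1), row vector (1x3), and scalar (1x1)
# are all matrices under this framework
col = np.array([[1.0], [2.0], [3.0]])   # 3x1
row = np.array([[1.0, 2.0, 3.0]])       # 1x3
scalar = np.array([[7.0]])              # 1x1
assert col.shape == (3, 1) and row.shape == (1, 3) and scalar.shape == (1, 1)
```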
Matrix Equality and the Zero Matrix
Two matrices A and B are equal if and only if they have the same dimensions and every pair of corresponding entries matches: aij=bij for all i and j. A single mismatched entry makes the matrices unequal. If the dimensions differ, the matrices are never equal — a 2×3 matrix cannot equal a 3×2 matrix no matter what numbers they contain.
The zero matrix is the m×n matrix whose every entry is zero, written O or 0m×n. It serves as the additive identity: A+O=A for any matrix A of the same size. Strictly speaking, there is a different zero matrix for each pair (m,n), but the same symbol is used for all of them, with the dimensions understood from context.
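Both definitions translate directly into code. A minimal NumPy sketch (library choice is ours):

```python
import numpy as np

A = np.array([[1, 2, 3],
              [4, 5, 6]])
B = np.array([[1, 2, 3],
              [4, 5, 6]])
C = np.array([[1, 2, 3],
              [4, 5, 0]])   # one mismatched entry

# Equality requires the same dimensions and entrywise agreement
assert np.array_equal(A, B)
assert not np.array_equal(A, C)

# The zero matrix of matching size is the additive identity
O = np.zeros_like(A)
assert np.array_equal(A + O, A)
```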
Matrices as Collections of Vectors
An m×n matrix can be viewed as a collection of n column vectors in Rm, arranged side by side:
A = [ a1 | a2 | ⋯ | an ]
Equivalently, it is a stack of m row vectors in Rn. Both perspectives are useful, and choosing the right one often simplifies a problem considerably. The column view connects the matrix to concepts like span, linear independence, and column space. The row view connects it to systems of equations and row space.
The column perspective also gives a powerful interpretation of the matrix-vector product. If x=(x1,x2,…,xn)T, then
Ax=x1a1+x2a2+⋯+xnan
The product Ax is a linear combination of the columns of A, weighted by the entries of x. This single observation underlies the theory of linear systems, transformations, and virtually everything else involving matrices.
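This identity is easy to verify numerically. A small NumPy sketch (the specific matrix and vector are illustrative):

```python
import numpy as np

A = np.array([[1.0, 4.0],
              [2.0, 5.0],
              [3.0, 6.0]])   # columns a1 and a2
x = np.array([10.0, -1.0])

# Ax equals the linear combination x1*a1 + x2*a2 of the columns
combo = x[0] * A[:, 0] + x[1] * A[:, 1]
assert np.allclose(A @ x, combo)
```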
Matrix Arithmetic at a Glance
Matrices support several operations, each with its own rules and dimension requirements.
Addition is entry-by-entry: (A+B)ij=aij+bij. Both matrices must have the same dimensions. Scalar multiplication scales every entry: (cA)ij=c⋅aij. These two operations together give the set of all m×n matrices the structure of a vector space of dimension mn.
Matrix multiplication is more involved. For A of size m×n and B of size n×p, the product AB has size m×p, with each entry computed as the dot product of a row of A with a column of B: (AB)ij = ai1b1j + ai2b2j + ⋯ + ainbnj. The number of columns of A must equal the number of rows of B; otherwise the product is undefined.
The transpose AT swaps rows and columns: (AT)ij=aji. An m×n matrix becomes n×m.
One property that distinguishes matrix arithmetic from ordinary arithmetic is that multiplication is not commutative. In general, AB≠BA, even when both products are defined. This asymmetry has far-reaching consequences throughout linear algebra.
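The operations above, including a concrete failure of commutativity, can be sketched in NumPy (the matrices are chosen only to make the point visible):

```python
import numpy as np

A = np.array([[0.0, 1.0],
              [0.0, 0.0]])
B = np.array([[1.0, 0.0],
              [0.0, 2.0]])

# Addition and scalar multiplication are entrywise
assert np.array_equal(A + B, B + A)
assert np.array_equal(3 * A, A + A + A)

# Multiplication: both products are defined here, yet AB != BA
AB = A @ B   # [[0, 2], [0, 0]]
BA = B @ A   # [[0, 1], [0, 0]]
assert not np.array_equal(AB, BA)

# Transpose swaps rows and columns: an m x n matrix becomes n x m
M = np.arange(6).reshape(2, 3)
assert M.T.shape == (3, 2)
```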
Special Matrix Shapes
Certain structural patterns appear so frequently that they have their own names and dedicated theory. A diagonal matrix has nonzero entries only on the main diagonal, making its arithmetic trivially simple — products, powers, and inverses all reduce to operations on the diagonal entries alone. The identity matrix I is the diagonal matrix with every diagonal entry equal to 1, serving as the multiplicative identity: AI=IA=A.
A symmetric matrix satisfies A=AT, meaning it is unchanged by transposition. Symmetric matrices have the remarkable property that all their eigenvalues are real and their eigenvectors can be chosen to be mutually orthogonal. A triangular matrix has all entries either above or below the diagonal equal to zero, making its determinant and eigenvalues readable directly from the diagonal.
An orthogonal matrix satisfies QTQ=I, meaning its columns form an orthonormal set and its transpose is its inverse. Orthogonal matrices preserve lengths and angles, making them the algebraic counterpart of rotations and reflections.
These and several other types — including skew-symmetric, nilpotent, idempotent, and permutation matrices — each carry structural guarantees that simplify computation and deepen understanding.
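A few of these structural guarantees can be checked directly. A NumPy sketch (the particular diagonal entries and rotation angle are arbitrary choices):

```python
import numpy as np

I = np.eye(3)
D = np.diag([2.0, 3.0, 5.0])
A = np.array([[1.0, 2.0, 0.0],
              [0.0, 1.0, 4.0],
              [7.0, 0.0, 1.0]])

# The identity is the multiplicative identity: AI = IA = A
assert np.allclose(A @ I, A) and np.allclose(I @ A, A)

# Powers of a diagonal matrix reduce to powers of the diagonal entries
assert np.allclose(np.linalg.matrix_power(D, 3), np.diag([8.0, 27.0, 125.0]))

# A rotation matrix is orthogonal: Q^T Q = I
t = 0.3
Q = np.array([[np.cos(t), -np.sin(t)],
              [np.sin(t),  np.cos(t)]])
assert np.allclose(Q.T @ Q, np.eye(2))
```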
The Inverse of a Matrix
A square matrix A is called invertible if there exists a matrix A−1 satisfying AA−1=A−1A=I. When it exists, the inverse is unique and effectively "undoes" the action of A: if A maps x to b, then A−1 maps b back to x.
Not every square matrix has an inverse. The dividing line is the determinant: A is invertible if and only if det(A)≠0. For a 2×2 matrix

A = [ a  b
      c  d ]

the inverse has the explicit formula

A−1 = 1/(ad−bc) · [  d  −b
                    −c   a ]
which breaks down precisely when ad−bc=0. For larger matrices, the inverse can be computed by row reduction or through the adjugate formula, though in practice solving Ax=b directly is almost always more efficient than computing A−1.
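The 2×2 formula can be verified against a direct numerical inverse. A NumPy sketch (the matrix is an arbitrary invertible example):

```python
import numpy as np

A = np.array([[4.0, 7.0],
              [2.0, 6.0]])

# Explicit 2x2 inverse: (1/(ad-bc)) * [[d, -b], [-c, a]]
a, b = A[0, 0], A[0, 1]
c, d = A[1, 0], A[1, 1]
det = a * d - b * c          # 10.0, nonzero, so A is invertible
A_inv = (1.0 / det) * np.array([[ d, -b],
                                [-c,  a]])

assert np.allclose(A @ A_inv, np.eye(2))
assert np.allclose(A_inv, np.linalg.inv(A))
```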
Rank and Trace
Two scalar quantities extracted from a matrix appear throughout linear algebra.
The rank of an m×n matrix is the number of linearly independent rows, which always equals the number of linearly independent columns. It measures the "effective dimensionality" of the matrix — how many of its rows or columns carry genuinely new information. The rank satisfies 0≤rank(A)≤min(m,n). When rank(A)=min(m,n), the matrix is said to have full rank, meaning no row or column is redundant.
The trace is defined only for square matrices: tr(A)=a11+a22+⋯+ann, the sum of the diagonal entries. Despite its simplicity, the trace encodes deep information. It equals the sum of the eigenvalues, it is invariant under changes of basis, and it satisfies the cyclic property tr(AB)=tr(BA), which makes it a fundamental tool in both theoretical and applied contexts.
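Both quantities are one-liners in practice. A NumPy sketch (the matrices are chosen to make the rank deficiency and the cyclic property easy to see):

```python
import numpy as np

# Rank: the second row is twice the first, so only one row
# carries genuinely new information
A = np.array([[1.0, 2.0, 3.0],
              [2.0, 4.0, 6.0]])
assert np.linalg.matrix_rank(A) == 1

# Trace: sum of the diagonal entries of a square matrix
M = np.array([[1.0, 2.0],
              [3.0, 4.0]])
assert np.trace(M) == 5.0

# Cyclic property: tr(AB) = tr(BA), even though AB is 2x2 and BA is 3x3
B = np.array([[1.0, 0.0],
              [2.0, 1.0],
              [0.0, 3.0]])
assert np.isclose(np.trace(A @ B), np.trace(B @ A))
```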
Matrices and Systems of Equations
A system of m linear equations in n unknowns can be written compactly as
Ax=b
where A is the m×n coefficient matrix, x is the n×1 vector of unknowns, and b is the m×1 vector of right-hand sides. The augmented matrix [A∣b] appends b as an extra column, creating the compact representation used in Gaussian elimination.
Whether the system has no solution, exactly one solution, or infinitely many solutions depends entirely on the rank of A relative to the rank of the augmented matrix [A∣b]. When A is square and invertible, the unique solution is x=A−1b. When A is rectangular or singular, the analysis requires the rank and the structure of the null space.
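For the square invertible case, solving directly is the standard route. A NumPy sketch of a small system (the particular equations are illustrative):

```python
import numpy as np

# The system:  x + 2y = 5,  3x + 4y = 11
A = np.array([[1.0, 2.0],
              [3.0, 4.0]])
b = np.array([5.0, 11.0])

# Solving Ax = b directly is preferred over forming the inverse
x = np.linalg.solve(A, b)
assert np.allclose(A @ x, b)

# For an invertible A, this agrees with x = A^{-1} b
assert np.allclose(x, np.linalg.inv(A) @ b)
```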
Matrices as Linear Transformations
Every m×n matrix A defines a function from Rn to Rm by the rule x↦Ax. This function is a linear transformation: it preserves addition (A(x+y)=Ax+Ay) and scalar multiplication (A(cx)=cAx).
The columns of A reveal exactly what the transformation does to the standard basis. The first column a1 is the image of e1, the second column a2 is the image of e2, and so on. Once the images of the basis vectors are known, the image of any vector follows by linearity.
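This reading of the columns is easy to confirm. A NumPy sketch (the transformation matrix is an arbitrary example):

```python
import numpy as np

A = np.array([[2.0, 0.0],
              [1.0, 3.0]])

e1 = np.array([1.0, 0.0])
e2 = np.array([0.0, 1.0])

# The columns of A are the images of the standard basis vectors
assert np.allclose(A @ e1, A[:, 0])
assert np.allclose(A @ e2, A[:, 1])

# The image of any vector then follows by linearity
x = np.array([4.0, -1.0])
assert np.allclose(A @ x, 4.0 * A[:, 0] - 1.0 * A[:, 1])
```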
When A is square and invertible, the transformation is bijective — every output has exactly one input, and the inverse transformation is given by A−1. When A is singular, the transformation collapses at least one dimension, mapping Rn onto a proper subspace of Rm. The rank of A is the dimension of this image, and the null space captures everything that gets sent to zero.
This perspective transforms matrices from static tables of numbers into active geometric objects that rotate, stretch, compress, reflect, and project.