Ch.03
Matrices and Data: Structural Representation of Many Vectors
[Diagram] Purple columns highlight in turn. Each column is a vector of the same length; placing three columns side by side forms one matrix. The right panel shows which column is active. Reading by rows gives one line per sample (a common data convention).
A matrix is a rectangular grid of numbers: one sheet. In machine learning, one row is often one sample (one person, one image) and one column is one feature. This chapter connects vectors (Ch.01) and dot products (Ch.02) to their batched appearance inside a matrix, and sets up matrix multiplication and linear layers (Ch.04).
Matrices and data batches: putting many vectors on one sheet
If a vector is numbers in a line, a matrix stacks several such lines into a rectangle. Size $m \times n$ means $m$ rows and $n$ columns. Notation varies (sometimes rows are samples, sometimes columns), so always check the shape first.
Think of a matrix as one spreadsheet: each cell is a number; a whole column can be one feature vector; a whole row can be one record. The same table changes meaning depending on which direction you read.
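The two reading directions can be shown in a few lines of NumPy (the array values here are made up for illustration):

```python
import numpy as np

# A small 3x2 table: 3 samples (rows), 2 features (columns).
X = np.array([[1.0, 10.0],
              [2.0, 20.0],
              [3.0, 30.0]])

row_0 = X[0]      # reading a row: one record (sample)
col_1 = X[:, 1]   # reading a column: one feature across all samples

print(X.shape)    # (3, 2)
print(row_0)      # [ 1. 10.]
print(col_1)      # [10. 20. 30.]
```

The same array yields a record or a feature vector depending only on which axis you slice.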
Core facts:
1. Shape: $A \in \mathbb{R}^{m \times n}$ means $m$ rows and $n$ columns of real entries.
2. Entries: the value at row $i$, column $j$ is $a_{ij}$.
3. Transpose: $A^{\top}$ is $n \times m$ with $(A^{\top})_{ij} = a_{ji}$.
4. Columns as vectors: the columns can be written $A = [\,\mathbf{a}_1 \; \mathbf{a}_2 \; \cdots \; \mathbf{a}_n\,]$.
5. Add / scale: for the same shape, $(A + B)_{ij} = a_{ij} + b_{ij}$ and $(cA)_{ij} = c\,a_{ij}$.
This chapter focuses on reading stacked vectors safely and matching dimensions before full matrix multiplication.
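The core facts above can be checked directly in NumPy; this is a minimal sketch with made-up entries:

```python
import numpy as np

A = np.array([[1, 2, 3],
              [4, 5, 6]])       # shape (2, 3): 2 rows, 3 columns

assert A.shape == (2, 3)        # fact 1: shape
assert A[0, 2] == 3             # fact 2: entry at row 0, column 2 (0-indexed)

At = A.T                        # fact 3: transpose has shape (3, 2)
assert At.shape == (3, 2)
assert At[2, 0] == A[0, 2]      # (A^T)_{ij} = a_{ji}

cols = [A[:, j] for j in range(A.shape[1])]   # fact 4: columns as vectors

B = np.ones((2, 3), dtype=int)
S = A + B                       # fact 5: addition needs matching shapes
C = 2 * A                       # scaling multiplies every entry
```

Adding arrays of mismatched shape raises an error (or silently broadcasts), which is why checking shapes first is the habit this chapter drills.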
In deep learning, weights are often matrices (or 2D slices of tensors). One layer’s linear map is “many dot products at once”; batching stacks samples along a row/column. In machine learning, the design matrix stacks feature vectors into one data table.
One‑line summary: a matrix bundles many vectors on one sheet; whether rows or columns are samples is a matter of convention. The transpose swaps axes to match dimensions. Row dot products from Ch.02 become the coordinates of $A\mathbf{x}$. Next: matrix multiplication and linear maps.
Ch.01 gave vectors; Ch.02 gave dot products for one interaction. Ch.03 extends that interaction to whole tables. Matrices are the language of losses, gradients, and weight updates.
Real data is usually many samples × many features. Stating the shape ($m$ samples $\times$ $n$ features) makes the layout explicit; wrong shapes silently break code.
Training data is often a design matrix; linear models are written as matrix–vector products. Logistic/softmax, linear SVM, and matrix‑factorization recommendations all use batched vector operations.
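A design matrix and a batched linear model fit in a few lines; this sketch uses invented numbers:

```python
import numpy as np

# Hypothetical design matrix: 4 samples (rows) x 3 features (columns).
X = np.array([[1., 0., 2.],
              [0., 1., 1.],
              [2., 1., 0.],
              [1., 1., 1.]])
w = np.array([0.5, -1.0, 2.0])  # one weight per feature

# One matrix-vector product computes all 4 predictions at once,
# instead of looping over samples one dot product at a time.
y = X @ w
print(y)   # [4.5 1.  0.  1.5]
```

This is the "many dot products at once" view: logistic regression, linear SVMs, and similar models all score a batch this way.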
Columns span a subspace (column space); fitting data to a lower dimension is projection to a subspace (later chapters).
The table below lists symbols and dimension rules for problem solving. Worked patterns illustrate typical steps.
| Symbol | Meaning |
|---|---|
| $A \in \mathbb{R}^{m \times n}$ | $m$ rows and $n$ columns |
| $a_{ij}$ | entry at row $i$, column $j$ |
| $A^{\top}$ | transpose: $(A^{\top})_{ij} = a_{ji}$ |
| $\mathbf{a}_j$ | column $j$ of $A$ as a vector |
| $A + B$ | defined only if $A$ and $B$ have the same shape |
| $A\mathbf{x}$ (preview) | vector of row–$\mathbf{x}$ dot products |
Details
① Shape: $A \in \mathbb{R}^{m \times n}$. Always check dimensions before add/multiply.
② Transpose: $A^{\top}$ swaps sample/feature axes when needed.
③ Row/column view: meaning depends on the problem setup.
④ Ch.02 link: each row of $A$ dotted with $\mathbf{x}$ gives one coordinate of $A\mathbf{x}$.
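Point ④ can be verified numerically; a minimal sketch with made-up values:

```python
import numpy as np

A = np.array([[1., 2.],
              [3., 4.],
              [5., 6.]])
x = np.array([10., 1.])

Ax = A @ x
# Each coordinate of Ax is one row of A dotted with x (the Ch.02 operation).
for i in range(A.shape[0]):
    assert Ax[i] == A[i] @ x

print(Ax)   # [12. 34. 56.]
```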
Practice problems
Below are 10 problems sampled from a bank of 60 (4 easy · 3 medium · 3 hard; order easy→medium→hard). Each item is multiple choice—pick the option number.