Ch.02

Dot Product and Projection: Angle and Similarity Between Data


[Interactive diagram: a plane showing the base vector u, a rotating vector v, v's shadow (projection) onto u, and the residual perpendicular to u, with live readouts for v's direction, u·v, cos θ (direction), and |proj| / |v|.]

As the green vector $\mathbf{v}$ rotates, $\theta$ changes, and the amber shadow (projection) length, the dot product, and $\cos\theta$ move together. Nearer the same direction → larger dot product; orthogonal → $0$; opposite → negative. The small circle shows only $\mathbf{v}$'s direction.
The dot product compresses “how aligned two vectors are” into a single number. An orthogonal projection moves one vector onto the line (or subspace) spanned by another, like a shadow. Building on $\mathbb{R}^n$ from Ch.01, this chapter trains you to read similarity, angles, and distance in the language of dot products, and connects naturally to similarity, attention, and linear layers in ML and deep learning.

Dot Product and Orthogonal Projection: Measuring Similarity with Numbers

The dot product is the “multiply matching entries and add” rule from Ch.01 folded into one number. Geometrically it equals $\|\mathbf{u}\|\|\mathbf{v}\|\cos\theta$. A projection onto a direction is the shadow vector you get by scaling that direction by a dot-product coefficient.
In plain words, the dot product scores how much two arrows point the same way. Same direction → large positive; perpendicular → $0$; opposite → negative. Think of projection as the shadow a flashlight casts on a wall.
Here are the core formulas.
1. Dot product: $\mathbf{u}\cdot\mathbf{v} = \|\mathbf{u}\|\|\mathbf{v}\|\cos\theta$ (uses both lengths and the angle $\theta$ between the vectors)
2. Cosine similarity: $\cos\theta = \dfrac{\mathbf{u}\cdot\mathbf{v}}{\|\mathbf{u}\|\|\mathbf{v}\|}$ (compares pure directional similarity when lengths differ)
3. Orthogonal projection: $\mathrm{proj}_{\mathbf{u}}\mathbf{v}$ (the shadow of $\mathbf{v}$ onto the line in the direction of $\mathbf{u}$)
4. Unit vector: the hat on $\hat{\mathbf{u}}$ usually means “focus on direction.” A unit vector is an arrow with length 1 ($\|\hat{\mathbf{u}}\| = 1$), so length is fixed and only which way it points matters. The shadow of $\mathbf{v}$ onto $\hat{\mathbf{u}}$ can then be written in one step as $(\mathbf{v}\cdot\hat{\mathbf{u}})\,\hat{\mathbf{u}}$. The number $\mathbf{v}\cdot\hat{\mathbf{u}}$ is a single alignment score; the length of the shadow is its magnitude $|\mathbf{v}\cdot\hat{\mathbf{u}}|$. (If the score is negative, the shadow points the opposite way along that line; for length we take the absolute value.)
Here $\|\mathbf{u}\|$ and $\|\mathbf{v}\|$ are norms (lengths). Cosine similarity divides by the product of those lengths, so magnitude cancels out and only direction remains.
These formulas may look dense, but they are just how a computer scores “how similar” two vectors are.
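To make the formulas concrete, here is a minimal NumPy sketch; the vectors are arbitrary examples chosen for illustration:

```python
import numpy as np

u = np.array([3.0, 1.0])
v = np.array([2.0, 4.0])

dot = u @ v                                    # sum of products: 3*2 + 1*4 = 10
cos_theta = dot / (np.linalg.norm(u) * np.linalg.norm(v))  # direction-only score
proj = (dot / (u @ u)) * u                     # shadow of v on the line along u
residual = v - proj                            # leftover component of v

u_hat = u / np.linalg.norm(u)                  # unit vector: same direction, length 1
proj_via_hat = (v @ u_hat) * u_hat             # one-step shadow formula

print(np.allclose(proj, proj_via_hat))         # True: both give the same shadow
print(np.isclose(residual @ u, 0.0))           # True: the residual is orthogonal to u
```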
In deep learning, each linear layer is built from dot products between rows of weights and the input. Attention uses query–key dot products (scores) to decide where to look. Recommendation systems use dot products or cosines between user and item embeddings.
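As an illustration of the first two points, here is a minimal NumPy sketch with toy sizes (not any particular framework's API):

```python
import numpy as np

rng = np.random.default_rng(0)

# Linear layer: each output entry is the dot product of one weight row with x.
W = rng.standard_normal((3, 4))      # 3 output units, 4 input features
x = rng.standard_normal(4)
z = W @ x
print(np.allclose(z[0], W[0] @ x))   # True: entry 0 is row 0's dot product with x

# Attention scores: one dot product per query-key pair, softmaxed over keys.
Q = rng.standard_normal((2, 4))      # 2 queries of dimension 4
K = rng.standard_normal((3, 4))      # 3 keys of dimension 4
scores = Q @ K.T / np.sqrt(4)        # scaled query-key dot products
weights = np.exp(scores)
weights /= weights.sum(axis=1, keepdims=True)
print(weights.sum(axis=1))           # each query's weights sum to 1
```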
Summary: the dot product is the “sum of products of components” and couples length and angle; projection is the shadow along a direction; cosine isolates direction; projections pair with orthogonal residuals. Ch.03's matrices bundle many dot products at once.
After Ch.01’s “boxes of numbers,” this chapter is the rule that pairs boxes to make one score. That score becomes the common language for distance, angle, and similarity before matrices, eigenvalues, and optimization.
To make “similar” precise you need a measure. Dot products and cosines separate direction from magnitude in high dimensions and tie directly to preprocessing (e.g., normalization).
Machine learning: similarity for kNN, kernels, and linear/logistic terms $\mathbf{w}\cdot\mathbf{x}$; outliers may show up as small dot products or large angles.
Geometry: least-squares fits as projection onto the column space; PCA and orthogonal bases; Gram–Schmidt subtracts projections to orthogonalize (see the sketch below).
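Here is a minimal classical Gram–Schmidt sketch in NumPy; it ignores the numerical-stability refinements a production routine would need:

```python
import numpy as np

def gram_schmidt(vectors):
    """Orthonormalize vectors by repeatedly subtracting projections."""
    basis = []
    for v in vectors:
        w = v.astype(float)
        for q in basis:
            w = w - (w @ q) * q      # subtract the shadow of w along q (q has length 1)
        norm = np.linalg.norm(w)
        if norm > 1e-12:             # skip (near-)linearly-dependent vectors
            basis.append(w / norm)
    return basis

B = gram_schmidt([np.array([3.0, 1.0]), np.array([2.0, 2.0])])
print(B[0] @ B[1])  # ≈ 0: the resulting basis vectors are orthogonal
```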
The table below summarizes the formulas and symbol meanings needed for the problems, followed by item-by-item notes on why each definition is set up that way. Worked examples then walk through representative problem types step by step.
| Formula | Meaning |
| --- | --- |
| $\mathbf{u}\cdot\mathbf{v}$ | Sum of products of matching components; the result is a scalar |
| $\|\mathbf{u}\|$ | Euclidean norm (length), $\sqrt{\mathbf{u}\cdot\mathbf{u}}$ |
| $\cos\theta$ | $\dfrac{\mathbf{u}\cdot\mathbf{v}}{\|\mathbf{u}\|\|\mathbf{v}\|}$, the cosine of the angle between the vectors (exclude zero vectors) |
| $\mathrm{proj}_{\mathbf{u}}\mathbf{v}$ | Projection of $\mathbf{v}$ onto the line spanned by $\mathbf{u}$ |
| $\mathbf{v}-\mathrm{proj}_{\mathbf{u}}\mathbf{v}$ | Residual after projection; always orthogonal to $\mathbf{u}$ |
| Unit $\hat{\mathbf{u}}$ | $\|\hat{\mathbf{u}}\| = 1$; shadow length $= \lvert\mathbf{v}\cdot\hat{\mathbf{u}}\rvert$ |
Notes on each row
① $\mathbf{u}\cdot\mathbf{v}$: multiply matching entries and add. The result is a scalar, not another vector. In $\mathbb{R}^2$ this is $u_x v_x + u_y v_y$.
② $\|\mathbf{u}\|$: defined as $\sqrt{\mathbf{u}\cdot\mathbf{u}}$, the Euclidean length.
③ $\cos\theta$: for the angle $\theta$ between the vectors, $\dfrac{\mathbf{u}\cdot\mathbf{v}}{\|\mathbf{u}\|\|\mathbf{v}\|}$. Keep denominators nonzero (avoid zero vectors). Same direction → near $1$; orthogonal → $0$; opposite → negative.
④ $\mathrm{proj}_{\mathbf{u}}\mathbf{v}$: for $\mathbf{u}\neq\mathbf{0}$, equals $\dfrac{\mathbf{u}\cdot\mathbf{v}}{\mathbf{u}\cdot\mathbf{u}}\,\mathbf{u}$. Think of the shadow of $\mathbf{v}$ on the line along $\mathbf{u}$.
⑤ $\mathbf{v}-\mathrm{proj}_{\mathbf{u}}\mathbf{v}$: the residual; always orthogonal to $\mathbf{u}$, and $\mathbf{v} = \mathrm{proj}_{\mathbf{u}}\mathbf{v} + (\mathbf{v}-\mathrm{proj}_{\mathbf{u}}\mathbf{v})$ is an orthogonal decomposition.
⑥ Unit $\hat{\mathbf{u}}$: if $\|\hat{\mathbf{u}}\| = 1$, then $(\mathbf{v}\cdot\hat{\mathbf{u}})\hat{\mathbf{u}}$ is the projection and $|\mathbf{v}\cdot\hat{\mathbf{u}}|$ is the shadow length along $\hat{\mathbf{u}}$.
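A quick worked check of rows ④ and ⑤, with $\mathbf{u} = (2, 0)$ and $\mathbf{v} = (3, 4)$ as example vectors:

$$\mathbf{u}\cdot\mathbf{v} = 2\cdot 3 + 0\cdot 4 = 6,\qquad \mathrm{proj}_{\mathbf{u}}\mathbf{v} = \frac{6}{\mathbf{u}\cdot\mathbf{u}}\,\mathbf{u} = \frac{6}{4}(2, 0) = (3, 0),\qquad \mathbf{v}-\mathrm{proj}_{\mathbf{u}}\mathbf{v} = (0, 4),$$

and $(2, 0)\cdot(0, 4) = 0$ confirms the residual is orthogonal to $\mathbf{u}$.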

Practice problems

Below are 10 problems sampled from a bank of 60 (4 easy, 3 medium, 3 hard; ordered easy → medium → hard). Each item is multiple choice; pick the option number.

In logistic regression with $z = \mathbf{w}\cdot\mathbf{x} + b$, what does $\mathbf{w}\cdot\mathbf{x}$ mainly encode?