Ch.12

Collaborative Filtering: Recommendation Basics

Have you ever seen 'You might also like' on Netflix? Collaborative filtering recommends items that users with similar tastes liked. This chapter covers the rating matrix, similarity, neighbor-based prediction, and how it is used in practice.

Select a chapter to see its diagram below. View the machine learning flow at a glance.

From the user-item rating matrix, find similar users (neighbors) and predict missing entries using their ratings.

\hat{r}_{u,i}=\frac{5+4}{2}=4.5\approx4

\hat{r}_{u,i}

Recommendation basics: Collaborative filtering

What is collaborative filtering? — It uses other users' behavior (ratings, clicks, purchases) to recommend items to you. The idea is that people with similar tastes tend to like similar things. It is widely used in streaming, e-commerce, and music apps.

Intuition: borrowing from neighbors — For movie recommendations, if someone who liked the same movies A and B as you also liked C, you might like C. Those similar users are neighbors, and predicting from their ratings is the core of collaborative filtering.

u

In practice — Cold start (new users/items have no neighbors) and sparsity make pure collaborative filtering hard, so it is often combined with content-based methods or matrix factorization .

Recommendations drive business and UX — Good recommendations increase engagement and revenue. Collaborative filtering personalizes results using behavior data alone, without rich metadata.

Core ML application — Recommendation is a different kind of problem: we fill in missing entries of a matrix. Understanding collaborative filtering is a step toward matrix factorization and deep learning-based recommenders.

User-based vs item-based — User-based : find users similar to you and recommend what they liked. Item-based : find items similar to the one you are viewing ('Users who bought this also bought'). Both use similarity and neighbors.

s_{u,v}

Matrix factorization — Advanced methods approximate the rating matrix by a product of lower-rank matrices. Hybrid systems combine collaborative filtering with content or context.

\hat{r}_{u,i}