Everyone's AI


Ch.06

Logistic Regression: Pass or Fail?

Where linear regression predicts a 'score', logistic regression is the specialist for yes/no classification—e.g. "Will this score mean pass
(1) or fail (0)?" It uses the sigmoid function to turn a score into a probability between 0 and 1.


The larger the linear score $z$, the closer $\sigma(z)$ is to 1, so we classify as class 1. $z = 0$ is the decision boundary.

[Figure: sigmoid curve $\sigma(z)$ plotted against the linear score $z$]

Sigmoid: $\sigma(z) = \frac{1}{1+e^{-z}}$. When $z > 0$, $\hat{y} = 1$; when $z \le 0$, $\hat{y} = 0$.

How to read the formula — When $z$ is large and negative, $e^{-z}$ is large, so $\sigma(z) \approx 0$. When $z = 0$, $\sigma(0) = 0.5$. When $z$ is large and positive, $e^{-z} \approx 0$, so $\sigma(z) \approx 1$. So the formula squeezes any $z$ into a probability between 0 and 1.
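To make the walkthrough concrete, here is a minimal Python sketch of the sigmoid; the three calls mirror the three cases above (large negative, zero, large positive):

```python
import math

def sigmoid(z: float) -> float:
    """Squeeze any real score z into a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

print(sigmoid(-10))  # ≈ 0.000045 — large negative z gives a probability near 0
print(sigmoid(0))    # 0.5 — z = 0 sits exactly on the decision boundary
print(sigmoid(10))   # ≈ 0.999955 — large positive z gives a probability near 1
```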

Logistic Regression: Pass or Fail?

The S-curve: sigmoid — The score $z$ from a linear model can be any real number, positive or negative, but probabilities must lie between 0 and 1. The sigmoid $\sigma(z) = \frac{1}{1+e^{-z}}$ maps any real $z$ into (0, 1).
Decision boundary — When the sigmoid outputs, say, "probability of pass = 0.7", we still need a rule to pick a class. The usual threshold is 0.5: if the probability is ≥ 0.5 we predict 1 (yes), otherwise 0 (no).
Same core as linear regression — Logistic regression still computes a linear score $z = wx + b$ first; the only difference is passing that score through the sigmoid to get a probability.
How to read $\sigma(z) = \frac{1}{1+e^{-z}}$ — When $z$ is large and negative, $e^{-z}$ is large, so $\sigma(z) \approx 0$. When $z = 0$, $\sigma(0) = 0.5$. When $z$ is large and positive, $e^{-z} \approx 0$, so $\sigma(z) \approx 1$. Any $z$ is squeezed into a probability in (0, 1).
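The three points above can be sketched as one tiny end-to-end predictor. The weights w = 1.2 and b = -6.0 below are made up for illustration (a real model would learn them from data), with x standing for study hours:

```python
import math

def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))

def predict(x: float, w: float, b: float, threshold: float = 0.5):
    """Linear score -> probability -> class label."""
    z = w * x + b                        # same linear score as linear regression
    p = sigmoid(z)                       # squeeze the score into (0, 1)
    label = 1 if p >= threshold else 0   # apply the decision boundary
    return p, label

# Made-up weights: probability of passing given study hours.
w, b = 1.2, -6.0
for hours in [2, 5, 8]:
    p, label = predict(hours, w, b)
    print(f"{hours}h -> p(pass) = {p:.2f}, predict {'pass' if label else 'fail'}")
```

Note that 5 hours gives z = 0 exactly, i.e. p = 0.5, landing right on the decision boundary.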

Why it matters

Many real problems are yes/no — Spam or not? Disease or not? Will the user buy? Binary classification is everywhere; logistic regression is the standard baseline.
Confidence as a number — Saying "pass with 98% probability" is more useful than just "pass". Logistic regression gives a probability, which supports better decisions.
Bridge to deep learning — A single neuron in a neural network behaves much like logistic regression. Mastering this makes deep learning easier later.

How it is used

Spam filter — Compute "probability this email is spam" from features; if above a threshold, send to spam.
Medical AI — From X-rays or lab values, predict "probability of disease" to support diagnosis.
Marketing and recommendations — Predict "will this user churn?" or "will they click?" for targeting and ads.
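The spam-filter case can be sketched in a few lines, assuming a trained linear model has already produced a spam score z per email (the scores and the threshold below are invented for illustration). Note the threshold is raised above 0.5 here, since sending a real email to spam is costlier than letting one spam through:

```python
import math

def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))

# Hypothetical per-email linear scores from some trained model.
email_scores = {"newsletter": -1.0, "obvious_spam": 4.0, "borderline": 0.3}

THRESHOLD = 0.8  # stricter than 0.5 to avoid false positives

for name, z in email_scores.items():
    p = sigmoid(z)
    action = "spam folder" if p >= THRESHOLD else "inbox"
    print(f"{name}: p(spam) = {p:.2f} -> {action}")
```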