Everyone's AI
Ch.21

GAN Basics: Generator vs Discriminator

GAN: make and tell apart

Like a counterfeiter and an expert keeping each other sharp.

Real photos and fake images generated from noise both enter the discriminator, which must label each as real or fake. First, get clear on who makes (G) and who judges (D).

Training flow

  1. ① Real images: Take one real example x from the training data.
  2. ② Random noise: Sample noise z that decides what to generate.
  3. ③ Generator: Turn z into a fake sample x̂.
  4. ④ Discriminator: Look at x and x̂ and tell real vs fake.
  5. ⑤ Take turns: Update G and D one after another, a little at a time.
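The five steps above can be sketched as a tiny 1-D GAN in plain NumPy. This is an illustration, not a real architecture: the "networks" are single affine maps, the real data is just N(4, 1), and the generator uses the common non-saturating trick of maximizing log D(G(z)) instead of minimizing log(1 − D(G(z))).

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(t):
    return 1.0 / (1.0 + np.exp(-np.clip(t, -30.0, 30.0)))

def G(z, theta):
    """Toy generator: fake = theta[0] * z + theta[1]."""
    return theta[0] * z + theta[1]

def D(x, phi):
    """Toy discriminator: P(real) = sigmoid(phi[0] * x + phi[1])."""
    return sigmoid(phi[0] * x + phi[1])

theta = np.array([1.0, 0.0])   # generator parameters
phi = np.array([0.5, 0.0])     # discriminator parameters
lr = 0.05

for step in range(500):
    x = rng.normal(4.0, 1.0, size=64)      # ① real samples ~ N(4, 1)
    z = rng.normal(0.0, 1.0, size=64)      # ② noise
    x_hat = G(z, theta)                    # ③ fakes

    # ④ discriminator step: gradient ascent on E[log D(x)] + E[log(1 - D(G(z)))]
    d_real, d_fake = D(x, phi), D(x_hat, phi)
    phi += lr * np.array([
        np.mean((1.0 - d_real) * x) - np.mean(d_fake * x_hat),
        np.mean(1.0 - d_real) - np.mean(d_fake),
    ])

    # ⑤ generator step (non-saturating trick): gradient ascent on E[log D(G(z))]
    d_fake = D(G(z, theta), phi)
    theta += lr * np.array([
        np.mean((1.0 - d_fake) * phi[0] * z),
        np.mean((1.0 - d_fake) * phi[0]),
    ])
```

After a few hundred alternating updates, the mean of the fakes should drift toward the real mean of 4, with D hovering near 0.5 at equilibrium.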
A GAN (Generative Adversarial Network) is an innovative setup where the Generator (G) that creates new content and the Discriminator (D) that judges real vs fake keep competing and improving. Think of a breathless mind game between a genius counterfeiter and a veteran forensic detective: the counterfeiter keeps refining fakes, and the detective keeps raising detection skills. In this tense minimax tug of war, the counterfeiter may eventually produce outputs humans cannot tell from real data. This chapter explores the math behind GANs, the minimax game, and mode collapse, where the generator falls into a rut, with rich examples.

Reading the formulas (GAN)

In one line: G makes fakes; D tries to tell real from fake.

$$\min_G \max_D V(D,G)=\mathbb{E}_{x\sim p_{\text{data}}}[\log D(x)] + \mathbb{E}_{z\sim p(z)}[\log(1-D(G(z)))]$$

* $G$ (generator): maps noise $z$ to a new fake sample.
* $D$ (discriminator): outputs how likely the input is real, between 0 and 1.
* $\min_G \max_D$: $G$ and $D$ pull the score in opposite directions ($G$ avoids a bad score, $D$ seeks a good one), so they are trained in turns.
* $V(D,G)$: the objective they compete over: the sum of the two terms below.
* $\mathbb{E}_{x\sim p_{\text{data}}}[\cdot]$ (left term): draw real $x$ many times and average $\log D(x)$. This is the real-data side.
* $\log D(x)$: grows when $D(x)$ is near 1. $D$ wants to call real $x$ real.
* $\mathbb{E}_{z\sim p(z)}[\cdot]$ (right term): draw noise $z$, build fakes $G(z)$, and average $\log(1-D(G(z)))$. This is the fake-data side.
* $G(z)$: one fake sample from the noise $z$ you drew.
* $\log(1-D(G(z)))$: grows when $D$ calls the fake fake ($D(G(z))$ near 0). $G$ wins by fooling $D$.
* $D(x)$ recap: for any input, the probability it is real (near 0 = fake, near 1 = real).
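To feel how the two terms of V(D, G) move, here is a quick numeric check on single samples. The values 0.9 and 0.1 are hypothetical discriminator outputs standing in for D(x) and D(G(z)).

```python
import math

def v_terms(d_real, d_fake):
    """The two per-sample terms of V(D, G): log D(x) and log(1 - D(G(z)))."""
    return math.log(d_real), math.log(1.0 - d_fake)

# A sharp discriminator: real rated 0.9, fake rated 0.1 -> both terms near 0 (high V).
left, right = v_terms(0.9, 0.1)

# A fooled discriminator: the fake is also rated 0.9 -> the right term
# plunges toward -infinity (low V), which is exactly what G wants.
left2, right2 = v_terms(0.9, 0.9)
```

Both sides read the same scoreboard: D pushes both terms up toward 0, G drags the right term down.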
[Diagram: real sample x and random noise z; z feeds the generator G, which outputs x̂; both x and x̂ feed the discriminator D, which predicts Real or Fake. Labels: competition · real/fake prediction · adversarial loss]
In one line: noise z goes into the generator, which creates fake samples, and the discriminator competes to tell real from fake.

GAN: Generator vs. Discriminator

1. Core GAN architecture: generator vs discriminator
A GAN is a structure where two networks fight endlessly and grow stronger. The Generator (G) tries to make fake data look real, while the Discriminator (D) sharply judges real vs fake.
* Analogy: A forger (generator) brings a fake painting, and an appraiser (discriminator) uses a magnifying glass to tell originals from fakes. Each side keeps sharpening its craft.
2. The minimax objective
The core GAN objective is:
min⁡Gmax⁡DV(D,G)=Ex[log⁡D(x)]+Ez[log⁡(1−D(G(z)))]\min_G \max_D V(D, G) = \mathbb{E}_{x}[\log D(x)] + \mathbb{E}_{z}[\log(1 - D(G(z)))]minG​maxD​V(D,G)=Ex​[logD(x)]+Ez​[log(1−D(G(z)))]
* Discriminator ($D$), maximize: on real $x$, push $D(x)$ toward 1; on fake $G(z)$, push $D(G(z))$ toward 0.
* Generator ($G$), minimize: make the discriminator treat $G(z)$ as real ($D(G(z)) \to 1$) so the second term shrinks.
3. Latent noise z
Latent noise z is the random vector fed to the generator as a starting point.
* Analogy: Like a lump of clay handed to a sculptor: small changes in z can change expression, color, or style in the finished image.
4. Mode collapse
A notorious failure mode: the generator stops exploring diversity and keeps copying one sample that already fooled the discriminator.
* Analogy: A restaurant that earns a perfect score for kimchi stew, then serves only kimchi stew to every guest all year.
5. Conditional GAN (cGAN)
Add a condition y, such as a class label or text, alongside z to steer generation, e.g. "draw a cat" or "colorize this sketch".
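A minimal sketch of the cGAN conditioning trick, assuming one-hot class labels and simple concatenation (the function names are illustrative, and real cGANs may also feed y to the discriminator):

```python
import numpy as np

def one_hot(label, num_classes):
    """Encode an integer class label as a one-hot vector y."""
    v = np.zeros(num_classes, dtype=np.float32)
    v[label] = 1.0
    return v

def cgan_generator_input(z, label, num_classes=10):
    """cGAN conditioning: the generator sees [z ; y], so y steers what gets drawn."""
    return np.concatenate([z, one_hot(label, num_classes)])

z = np.random.default_rng(0).normal(size=100).astype(np.float32)
g_in = cgan_generator_input(z, label=3)   # "draw class 3"
```

The same z with a different label lands in a different region of the input space, so the generator can learn one label-steered mapping instead of ten separate ones.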

Why it matters

1. A true starting point for generative AI
Where classifiers answer "this is a dog," GANs paint dogs that never existed—a backbone of modern generative AI across images, audio, and voice.
2. Sharp, vivid detail
Unlike blurry average-seeking models, GANs must pass a harsh critic, so hair strands and skin texture can look razor-sharp.
3. Data augmentation
Train on a few snowy-night driving photos and synthesize thousands more; rare medical or defect images can be multiplied for downstream models.

How it is used

Step 1: Normalize inputs (tanh)
Scale pixels (often 0–255) to $[-1, 1]$. If the generator ends with $\tanh$, match real images to the same range so the discriminator compares fairly.
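A minimal sketch of Step 1, assuming uint8 images and a tanh-ended generator (the helper names are ours):

```python
import numpy as np

def to_tanh_range(img_uint8):
    """Map 0-255 pixels to [-1, 1], the output range of a tanh generator."""
    return img_uint8.astype(np.float32) / 127.5 - 1.0

def to_uint8(img):
    """Inverse map, used when displaying generated images."""
    return np.clip(np.rint((img + 1.0) * 127.5), 0, 255).astype(np.uint8)

img = np.array([0, 128, 255], dtype=np.uint8)
scaled = to_tanh_range(img)   # endpoints land exactly on -1 and 1
```

Dividing by 127.5 and subtracting 1 is equivalent to `2 * (x / 255) - 1`; the point is only that real and fake images share one range.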
Step 2: BCE loss and label smoothing
Use binary cross-entropy (BCE) for real vs fake. Label smoothing (e.g. targets 0.9 instead of 1.0) can curb an overconfident discriminator.
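Step 2 in code, as a sketch: a hand-rolled BCE (a real project would use a framework's loss) applied to some hypothetical overconfident discriminator outputs, with hard vs smoothed targets.

```python
import numpy as np

def bce(p, target):
    """Binary cross-entropy between discriminator outputs p in (0, 1) and targets."""
    p = np.clip(p, 1e-7, 1.0 - 1e-7)
    return float(-np.mean(target * np.log(p) + (1.0 - target) * np.log(1.0 - p)))

d_real = np.array([0.95, 0.99, 0.999])   # an overconfident discriminator
hard = bce(d_real, np.ones(3))           # targets 1.0: near-zero loss
smooth = bce(d_real, np.full(3, 0.9))    # targets 0.9: loss stays up near p = 1
```

With smoothing, the loss minimum sits at p = 0.9 rather than p = 1.0, so pushing outputs all the way to 1 is no longer rewarded.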
Step 3: Alternate training
Do not update G and D in lockstep. A common schedule trains D for k steps, then G once. If D dominates, G's gradients can vanish; balance learning rates and update ratios.
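The k-to-1 schedule from Step 3 is just an interleaving; a sketch (k = 2 is an arbitrary example, and in practice k is a tuning knob):

```python
def training_schedule(num_rounds, k=2):
    """Return the update order: k discriminator steps, then one generator step."""
    order = []
    for _ in range(num_rounds):
        order.extend(["D"] * k)   # let D catch up on the current fakes
        order.append("G")         # then give G one chance to adapt
    return order

schedule = training_schedule(2, k=2)   # ['D', 'D', 'G', 'D', 'D', 'G']
```

Shrinking k (or raising G's learning rate) is one lever when D races ahead.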
Step 4: Stability checks and FID
Watch for mode collapse visually. FID (Fréchet Inception Distance) compares real vs fake feature distributions; lower FID usually means a closer match to real data.
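Real FID runs a Fréchet distance on Inception-v3 features with full covariance matrices. As a sketch of the idea only, here is the same distance for 1-D Gaussians fit to samples, where it reduces to $(\mu_1-\mu_2)^2 + (\sigma_1-\sigma_2)^2$:

```python
import numpy as np

def frechet_1d(a, b):
    """Frechet distance between 1-D Gaussians fit to two sample sets.
    Simplified stand-in for FID: (mu1 - mu2)^2 + (sigma1 - sigma2)^2."""
    return (a.mean() - b.mean()) ** 2 + (a.std() - b.std()) ** 2

rng = np.random.default_rng(0)
real = rng.normal(0.0, 1.0, 10_000)
good_fake = rng.normal(0.1, 1.0, 10_000)   # close in mean and spread
collapsed = np.full(10_000, 0.0)           # mode collapse: one repeated sample
```

Note how the collapsed generator matches the real mean perfectly yet still scores badly, because its spread is zero; that is why distribution-level metrics catch mode collapse that per-sample realism checks miss.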

Summary

One-line summary: A GAN is a generator–discriminator game that learns to produce realistic samples from noise z.
Key point: stability, balance, and diversity are the main things to watch.
Next: conditional and more stable variants extend the same idea.

Problem-solving notes

Start with one line: Generator G turns noise z into fakes; discriminator D judges real vs fake. First decide who makes and who judges, then add minimax, alternating updates, and mode collapse when needed.
When numbers appear: flattened length is height × width (× 3 for RGB); patch grids without a CLS token use $(H/p)\times(W/p)$; one fully connected layer has roughly $d_{\mathrm{in}}\times d_{\mathrm{out}}$ weights.
Example (flatten) — GAN grayscale 28×28 image, flattened d? → 784

Example (patch grid) — 224×224 image, 16×16 patches, no CLS → 14² = 196
Example (concept) — Generator role in a GAN?
② Turn noise z into fakes → 2

Example (calculation) — RGB 32×32 with 3 channels, flattened d? → 3072
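The counting rules behind these worked numbers can be checked in a few lines (the helper names are ours):

```python
def flatten_dim(h, w, channels=1):
    """d for a flattened image: height x width x channels."""
    return h * w * channels

def patch_count(h, w, p):
    """Patch-grid size without a CLS token: (h/p) x (w/p)."""
    return (h // p) * (w // p)

print(flatten_dim(28, 28))        # 784
print(patch_count(224, 224, 16))  # 196
print(flatten_dim(32, 32, 3))     # 3072
```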

Example (application) — Discriminator too strong?
① Rebalance G/D updates
Definition — Mode collapse means repeating nearly the same sample. → pick that description

True/false — A conditional GAN can use labels or conditions. → 1