GANs —
Wasserstein GAN with MNIST (Part 6)

fernanda rodríguez
3 min read · Jun 16, 2020


A brief theoretical introduction to Wasserstein GANs (WGANs) and a practical implementation using Python and Keras/TensorFlow in a Jupyter Notebook.

Wasserstein GAN or WGANs by fernanda rodríguez.

In this article, you will find:

  • Research paper,
  • Definition, network design, and cost function, and
  • Training WGANs with the MNIST dataset using Python and Keras/TensorFlow in a Jupyter Notebook.

Research Paper

Arjovsky, M., Chintala, S., & Bottou, L. (2017). Wasserstein GAN. ArXiv, abs/1701.07875.

Wasserstein GAN — WGAN

Wasserstein GAN (WGAN) proposes a new cost function, based on the Wasserstein distance, that has a smoother gradient everywhere.

The Wasserstein distance measures the difference between the distribution of the real data and the distribution of the generated images.

The WGAN network is very similar to the Discriminator 𝐷, just without the final sigmoid function: it outputs an unbounded scalar score rather than a probability.

The Discriminator 𝐷 is therefore renamed the Critic to reflect its new role.
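
For reference, the Wasserstein-1 (Earth Mover) distance from the paper, and the Kantorovich-Rubinstein dual form that the Critic approximates, are:

W(P_r, P_g) = \inf_{\gamma \in \Pi(P_r, P_g)} \mathbb{E}_{(x, y) \sim \gamma} \left[ \lVert x - y \rVert \right]

W(P_r, P_g) = \sup_{\lVert f \rVert_L \leq 1} \mathbb{E}_{x \sim P_r} [f(x)] - \mathbb{E}_{x \sim P_g} [f(x)]

Here f plays the role of the Critic, and the 1-Lipschitz constraint is what the weight clipping in the training section below enforces.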

Read more about GANs:

Network design

Wasserstein GAN — WGAN Architecture by fernanda rodríguez.

x is the real data and z is a sample from the latent space.
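
As a quick illustration of the shapes in the diagram (the value latent_dim = 100 is an assumption, not something stated in the article):

import numpy as np

latent_dim = 100                              # assumed size of the latent space
img_shape = (28, 28, 1)                       # x: a real 28x28 grayscale MNIST image
z = np.random.normal(0, 1, (1, latent_dim))   # z: one point sampled from the latent space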

Cost function

Cost function WGAN by fernanda rodríguez.
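
In terms of the Critic C and the Generator G, the objectives illustrated in the figure can be written as:

L_{critic} = \mathbb{E}_{z \sim p(z)} \left[ C(G(z)) \right] - \mathbb{E}_{x \sim P_r} \left[ C(x) \right]

L_{generator} = -\mathbb{E}_{z \sim p(z)} \left[ C(G(z)) \right]

The Critic is trained to widen the gap between its scores on real and generated images, while the Generator is trained to raise the Critic's score on its samples; clipping the Critic's weights to [-clip_value, clip_value] keeps it approximately 1-Lipschitz.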

Training WGANs

1. Data: MNIST dataset
(X_train, y_train), (X_test, y_test) = mnist.load_data()
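
Because the Generator below ends with a tanh activation, the training images are typically rescaled to [-1, 1] and given an explicit channel dimension; a minimal preprocessing sketch (these exact steps are not shown in the excerpt):

import numpy as np

# Scale pixels from [0, 255] to [-1, 1] to match the Generator's tanh output
X_train = (X_train.astype('float32') - 127.5) / 127.5
# Add the channel axis expected by Conv2D: (60000, 28, 28) -> (60000, 28, 28, 1)
X_train = np.expand_dims(X_train, axis=-1)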

2. Model:

  • Generator
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, Reshape, ReLU, Conv2DTranspose

# Generator network (latent_dim and init are defined earlier in the notebook)
generator = Sequential()

# FC: project and reshape the latent vector to a 7x7x512 feature map
generator.add(Dense(7*7*512, input_shape=(latent_dim,), kernel_initializer=init))
generator.add(ReLU())
generator.add(Reshape((7, 7, 512)))

# Conv 1 and Conv 2
...

# Output: upsample to 28x28x1, tanh keeps pixel values in [-1, 1]
generator.add(Conv2DTranspose(1, kernel_size=3, strides=2, padding='same', activation='tanh'))
  • Critic
from tensorflow.keras.layers import Conv2D, LeakyReLU, Flatten

# Critic network (img_shape is the MNIST image shape, (28, 28, 1))
critic = Sequential()

# Conv 1
critic.add(Conv2D(64, kernel_size=3, strides=2, padding='same', input_shape=img_shape))
critic.add(LeakyReLU(0.2))

# Conv 2, Conv 3 and Conv 4
...

# FC
critic.add(Flatten())

# Output: a single unbounded score, no sigmoid activation
critic.add(Dense(1))

3. Compile

from tensorflow.keras import backend as K
from tensorflow.keras.layers import Input
from tensorflow.keras.models import Model
from tensorflow.keras.optimizers import RMSprop

# Wasserstein objective
def wasserstein_loss(y_true, y_pred):
    return K.mean(y_true * y_pred)

n_critic = 5
clip_value = 0.01
optimizer = RMSprop(learning_rate=0.00005)

critic.compile(optimizer=optimizer, loss=wasserstein_loss, metrics=['accuracy'])
# Freeze the critic in the combined model so only the generator is trained there
critic.trainable = False

# The generator takes noise as input and generates imgs
z = Input(shape=(latent_dim,))
img = generator(z)
# The critic takes generated images as input and determines validity
valid = critic(img)

# The combined model (critic and generator)
c_g = Model(inputs=z, outputs=valid, name='wgan')
c_g.compile(optimizer=optimizer, loss=wasserstein_loss, metrics=['accuracy'])
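
One detail the excerpt never shows is where the label tensors real and fake used in the training loop come from. In Keras WGAN implementations that use this wasserstein_loss, real samples are usually labeled -1 and fake samples +1 so the signs of the critic updates work out; a minimal sketch (the batch_size value is an assumption):

import numpy as np

batch_size = 64                     # assumed batch size, not stated in the excerpt
real = -np.ones((batch_size, 1))    # label for real images
fake = np.ones((batch_size, 1))     # label for generated images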

4. Fit

for _ in range(n_critic):

    # Train Critic weights
    critic.trainable = True

    # Real samples
    X_batch = X_train[i*batch_size:(i+1)*batch_size]
    d_loss_real = critic.train_on_batch(x=X_batch, y=real)

    # Fake samples
    z = np.random.normal(loc=0, scale=1, size=(batch_size, latent_dim))
    X_fake = generator.predict(z)
    d_loss_fake = critic.train_on_batch(x=X_fake, y=fake)

    # Critic loss
    d_loss_batch = 0.5 * (d_loss_real[0] + d_loss_fake[0])

    # Clip critic weights to enforce the Lipschitz constraint
    for l in critic.layers:
        weights = l.get_weights()
        weights = [np.clip(w, -clip_value, clip_value) for w in weights]
        l.set_weights(weights)

# Train Generator weights (once per n_critic critic updates)
critic.trainable = False
g_loss_batch = c_g.train_on_batch(x=z, y=real)
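
For context, the block above sits inside an outer loop over epochs and mini-batches that defines i and collects the losses plotted in step 5. A rough structural sketch, reusing batch_size from the sketch in step 3 (the exact bookkeeping is my assumption):

epochs = 100                   # the results below are reported over 100 epochs
d_loss, d_g_loss = [], []      # per-epoch losses, plotted in step 5

for epoch in range(epochs):
    for i in range(X_train.shape[0] // batch_size):
        # ... the n_critic Critic updates and the single Generator update shown above ...
        pass
    # record the last batch's losses for this epoch (one simple bookkeeping choice)
    d_loss.append(d_loss_batch)
    d_g_loss.append(g_loss_batch[0])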

5. Evaluate

# Plotting the metrics
import matplotlib.pyplot as plt

plt.plot(d_loss, label='critic loss')
plt.plot(d_g_loss, label='generator loss')
plt.xlabel('epoch')
plt.legend()
plt.show()
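
The digit grids in the results below come from running the trained Generator on random noise. A minimal sketch of how such a grid could be plotted (the 4x4 grid size and styling are my own choices, not from the article):

import numpy as np
import matplotlib.pyplot as plt

n = 4                                              # plot a 4x4 grid of samples
z = np.random.normal(0, 1, (n * n, latent_dim))
samples = generator.predict(z)                     # in [-1, 1] because of the tanh output

fig, axes = plt.subplots(n, n, figsize=(4, 4))
for ax, img in zip(axes.flat, samples):
    ax.imshow(img.squeeze() * 0.5 + 0.5, cmap='gray')   # rescale back to [0, 1]
    ax.axis('off')
plt.show()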

WGANs — MNIST results

epoch = 1/100, d_loss=-0.292, g_loss=0.456 in WGAN_MNIST
epoch = 100/100, d_loss=0.014, g_loss=0.028 in WGAN_MNIST

Train summary

Train summary WGAN by fernanda rodríguez.

GitHub repository

See the complete WGAN training with the MNIST dataset, using Python and Keras/TensorFlow, in the Jupyter Notebook.

