Pix2pixHD: High-Resolution Image Translation

Understand the pix2pixHD architecture and train the model on the Cityscapes dataset.

We'll cover the following...

Model architecture
Model training

Pix2pixHD (Wang, Ting-Chun, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, and Bryan Catanzaro. "High-resolution image synthesis and semantic manipulation with conditional GANs." In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8798-8807. 2018.) is an upgraded version of the pix2pix model. Its biggest improvement over pix2pix is that it supports high-quality image-to-image translation at $2048\times1024$ resolution.

Model architecture

To reach this resolution, the authors designed a two-stage approach that gradually trains and refines the networks, as shown in the following diagram. First, a generator network $G_1$, called the global generator (the red box), produces a lower-resolution image of $1024\times512$. Second, the image is enlarged by a generator network $G_2$ ...
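The coarse-to-fine idea described above can be sketched in PyTorch as follows. This is only a minimal illustration under stated assumptions, not the pix2pixHD implementation: the class names `GlobalGenerator` and `Enhancer`, the helper `coarse_to_fine`, the tiny layer sizes, the average-pooling downsampler, and the way `Enhancer` simply concatenates the upsampled coarse image with the full-resolution input are all hypothetical simplifications. The input tensor is also scaled down (a $128\times64$ stand-in for $2048\times1024$) so that the example runs quickly on a CPU.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class GlobalGenerator(nn.Module):
    """Hypothetical stand-in for G1: works entirely at half resolution."""

    def __init__(self, in_ch=3, out_ch=3, feat=8):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, feat, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(feat, out_ch, kernel_size=3, padding=1),
            nn.Tanh(),  # image values in [-1, 1]
        )

    def forward(self, x_half):
        return self.net(x_half)


class Enhancer(nn.Module):
    """Hypothetical stand-in for G2: enlarges the coarse image 2x and refines it."""

    def __init__(self, in_ch=3, out_ch=3, feat=8):
        super().__init__()
        self.refine = nn.Sequential(
            nn.Conv2d(in_ch + out_ch, feat, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(feat, out_ch, kernel_size=3, padding=1),
            nn.Tanh(),
        )

    def forward(self, x_full, coarse):
        # Assumption: plain bilinear upsampling of the coarse image.
        up = F.interpolate(coarse, scale_factor=2, mode="bilinear",
                           align_corners=False)
        # Condition the refinement on the full-resolution input.
        return self.refine(torch.cat([x_full, up], dim=1))


def coarse_to_fine(g1, g2, x_full):
    """Stage 1: generate at half resolution. Stage 2: enlarge and refine."""
    x_half = F.avg_pool2d(x_full, kernel_size=2)  # halve height and width
    coarse = g1(x_half)
    fine = g2(x_full, coarse)
    return coarse, fine


g1, g2 = GlobalGenerator(), Enhancer()
# Tensor layout is (N, C, H, W): height 64, width 128.
x = torch.randn(1, 3, 64, 128)
with torch.no_grad():
    coarse, fine = coarse_to_fine(g1, g2, x)
print(coarse.shape, fine.shape)
```

Here `coarse` has half the height and width of the input and `fine` matches the input size, mirroring the $1024\times512$ to $2048\times1024$ progression in the text. Splitting the work this way lets the first network be trained on smaller images before the second one is added on top of it.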
