Improved Training of Wasserstein GANs, Ishaan Gulrajani, Faruk Ahmed, Martin Arjovsky, Vincent Dumoulin, Aaron Courville, 2017Advances in Neural Information Processing Systems 30 (NIPS 2017)DOI: 10.48550/arXiv.1704.00028 - Proposes gradient penalty as a more effective and stable alternative to weight clipping for enforcing the Lipschitz constraint in WGANs.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - Provides a comprehensive introduction to deep learning, including foundational explanations of GANs, training issues, and different divergence metrics.