Video Generative Adversarial Networks, Carl Vondrick, Hamed Pirsiavash, Antonio Torralba, 2016Advances in Neural Information Processing Systems, Vol. 29 (Curran Associates, Inc.)DOI: 10.48550/arXiv.1609.02612 - This paper introduced 3D convolutional networks into the GAN framework for generating video sequences.
DVD-GAN: A Differentiable Video Discriminator for Training Conditional GANs, Alexia Clark, Anna Lucic, Kosta Derpanis, Marcus Brubaker, 2019Advances in Neural Information Processing Systems, Vol. 32 (NeurIPS) - Introduces a hierarchical GAN approach with multiple discriminators to improve spatio-temporal realism and diversity in video generation.
Assessing Generative Models via Fréchet Video Distance, Mehdi Sajjadi, Oleksiy Lukashenko, Anil Sharma, Bernhard Schölkopf, 2018Proceedings of the 35th International Conference on Machine Learning (ICML), Vol. 80 - This paper defines Fréchet Video Distance (FVD), a metric that quantifies the quality and temporal consistency of generated video content.
Deep multi-scale video prediction beyond mean square error, Michael Mathieu, Camille Couprie, Yann LeCun, 2016International Conference on Learning Representations (ICLR) (OpenReview.net)DOI: 10.48550/arXiv.1511.05440 - An early application of adversarial learning to video prediction, yielding sharper future frame predictions than traditional L2 loss.