Generating Videos with Scene Dynamics
dc.contributor.author | Vondrick, Carl | |
dc.contributor.author | Pirsiavash, Hamed | |
dc.contributor.author | Torralba, Antonio | |
dc.date.accessioned | 2019-05-24T15:48:17Z | |
dc.date.available | 2019-05-24T15:48:17Z | |
dc.date.issued | 2016 | |
dc.description | 30th Conference on Neural Information Processing Systems (NIPS 2016), Barcelona, Spain | en_US |
dc.description.abstract | We capitalize on large amounts of unlabeled video in order to learn a model of scene dynamics for both video recognition tasks (e.g. action classification) and video generation tasks (e.g. future prediction). We propose a generative adversarial network for video with a spatio-temporal convolutional architecture that untangles the scene’s foreground from the background. Experiments suggest this model can generate tiny videos up to a second at full frame rate better than simple baselines, and we show its utility at predicting plausible futures of static images. Moreover, experiments and visualizations show the model internally learns useful features for recognizing actions with minimal supervision, suggesting scene dynamics are a promising signal for representation learning. We believe generative video models can impact many applications in video understanding and simulation. | en_US |
dc.description.sponsorship | This work was supported by NSF grant #1524817 to AT, START program at UMBC to HP, and the Google PhD fellowship to CV. | en_US |
dc.description.uri | https://papers.nips.cc/paper/6194-generating-videos-with-scene-dynamics.pdf | en_US |
dc.format.extent | 9 pages | en_US |
dc.genre | conference papers and proceedings | en_US |
dc.identifier | doi:10.13016/m26gih-tnyz | |
dc.identifier.citation | Carl Vondrick, Hamed Pirsiavash, Antonio Torralba, Generating Videos with Scene Dynamics, Advances in Neural Information Processing Systems 29 (NIPS 2016),https://papers.nips.cc/paper/6194-generating-videos-with-scene-dynamics.pdf | en_US |
dc.identifier.uri | http://hdl.handle.net/11603/13942 | |
dc.language.iso | en_US | en_US |
dc.relation.isAvailableAt | The University of Maryland, Baltimore County (UMBC) | |
dc.relation.ispartof | UMBC Computer Science and Electrical Engineering Department Collection | |
dc.relation.ispartof | UMBC Faculty Collection | |
dc.rights | This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author. | |
dc.subject | video recognition tasks | en_US |
dc.subject | spatio-temporal convolutional architecture | en_US |
dc.subject | visualizations | en_US |
dc.title | Generating Videos with Scene Dynamics | en_US |
dc.type | Text | en_US |
Files
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 2.56 KB
- Format:
- Item-specific license agreed upon to submission
- Description: