Generating Videos with Scene Dynamics

Vondrick, Carl; Pirsiavash, Hamed; Torralba, Antonio

Generating Videos with Scene Dynamics

dc.contributor.author	Vondrick, Carl
dc.contributor.author	Pirsiavash, Hamed
dc.contributor.author	Torralba, Antonio
dc.date.accessioned	2019-05-24T15:48:17Z
dc.date.available	2019-05-24T15:48:17Z
dc.date.issued	2016
dc.description	30th Conference on Neural Information Processing Systems (NIPS 2016), Barcelona, Spain	en
dc.description.abstract	We capitalize on large amounts of unlabeled video in order to learn a model of scene dynamics for both video recognition tasks (e.g. action classification) and video generation tasks (e.g. future prediction). We propose a generative adversarial network for video with a spatio-temporal convolutional architecture that untangles the scene’s foreground from the background. Experiments suggest this model can generate tiny videos up to a second at full frame rate better than simple baselines, and we show its utility at predicting plausible futures of static images. Moreover, experiments and visualizations show the model internally learns useful features for recognizing actions with minimal supervision, suggesting scene dynamics are a promising signal for representation learning. We believe generative video models can impact many applications in video understanding and simulation.	en
dc.description.sponsorship	This work was supported by NSF grant #1524817 to AT, START program at UMBC to HP, and the Google PhD fellowship to CV.	en
dc.description.uri	https://papers.nips.cc/paper/6194-generating-videos-with-scene-dynamics.pdf	en
dc.format.extent	9 pages	en
dc.genre	conference papers and proceedings	en
dc.identifier	doi:10.13016/m26gih-tnyz
dc.identifier.citation	Carl Vondrick, Hamed Pirsiavash, Antonio Torralba, Generating Videos with Scene Dynamics, Advances in Neural Information Processing Systems 29 (NIPS 2016),https://papers.nips.cc/paper/6194-generating-videos-with-scene-dynamics.pdf	en
dc.identifier.uri	http://hdl.handle.net/11603/13942
dc.language.iso	en	en
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartof	UMBC Faculty Collection
dc.rights	This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subject	video recognition tasks	en
dc.subject	spatio-temporal convolutional architecture	en
dc.subject	visualizations	en
dc.title	Generating Videos with Scene Dynamics	en
dc.type	Text	en

Files

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 2.56 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

UMBC Computer Science and Electrical Engineering Department
UMBC Faculty Collection