Parsing videos of actions with segmental grammars

Pirsiavash, Hamed; Ramanan, Deva

Parsing videos of actions with segmental grammars

dc.contributor.author	Pirsiavash, Hamed
dc.contributor.author	Ramanan, Deva
dc.date.accessioned	2019-06-28T16:41:53Z
dc.date.available	2019-06-28T16:41:53Z
dc.date.issued	2014-06-28
dc.description.abstract	Real-world videos of human activities exhibit temporal structure at various scales, long videos are typically composed out of multiple action instances, where each instance is itself composed of sub-actions with variable durations and orderings. Temporal grammars can presumably model such hierarchical structure, but are computationally difficult to apply for long video streams. We describe simple grammars that capture hierarchical temporal structure while admitting inference with a finite-state-machine. This makes parsing linear time, constant storage, and naturally online. We train grammar parameters using a latent structural SVM, where latent subactions are learned automatically. We illustrate the effectiveness of our approach over common baselines on a new half-million frame dataset of continuous YouTube videos.	en
dc.description.sponsorship	Funding for this research was provided by NSF Grant 0954083, ONR-MURI Grant N00014- 10-1-0933, and the Intel Science and Technology Center -Visual Computing.	en
dc.description.uri	https://ieeexplore.ieee.org/document/6909479	en
dc.format.extent	8 pages	en
dc.genre	conference papers and proceedings preprints	en
dc.identifier	doi:10.13016/m2b0nz-gddn
dc.identifier.citation	Hamed Pirsiavash, Deva Ramanan , Parsing videos of actions with segmental grammars, 2014 IEEE Conference on Computer Vision and Pattern Recognition, DOI: 10.1109/CVPR.2014.85	en
dc.identifier.uri	https://doi.org/10.1109/CVPR.2014.85
dc.identifier.uri	http://hdl.handle.net/11603/14318
dc.language.iso	en	en
dc.publisher	IEEE	en
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartof	UMBC Faculty Collection
dc.rights	This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.rights	© 2014 IEEE
dc.subject	Grammar	en
dc.subject	Videos	en
dc.subject	Hidden Markov models	en
dc.subject	Data models	en
dc.subject	Presses	en
dc.subject	Markov processes	en
dc.subject	finite state machines	en
dc.subject	support vector machines	en
dc.subject	image segmentation	en
dc.subject	latent subactions	en
dc.subject	latent structural SVM	en
dc.title	Parsing videos of actions with segmental grammars	en
dc.type	Text	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: grammar_cvpr14.pdf
Size:: 1.49 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 2.56 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

UMBC Computer Science and Electrical Engineering Department
UMBC Faculty Collection