DeepCAMP: Deep Convolutional Action & Attribute Mid-Level Patterns

Diba, Ali; Pazandeh, Ali Mohammad; Pirsiavash, Hamed; Gool, Luc Van

DeepCAMP: Deep Convolutional Action & Attribute Mid-Level Patterns

dc.contributor.author	Diba, Ali
dc.contributor.author	Pazandeh, Ali Mohammad
dc.contributor.author	Pirsiavash, Hamed
dc.contributor.author	Gool, Luc Van
dc.date.accessioned	2019-07-03T14:17:09Z
dc.date.available	2019-07-03T14:17:09Z
dc.date.issued	2016-12-12
dc.description.abstract	The recognition of human actions and the determination of human attributes are two tasks that call for fine-grained classification. Indeed, often rather small and inconspicuous objects and features have to be detected to tell their classes apart. In order to deal with this challenge, we propose a novel convolutional neural network that mines mid-level image patches that are sufficiently dedicated to resolve the corresponding subtleties. In particular, we train a newly designed CNN (DeepPattern) that learns discriminative patch groups. There are two innovative aspects to this. On the one hand we pay attention to contextual information in an original fashion. On the other hand, we let an iteration of feature learning and patch clustering purify the set of dedicated patches that we use. We validate our method for action classification on two challenging datasets: PASCAL VOC 2012 Action and Stanford 40 Actions, and for attribute recognition we use the Berkeley Attributes of People dataset. Our discriminative mid-level mining CNN obtains state-of-the-art results on these datasets, without a need for annotations about parts and poses.	en_US
dc.description.sponsorship	This work was supported by DBOF PhD scholarship, KU Leuven CAMETRON project	en_US
dc.description.uri	https://arxiv.org/abs/1608.03217	en_US
dc.format.extent	9 pages	en_US
dc.genre	conference papers and proceedings preprints	en_US
dc.identifier	doi:10.13016/m2xrer-wdho
dc.identifier.citation	Ali Diba, et.al, DeepCAMP: Deep Convolutional Action & Attribute Mid-Level Patterns, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), DOI: 10.1109/CVPR.2016.387	en_US
dc.identifier.uri	https://doi.org/10.1109/CVPR.2016.387
dc.identifier.uri	http://hdl.handle.net/11603/14335
dc.language.iso	en_US	en_US
dc.publisher	IEEE	en_US
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartof	UMBC Faculty Collection
dc.rights	This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.rights	© 2016 IEEE
dc.subject	data mining	en_US
dc.subject	feature extraction	en_US
dc.subject	feedforward neural nets	en_US
dc.subject	image classification	en_US
dc.subject	learning (artificial intelligence)	en_US
dc.subject	pattern clustering	en_US
dc.subject	visualization	en_US
dc.subject	DeepCAMP	en_US
dc.subject	deep convolutional action & attribute mid-level patterns	en_US
dc.subject	human action recognition	en_US
dc.subject	convolutional neural network	en_US
dc.title	DeepCAMP: Deep Convolutional Action & Attribute Mid-Level Patterns	en_US
dc.type	Text	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 1608.03217.pdf
Size:: 3.16 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 2.56 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

UMBC Computer Science and Electrical Engineering Department
UMBC Faculty Collection