DeepCAMP: Deep Convolutional Action & Attribute Mid-Level Patterns
dc.contributor.author | Diba, Ali | |
dc.contributor.author | Pazandeh, Ali Mohammad | |
dc.contributor.author | Pirsiavash, Hamed | |
dc.contributor.author | Gool, Luc Van | |
dc.date.accessioned | 2019-07-03T14:17:09Z | |
dc.date.available | 2019-07-03T14:17:09Z | |
dc.date.issued | 2016-12-12 | |
dc.description.abstract | The recognition of human actions and the determination of human attributes are two tasks that call for fine-grained classification. Indeed, often rather small and inconspicuous objects and features have to be detected to tell their classes apart. In order to deal with this challenge, we propose a novel convolutional neural network that mines mid-level image patches that are sufficiently dedicated to resolve the corresponding subtleties. In particular, we train a newly designed CNN (DeepPattern) that learns discriminative patch groups. There are two innovative aspects to this. On the one hand we pay attention to contextual information in an original fashion. On the other hand, we let an iteration of feature learning and patch clustering purify the set of dedicated patches that we use. We validate our method for action classification on two challenging datasets: PASCAL VOC 2012 Action and Stanford 40 Actions, and for attribute recognition we use the Berkeley Attributes of People dataset. Our discriminative mid-level mining CNN obtains state-of-the-art results on these datasets, without a need for annotations about parts and poses. | en_US |
dc.description.sponsorship | This work was supported by DBOF PhD scholarship, KU Leuven CAMETRON project | en_US |
dc.description.uri | https://arxiv.org/abs/1608.03217 | en_US |
dc.format.extent | 9 pages | en_US |
dc.genre | conference papers and proceedings preprints | en_US |
dc.identifier | doi:10.13016/m2xrer-wdho | |
dc.identifier.citation | Ali Diba, et.al, DeepCAMP: Deep Convolutional Action & Attribute Mid-Level Patterns, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), DOI: 10.1109/CVPR.2016.387 | en_US |
dc.identifier.uri | https://doi.org/10.1109/CVPR.2016.387 | |
dc.identifier.uri | http://hdl.handle.net/11603/14335 | |
dc.language.iso | en_US | en_US |
dc.publisher | IEEE | en_US |
dc.relation.isAvailableAt | The University of Maryland, Baltimore County (UMBC) | |
dc.relation.ispartof | UMBC Computer Science and Electrical Engineering Department Collection | |
dc.relation.ispartof | UMBC Faculty Collection | |
dc.rights | This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author. | |
dc.rights | © 2016 IEEE | |
dc.subject | data mining | en_US |
dc.subject | feature extraction | en_US |
dc.subject | feedforward neural nets | en_US |
dc.subject | image classification | en_US |
dc.subject | learning (artificial intelligence) | en_US |
dc.subject | pattern clustering | en_US |
dc.subject | visualization | en_US |
dc.subject | DeepCAMP | en_US |
dc.subject | deep convolutional action & attribute mid-level patterns | en_US |
dc.subject | human action recognition | en_US |
dc.subject | convolutional neural network | en_US |
dc.title | DeepCAMP: Deep Convolutional Action & Attribute Mid-Level Patterns | en_US |
dc.type | Text | en_US |