Representation Learning by Learning to Count

Noroozi, Mehdi; Pirsiavash, Hamed; Favaro, Paolo

dc.contributor.author	Noroozi, Mehdi
dc.contributor.author	Pirsiavash, Hamed
dc.contributor.author	Favaro, Paolo
dc.date.accessioned	2019-07-01T17:51:38Z
dc.date.available	2019-07-01T17:51:38Z
dc.date.issued	2017-12-25
dc.description.abstract	We introduce a novel method for representation learning that uses an artificial supervision signal based on counting visual primitives. This supervision signal is obtained from an equivariance relation, which does not require any manual annotation. We relate transformations of images to transformations of the representations. More specifically, we look for the representation that satisfies such a relation rather than the transformations that match a given representation. In this paper, we use two image transformations in the context of counting: scaling and tiling. The first transformation exploits the fact that the number of visual primitives should be invariant to scale. The second transformation allows us to equate the total number of visual primitives in each tile to that in the whole image. These two transformations are combined in one constraint and used to train a neural network with a contrastive loss. The proposed task produces representations that perform on par with or exceed the state of the art in transfer learning benchmarks.	en_US
dc.description.sponsorship	Paolo Favaro acknowledges support from the Swiss National Science Foundation on project 200021 149227. Hamed Pirsiavash acknowledges support from GE Global Research.	en_US
dc.description.uri	https://ieeexplore.ieee.org/document/8237890	en_US
dc.format.extent	9 pages	en_US
dc.genre	conference papers and proceedings preprints	en_US
dc.identifier	doi:10.13016/m2qaoo-thod
dc.identifier.citation	Mehdi Noroozi et al., Representation Learning by Learning to Count, 2017 IEEE International Conference on Computer Vision (ICCV), DOI: 10.1109/ICCV.2017.628	en_US
dc.identifier.uri	https://doi.org/10.1109/ICCV.2017.628
dc.identifier.uri	http://hdl.handle.net/11603/14327
dc.language.iso	en_US	en_US
dc.publisher	IEEE	en_US
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartof	UMBC Faculty Collection
dc.rights	This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.rights	© 2017 IEEE
dc.subject	image representation	en_US
dc.subject	neural nets	en_US
dc.subject	representation learning	en_US
dc.subject	artificial supervision signal	en_US
dc.subject	equivariance relation	en_US
dc.subject	manual annotation	en_US
dc.subject	visual primitives	en_US
dc.title	Representation Learning by Learning to Count	en_US
dc.type	Text	en_US
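
The constraint described in the abstract (the count for a downscaled image should equal the summed counts of its tiles, trained with a contrastive term) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and not the authors' implementation: the 2x2 tiling, the max-pooling `downsample`, the toy `phi` that counts nonzero pixels in place of a learned network, the `margin` value, and the exact form of the loss.

```python
def downsample(img):
    """Halve the resolution by max-pooling each 2x2 block (toy stand-in for scaling)."""
    return [[max(img[r][c], img[r][c + 1], img[r + 1][c], img[r + 1][c + 1])
             for c in range(0, len(img[0]), 2)]
            for r in range(0, len(img), 2)]


def tiles(img):
    """Split the image into a 2x2 grid of equal tiles (toy stand-in for tiling)."""
    h, w = len(img) // 2, len(img[0]) // 2
    return [[row[c:c + w] for row in img[r:r + h]] for r in (0, h) for c in (0, w)]


def phi(img):
    """Hypothetical 'counting' feature: number of nonzero pixels, as a 1-D vector."""
    return [sum(1 for row in img for v in row if v)]


def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((p - q) ** 2 for p, q in zip(a, b))


def counting_loss(x, y, margin=10.0):
    """Assumed contrastive form: the tile counts of x should match the count of
    downscaled x, and should differ from the count of a different image y."""
    tile_sum = [sum(vals) for vals in zip(*(phi(t) for t in tiles(x)))]
    agree = sq_dist(phi(downsample(x)), tile_sum)
    differ = max(0.0, margin - sq_dist(phi(downsample(y)), tile_sum))
    return agree + differ


# Toy 4x4 images: each nonzero pixel plays the role of one visual primitive.
x = [[1, 0, 0, 0],
     [0, 0, 0, 1],
     [0, 0, 0, 0],
     [0, 1, 0, 0]]
y = [[0, 0, 0, 0],
     [0, 1, 0, 0],
     [0, 0, 0, 0],
     [0, 0, 0, 0]]

print(counting_loss(x, y))
```

In this toy setting `phi` already satisfies the scaling and tiling constraint on `x`, so only the contrastive term contributes to the loss. In the setting the abstract describes, a neural network would take the place of `phi` and be trained to satisfy the constraint.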

Files

Original bundle

Name: 1708.06734.pdf
Size: 4.08 MB
Format: Adobe Portable Document Format

Collections

UMBC Computer Science and Electrical Engineering Department
UMBC Faculty Collection