Descriptor: Face Detection Dataset for Programmable Threshold-Based Sparse-Vision

dc.contributor.authorIslam, Riadul
dc.contributor.authorTummala, Sri Ranga Sai Krishna
dc.contributor.authorMulé, Joey
dc.contributor.authorKankipati, Rohith
dc.contributor.authorJalapally, Suraj
dc.contributor.authorChallagundla, Dhandeep
dc.contributor.authorHoward, Chad
dc.contributor.authorRobucci, Ryan
dc.date.accessioned2024-11-14T15:18:46Z
dc.date.available2024-11-14T15:18:46Z
dc.date.issued2024-10-01
dc.description.abstractSmart focal-plane and in-chip image processing has emerged as a crucial technology for vision-enabled embedded systems with energy efficiency and privacy. However, the lack of special datasets providing examples of the data that these neuromorphic sensors compute to convey visual information has hindered the adoption of these promising technologies. Neuromorphic imager variants, including event-based sensors, produce various representations such as streams of pixel addresses representing time and locations of intensity changes in the focal plane, temporal-difference data, data sifted/thresholded by temporal differences, image data after applying spatial transformations, optical flow data, and/or statistical representations. To address the critical barrier to entry, we provide an annotated, temporal-threshold-based vision dataset specifically designed for face detection tasks derived from the same videos used for Aff-Wild2. By offering multiple threshold levels (e.g., 4, 8, 12, and 16), this dataset allows for comprehensive evaluation and optimization of state-of-the-art neural architectures under varying conditions and settings compared to traditional methods. The accompanying tool flow for generating event data from raw videos further enhances accessibility and usability. We anticipate that this resource will significantly support the development of robust vision systems based on smart sensors that can process based on temporal-difference thresholds, enabling more accurate and efficient object detection and localization and ultimately promoting the broader adoption of low-power, neuromorphic imaging technologies. To support further research, we publicly released the dataset at \url{https://dx.doi.org/10.21227/bw2e-dj78}.
dc.description.sponsorshipConceptualization, R. Islam and S.R.S.K. Tummala; dataset and analysis, R. Islam, J. Mule, R. Kankipati, R. Robucci, S. ´ Jalapally, and S.R.S.K Tummala; original draft preparation, R. Islam; Review and editing, R. Robucci, C. Howard, and D. Challagundla; funding acquisition, R.Islam and R. Robucci. This research was funded by the National Science Foundation (NSF) under award number 2138253, the Maryland Industrial Partnerships (MIPS) program under award number MIPS0012, and the UMBC Startup grant. Special thanks to Raiyan Zaman and Rishi Mulchandani for assisting in annotating part of the dataset.
dc.description.urihttps://arxiv.org/abs/2410.00368v1
dc.format.extent8 pages
dc.genrejournal articles
dc.genrepreprints
dc.identifierdoi:10.13016/m29ddx-9wic
dc.identifier.urihttps://doi.org/10.48550/arXiv.2410.00368
dc.identifier.urihttp://hdl.handle.net/11603/36960
dc.language.isoen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department
dc.relation.ispartofUMBC Student Collection
dc.relation.ispartofUMBC Faculty Collection
dc.rightsAttribution 4.0 International CC BY 4.0 Deed
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectUMBC Cybersecurity Institute
dc.titleDescriptor: Face Detection Dataset for Programmable Threshold-Based Sparse-Vision
dc.typeText
dcterms.creatorhttps://orcid.org/0000-0002-4649-3467
dcterms.creatorhttps://orcid.org/0000-0001-7491-1710
dcterms.creatorhttps://orcid.org/0009-0002-8522-0872
dcterms.creatorhttps://orcid.org/0009-0003-9737-2782

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2410.00368v1.pdf
Size:
3.58 MB
Format:
Adobe Portable Document Format