TinyM²Net: A Flexible System Algorithm Co-designed Multimodal Learning Framework for Tiny Devices

dc.contributor.authorRashid, Hasib-Al
dc.contributor.authorOvi, Pretom Roy
dc.contributor.authorGangopadhyay, Aryya
dc.contributor.authorMohsenin, Tinoosh
dc.date.accessioned2022-03-15T16:23:28Z
dc.date.available2022-03-15T16:23:28Z
dc.date.issued2022-02-09
dc.descriptiontinyML Research Symposium’22, March 2022, San Jose, CA
dc.description.abstractWith the emergence of Artificial Intelligence (AI), new attention has been given to implement AI algorithms on resource constrained tiny devices to expand the application domain of IoT. Multimodal Learning has recently become very popular with the classification task due to its impressive performance for both image and audio event classification. This paper presents TinyM²Net -- a flexible system algorithm co-designed multimodal learning framework for resource constrained tiny devices. The framework was designed to be evaluated on two different case-studies: COVID-19 detection from multimodal audio recordings and battle field object detection from multimodal images and audios. In order to compress the model to implement on tiny devices, substantial network architecture optimization and mixed precision quantization were performed (mixed 8-bit and 4-bit). TinyM²Net shows that even a tiny multimodal learning model can improve the classification performance than that of any unimodal frameworks. The most compressed TinyM²Net achieves 88.4% COVID-19 detection accuracy (14.5% improvement from unimodal base model) and 96.8\% battle field object detection accuracy (3.9% improvement from unimodal base model). Finally, we test our TinyM²Net models on a Raspberry Pi 4 to see how they perform when deployed to a resource constrained tiny device.en_US
dc.description.sponsorshipWe acknowledge the support of the U.S. Army Grant No. W911NF21- 20076. We also acknowledge the support of the University of Maryland, Baltimore, Institute for Clinical and Translational Research (ICTR) and the National Center for Advancing Translational Sciences (NCATS) Clinical Translational Science Award (CTSA) grant number UL1TR003098en_US
dc.description.urihttps://arxiv.org/abs/2202.04303en_US
dc.description.urihttps://www.youtube.com/watch?v=qD8mxEhFBcI
dc.format.extent7 pagesen_US
dc.genreconference papers and proceedingsen_US
dc.genrepreprintsen_US
dc.genrevideo recordings
dc.identifierdoi:10.13016/m2k9xf-fqm5
dc.identifier.urihttps://doi.org/10.48550/arXiv.2202.04303
dc.identifier.urihttp://hdl.handle.net/11603/24393
dc.language.isoen_USen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Student Collection
dc.relation.ispartofUMBC Information Systems Department
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.en_US
dc.rightsAttribution 4.0 International (CC BY 4.0)
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/*
dc.titleTinyM²Net: A Flexible System Algorithm Co-designed Multimodal Learning Framework for Tiny Devicesen_US
dc.typeTexten_US
dcterms.creatorhttps://orcid.org/0000-0002-9983-6929en_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2202.04303.pdf
Size:
8.03 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: