TinyM²Net: A Flexible System Algorithm Co-designed Multimodal Learning Framework for Tiny Devices

dc.contributor.authorRashid, Hasib-Al
dc.contributor.authorOvi, Pretom Roy
dc.contributor.authorGangopadhyay, Aryya
dc.contributor.authorMohsenin, Tinoosh
dc.date.accessioned2022-03-15T16:23:28Z
dc.date.available2022-03-15T16:23:28Z
dc.date.issued2022-02-09
dc.descriptiontinyML Research Symposium’22, March 2022, San Jose, CA
dc.description.abstractWith the emergence of Artificial Intelligence (AI), new attention has been given to implement AI algorithms on resource constrained tiny devices to expand the application domain of IoT. Multimodal Learning has recently become very popular with the classification task due to its impressive performance for both image and audio event classification. This paper presents TinyM²Net -- a flexible system algorithm co-designed multimodal learning framework for resource constrained tiny devices. The framework was designed to be evaluated on two different case-studies: COVID-19 detection from multimodal audio recordings and battle field object detection from multimodal images and audios. In order to compress the model to implement on tiny devices, substantial network architecture optimization and mixed precision quantization were performed (mixed 8-bit and 4-bit). TinyM²Net shows that even a tiny multimodal learning model can improve the classification performance than that of any unimodal frameworks. The most compressed TinyM²Net achieves 88.4% COVID-19 detection accuracy (14.5% improvement from unimodal base model) and 96.8\% battle field object detection accuracy (3.9% improvement from unimodal base model). Finally, we test our TinyM²Net models on a Raspberry Pi 4 to see how they perform when deployed to a resource constrained tiny device.en
dc.description.sponsorshipWe acknowledge the support of the U.S. Army Grant No. W911NF21- 20076. We also acknowledge the support of the University of Maryland, Baltimore, Institute for Clinical and Translational Research (ICTR) and the National Center for Advancing Translational Sciences (NCATS) Clinical Translational Science Award (CTSA) grant number UL1TR003098en
dc.description.urihttps://www.youtube.com/watch?v=qD8mxEhFBcI
dc.description.urihttps://arxiv.org/abs/2202.04303en
dc.format.extent7 pagesen
dc.genrevideo recordings
dc.genreconference papers and proceedingsen
dc.genrepreprintsen
dc.identifierdoi:10.13016/m2k9xf-fqm5
dc.identifier.urihttps://doi.org/10.48550/arXiv.2202.04303
dc.identifier.urihttp://hdl.handle.net/11603/24393
dc.language.isoenen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Student Collection
dc.relation.ispartofUMBC Information Systems Department
dc.rightsAttribution 4.0 International (CC BY 4.0)
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.en
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/*
dc.titleTinyM²Net: A Flexible System Algorithm Co-designed Multimodal Learning Framework for Tiny Devicesen
dc.typeTexten
dcterms.creatorhttps://orcid.org/0000-0002-9983-6929en

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2202.04303.pdf
Size:
8.03 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: