QuadraNet: Improving High-Order Neural Interaction Efficiency with Hardware-Aware Quadratic Neural Networks

dc.contributor.authorXu, Chenhui
dc.contributor.authorYu, Fuxun
dc.contributor.authorXu, Zirui
dc.contributor.authorLiu, Chenchen
dc.contributor.authorXiong, Jinjun
dc.contributor.authorChen, Xiang
dc.date.accessioned2023-12-12T17:04:40Z
dc.date.available2023-12-12T17:04:40Z
dc.date.issued2023-11-29
dc.description.abstractRecent progress in computer vision-oriented neural network designs is mostly driven by capturing high-order neural interactions among inputs and features. And there emerged a variety of approaches to accomplish this, such as Transformers and its variants. However, these interactions generate a large amount of intermediate state and/or strong data dependency, leading to considerable memory consumption and computing cost, and therefore compromising the overall runtime performance. To address this challenge, we rethink the high-order interactive neural network design with a quadratic computing approach. Specifically, we propose QuadraNet -- a comprehensive model design methodology from neuron reconstruction to structural block and eventually to the overall neural network implementation. Leveraging quadratic neurons' intrinsic high-order advantages and dedicated computation optimization schemes, QuadraNet could effectively achieve optimal cognition and computation performance. Incorporating state-of-the-art hardware-aware neural architecture search and system integration techniques, QuadraNet could also be well generalized in different hardware constraint settings and deployment scenarios. The experiment shows that QuadraNet achieves up to 1.5× throughput, 30% less memory footprint, and similar cognition performance, compared with the state-of-the-art high-order approaches.
dc.description.urihttps://arxiv.org/abs/2311.17956
dc.format.extent7 pages
dc.genrejournal article
dc.genrepreprints
dc.identifier.urihttps://doi.org/10.48550/arXiv.2311.17956
dc.identifier.urihttp://hdl.handle.net/11603/31042
dc.language.isoen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.rightsCC BY 4.0 DEED Attribution 4.0 Internationalen
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.titleQuadraNet: Improving High-Order Neural Interaction Efficiency with Hardware-Aware Quadratic Neural Networks
dc.typeText
dcterms.creatorhttps://orcid.org/0000-0001-7749-0640

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2311.17956.pdf
Size:
853.98 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: