Reg-TuneV2: Hardware-Aware and Multi-Objective Regression-Based Fine-Tuning Approach for DNNs on Embedded Platforms

A. N. Mazumder and T. Mohsenin, "Reg-TuneV2: Hardware-Aware and Multi-Objective Regression-Based Fine-Tuning Approach for DNNs on Embedded Platforms," in IEEE Micro, doi: 10.1109/MM.2023.3316433.

Rights

© 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Abstract

Fine-tuning Deep Neural Networks (DNNs) for deployment has traditionally relied on computationally intensive methods such as grid search and neural architecture search (NAS), which may not consider hardware-aware metrics. Moreover, it is essential to consider multiple objectives to develop a range of solutions for tinyML hardware deployment with real-time latency and low power constraints. To address these problems, we propose Reg-TuneV2, a systematic approach to fine-tune DNNs for hardware deployment by considering multiple objectives, including accuracy, power, and latency contours. In addition, this approach uses metric learning to achieve smaller and better-suited configurations for deployment, achieving 90.5% accuracy with only 340 KB of memory for keyword spotting on FPGA. When compared to baselines for keyword spotting and image classification on the Nvidia Jetson Nano 4 GB SDK, the proposed method achieves a 14.5× and 101.8× reduction in model size coupled with 2.5× and 5.9× better inference efficiency, respectively.

Reg-TuneV2: Hardware-Aware and Multi-Objective Regression-Based Fine-Tuning Approach for DNNs on Embedded Platforms

Files

Links to Files

Permanent Link

Collections

Author/Creator

Author/Creator ORCID

Date

Type of Work

Department

Program

Citation of Original Publication

Rights

Subjects

Abstract