Reg-Tune: A Regression-focused Fine-tuning Technique for Energy-efficient Embedded Deployment
dc.contributor.advisor | Younis, Dr. Mohamed | |
dc.contributor.advisor | Mohsenin, Dr. Tinoosh | |
dc.contributor.author | Mazumder, Arnab Neelim | |
dc.contributor.department | Computer Science and Electrical Engineering | |
dc.contributor.program | Engineering, Computer | |
dc.date.accessioned | 2025-02-13T15:35:06Z | |
dc.date.available | 2025-02-13T15:35:06Z | |
dc.date.issued | 2024-01-01 | |
dc.description.abstract | Fine-tuning deep neural networks (DNNs) is essential for developing inference modules suitable for deployment on edge or FPGA (Field Programmable Gate Arrays) platforms. Traditionally, various parameters across DNN layers have been explored using grid search, neural architecture search (NAS), and other brute-force techniques. While these methods yield optimal network parameters, the search process can be time-consuming and may not account for the deployment constraints of the target platform. The dissertation addresses this issue by formulating hardware-aware regression polynomials for the energy-efficient deployment of DNN models through Reg-Tune. A general formulation is provided to profile different metrics across various device platforms for both single and multi-objective solutions. The objective is to ascertain hardware friendly configurations based on the experience span for a limited set of configurations derived from a combination of variables in terms of accuracy, power, and latency. Once the regression fits are established for the metrics of interest, single or multi-objective contours related to these metrics can be created to analytically and experimentally identify the near-optimal solution for deployment. The dissertation also demonstrates that introducing LIME (Local Model Agnostic Explanations)-based sample weights into the training framework or employing metric learning in conjunction with the Reg-Tune method can enhance the inference accuracy of deployed models without altering network parameters. Furthermore, the LIME-based weighting method is shown to be effective for handling unbalanced datasets and mitigating catastrophic forgetting in incremental learning. Additionally, different metrics are profiled across two different device platforms and for a variety of applications, thereby increasing the applicability of the method. For instance, deployments based on the fine-tuning method for physical activity recognition on FPGA demonstrate at least 5.7x better energy efficiency than recent implementations without compromising accuracy. On the other hand, when combined with weighted loss-based training, the Reg-Tune approach identifies a deployment configuration with approximately 8x better energy efficiency than recent keyword spotting implementations on FPGA. Finally, the integration of metric learning with Reg-Tune results in a 14.5x and 101.8x reduction in model size, alongside 2.5x and 5.9x improvements in inference efficiency for keyword spotting and image classification on the Nvidia Jetson Nano 4GB SDK, compared to baseline and recent implementations, respectively. | |
dc.format | application:pdf | |
dc.genre | dissertation | |
dc.identifier | doi:10.13016/m2hurh-higs | |
dc.identifier.other | 12958 | |
dc.identifier.uri | http://hdl.handle.net/11603/37646 | |
dc.language | en | |
dc.relation.isAvailableAt | The University of Maryland, Baltimore County (UMBC) | |
dc.relation.ispartof | UMBC Computer Science and Electrical Engineering Department Collection | |
dc.relation.ispartof | UMBC Theses and Dissertations Collection | |
dc.relation.ispartof | UMBC Graduate School Collection | |
dc.relation.ispartof | UMBC Student Collection | |
dc.rights | This item may be protected under Title 17 of the U.S. Copyright Law. It is made available by UMBC for non-commercial research and education. For permission to publish or reproduce, please see http://aok.lib.umbc.edu/specoll/repro.php or contact Special Collections at speccoll(at)umbc.edu or contact Special Collections at speccoll(at)umbc.edu | |
dc.source | Original File Name: Mazumder_umbc_0434D_12958.pdf | |
dc.subject | Energy Efficient Hardware Deployment | |
dc.subject | Explainable AI | |
dc.subject | Incremental Learning | |
dc.subject | Metric Learning | |
dc.subject | Performance Profiling of Deep Neural Networks | |
dc.subject | Semantic Segmentation | |
dc.title | Reg-Tune: A Regression-focused Fine-tuning Technique for Energy-efficient Embedded Deployment | |
dc.type | Text | |
dcterms.accessRights | Distribution Rights granted to UMBC by the author. |