An Energy Efficient EdgeAI Autoencoder Accelerator for Reinforcement Learning

Manjunath, Nitheesh Kumar; Shiri, Aidin; Hosseini, Morteza; Prakash, Bharat; Waytowich, Nicholas R.; Mohsenin, Tinoosh

An Energy Efficient EdgeAI Autoencoder Accelerator for Reinforcement Learning

dc.contributor.author	Manjunath, Nitheesh Kumar
dc.contributor.author	Shiri, Aidin
dc.contributor.author	Hosseini, Morteza
dc.contributor.author	Prakash, Bharat
dc.contributor.author	Waytowich, Nicholas R.
dc.contributor.author	Mohsenin, Tinoosh
dc.date.accessioned	2021-02-17T18:03:53Z
dc.date.available	2021-02-17T18:03:53Z
dc.date.issued	2021-01-25
dc.description.abstract	In EdgeAI embedded devices that exploit reinforcement learning (RL), it is essential to reduce the number of actions taken by the agent in the real world and minimize the compute-intensive policies learning process. Convolutional autoencoders (AEs) has demonstrated great improvement for speeding up the policy learning time when attached to the RL agent, by compressing the high dimensional input data into a small latent representation for feeding the RL agent. Despite reducing the policy learning time, AE adds a significant computational and memory complexity to the model which contributes to the increase in the total computation and the model size. In this article, we propose a model for speeding up the policy learning process of RL agent with the use of AE neural networks, which engages binary and ternary precision to address the high complexity overhead without deteriorating the policy that an RL agent learns. Binary Neural Networks (BNNs) and Ternary Neural Networks (TNNs) compress weights into 1 and 2 bits representations, which result in significant compression of the model size and memory as well as simplifying multiply-accumulate (MAC) operations. We evaluate the performance of our model in three RL environments including DonkeyCar, Miniworld sidewalk, and Miniworld Object Pickup, which emulate various real-world applications with different levels of complexity. With proper hyperparameter optimization and architecture exploration, TNN models achieve near the same average reward, Peak Signal to Noise Ratio (PSNR) and Mean Squared Error (MSE) performance as the full-precision model while reducing the model size by $10\times $ compared to full-precision and $3\times $ compared to BNNs. However, in BNN models the average reward drops up to 12% – 25% compared to the full-precision even after increasing its model size by $4\times $ . We designed and implemented a scalable hardware accelerator which is configurable in terms of the number of processing elements (PEs) and memory data width to achieve the best power, performance, and energy efficiency trade-off for EdgeAI embedded devices. The proposed hardware implemented on Artix-7 FPGA dissipates $250~\mu \text{J}$ energy while meeting 30 frames per second (FPS) throughput requirements. The hardware is configurable to reach an efficiency of over 1 TOP/J on FPGA implementation. The proposed hardware accelerator is synthesized and placed-and-routed in 14 nm FinFET ASIC technology which brings down the power dissipation to $3.9~\mu \text{J}$ and maximum throughput of 1,250 FPS. Compared to the state of the art TNN implementations on the same target platform, our hardware is $5\times $ and $4.4\times $ ( $2.2\times $ if technology scaled) more energy efficient on FPGA and ASIC, respectively.	en
dc.description.sponsorship	This work was supported by the U.S. Army Research Laboratory through Cooperative Agreement under Grant W911NF-10-2-0022.
dc.description.uri	https://ieeexplore.ieee.org/document/9335309	en
dc.format.extent	14 pages	en
dc.genre	journal articles	en
dc.identifier	doi:10.13016/m2xm9a-tgo3
dc.identifier.citation	Manjunath, Nitheesh Kumar; Shiri, Aidin; Hosseini, Morteza; Prakash, Bharat; Waytowich, Nicholas R.; Mohsenin, Tinoosh; An Energy Efficient EdgeAI Autoencoder Accelerator for Reinforcement Learning; IEEE Open Journal of Circuits and Systems ( Volume: 2) (2021); https://ieeexplore.ieee.org/document/9335309	en
dc.identifier.uri	https://doi.org/10.1109/OJCAS.2020.3043737
dc.identifier.uri	http://hdl.handle.net/11603/21047
dc.language.iso	en	en
dc.publisher	IEEE	en
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartof	UMBC Faculty Collection
dc.relation.ispartof	UMBC Student Collection
dc.rights	Public Domain Mark 1.0	*
dc.rights	This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.rights	This work was written as part of one of the author's official duties as an Employee of the United States Government and is therefore a work of the United States Government. In accordance with 17 U.S.C. 105, no copyright protection is available for such works under U.S. Law.
dc.rights.uri	http://creativecommons.org/publicdomain/mark/1.0/	*
dc.subject	UMBC Energy Efficient High Performance Computing Lab
dc.title	An Energy Efficient EdgeAI Autoencoder Accelerator for Reinforcement Learning	en
dc.type	Text	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 09335309.pdf
Size:: 2.65 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 2.56 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

UMBC Computer Science and Electrical Engineering Department
UMBC Faculty Collection
UMBC Student Collection