Kernearl-Based Lifelong Policy Gradient Reinforcement Learning

dc.contributor.authorMowakeaa, Rami
dc.contributor.authorKim, Seung-Jun
dc.contributor.authorEmge, Darren K.
dc.date.accessioned2021-06-04T18:41:37Z
dc.date.available2021-06-04T18:41:37Z
dc.date.issued2021-05-13
dc.descriptionICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)en_US
dc.description.abstractPolicy gradient methods have been widely used in reinforcement learning (RL), especially thanks to their facility to handle continuous state spaces, strong convergence guarantees, and low-complexity updates. Training of the methods for individual tasks, however, can still be taxing in terms of the learning speed and the sample trajectory collection. Lifelong learning aims to exploit the intrinsic structure shared among a suite of RL tasks, akin to multitask learning, but in an efficient online fashion. In this work, we propose a lifelong RL algorithm based on the kernel method to leverage nonlinear features of the data based on a popular union-of-subspace model. Experimental results on a set of simple related tasks verify the advantage of the proposed strategy, compared to the single-task and the parametric counterparts.en_US
dc.description.urihttps://ieeexplore.ieee.org/abstract/document/9414511en_US
dc.format.extent5 pagesen_US
dc.genreconference papers and proceedingsen_US
dc.identifierdoi:10.13016/m26bzu-l5ow
dc.identifier.citationMowakeaa, Rami; Kim, Seung-Jun; Emge, Darren K.; Kernearl-Based Lifelong Policy Gradient Reinforcement Learning; ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 13, 2021; https://doi.org/10.1109/ICASSP39728.2021.9414511en_US
dc.identifier.urihttps://doi.org/10.1109/ICASSP39728.2021.9414511
dc.identifier.urihttp://hdl.handle.net/11603/21679
dc.language.isoen_USen_US
dc.publisherIEEEen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Student Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.rightsPublic Domain Mark 1.0*
dc.rightsThis work was written as part of one of the author's official duties as an Employee of the United States Government and is therefore a work of the United States Government. In accordance with 17 U.S.C. 105, no copyright protection is available for such works under U.S. Law.
dc.rights.urihttp://creativecommons.org/publicdomain/mark/1.0/*
dc.titleKernearl-Based Lifelong Policy Gradient Reinforcement Learningen_US
dc.typeTexten_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
09414511.pdf
Size:
861.64 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: