L³ Ensembles: Lifelong Learning Approach for Ensemble of Foundational Language Models

dc.contributor.authorShiri, Aidin
dc.contributor.authorRoy, Kaushik
dc.contributor.authorSheth, Amit
dc.contributor.authorGaur, Manas
dc.date.accessioned2023-11-30T19:41:04Z
dc.date.available2023-11-30T19:41:04Z
dc.date.issued2023-11-11
dc.description.abstractFine-tuning pre-trained foundational language models (FLM) for specific tasks is often impractical, especially for resource-constrained devices. This necessitates the development of a Lifelong Learning (L3) framework that continuously adapts to a stream of Natural Language Processing (NLP) tasks efficiently. We propose an approach that focuses on extracting meaningful representations from unseen data, constructing a structured knowledge base, and improving task performance incrementally. We conducted experiments on various NLP tasks to validate its effectiveness, including benchmarks like GLUE and SuperGLUE. We measured good performance across the accuracy, training efficiency, and knowledge transfer metrics. Initial experimental results show that the proposed L3 ensemble method increases the model accuracy by 4% ~ 36% compared to the fine-tuned FLM. Furthermore, L3 model outperforms naive fine-tuning approaches while maintaining competitive or superior performance (up to 15.4% increase in accuracy) compared to the state-of-the-art language model (T5) for the given task, STS benchmark.
dc.description.urihttps://arxiv.org/abs/2311.06493
dc.format.extent2 pages
dc.genrejournal articles
dc.genrepreprints
dc.identifier.urihttps://doi.org/10.48550/arXiv.2311.06493
dc.identifier.urihttp://hdl.handle.net/11603/30960
dc.language.isoen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Student Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.rightsCC BY-SA 4.0 DEED Attribution-ShareAlike 4.0 International en
dc.rights.urihttps://creativecommons.org/licenses/by-sa/4.0/
dc.titleL³ Ensembles: Lifelong Learning Approach for Ensemble of Foundational Language Models
dc.typeText
dcterms.creatorhttps://orcid.org/0000-0001-5402-0988
dcterms.creatorhttps://orcid.org/0000-0002-5411-2230

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2311.06493.pdf
Size:
550.31 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: