On Finer Control of Information Flow in LSTMs

dc.contributor.authorGao, Hang
dc.contributor.authorOates, Tim
dc.date.accessioned2021-04-07T17:45:33Z
dc.date.available2021-04-07T17:45:33Z
dc.date.issued2019-01-18
dc.descriptionECML PKDD 2018: Machine Learning and Knowledge Discovery in Databases pp 527-540, Lecture Notes in Computer Science, vol 11051.en_US
dc.description.abstractSince its inception in 1995, the Long Short-Term Memory (LSTM) architecture for recurrent neural networks has shown promising performance, sometimes state-of-art, for various tasks. Aiming at achieving constant error flow through hidden units, LSTM introduces a complex unit called a memory cell, in which gates are adopted to control the exposure/isolation of information flowing in, out and back to itself. Despite its widely acknowledged success, in this paper, we propose a hypothesis that LSTMs may suffer from an implicit functional binding of information exposure/isolation for the output and candidate computation, i.e., the output gate at time t−1 is not only in charge of the information flowing out of a cell as the response to the external environment, but also controls the information flowing back to the cell for the candidate computation, which is often the only source of nonlinear combination of input at time t and previous cell state at time t−1 for cell memory updates. We propose Untied Long Short Term Memory (ULSTM) as a solution to the above problem. We test our model on various tasks, including semantic relatedness prediction, language modeling and sentiment classification. Experimental results indicate that our proposed model is capable to at least partially solve the problem and outperform LSTM for all these tasks.en_US
dc.description.urihttps://link.springer.com/chapter/10.1007/978-3-030-10925-7_32en_US
dc.format.extent14 pagesen_US
dc.genreconference papers and proceedings preprintsen_US
dc.identifierdoi:10.13016/m2oevo-q5fs
dc.identifier.citationGao H., Oates T. (2019) On Finer Control of Information Flow in LSTMs. In: Berlingerio M., Bonchi F., Gärtner T., Hurley N., Ifrim G. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2018. Lecture Notes in Computer Science, vol 11051. Springer, Cham. https://doi.org/10.1007/978-3-030-10925-7_32en_US
dc.identifier.urihttps://doi.org/10.1007/978-3-030-10925-7_32
dc.identifier.urihttp://hdl.handle.net/11603/21292
dc.language.isoen_USen_US
dc.publisherSpringer Natureen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Student Collection
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Faculty Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.rights© Springer Nature Switzerland AG 2019
dc.subjectUMBC High Performance Computing Facility (HPCF)
dc.titleOn Finer Control of Information Flow in LSTMsen_US
dc.typeTexten_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
594.pdf
Size:
357.52 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: