Streaming Propagation Through Time: A New Computational Paradigm for Recurrent Neural Networks

dc.contributor.authorWang, Huachuan
dc.contributor.authorXia, Weihao
dc.contributor.authorGuan, Yunpeng
dc.contributor.authorWang, Yuanhao
dc.contributor.authorKe, Chaoyi
dc.contributor.authorYan, Enshuo
dc.contributor.authorWang, Ping
dc.contributor.authorQiu, Chen
dc.contributor.authorZheng, Xiangping
dc.contributor.authorYao, Yuan
dc.contributor.authorBi, Yuanfei
dc.contributor.authorLo, James Ting-Ho
dc.date.accessioned2025-11-21T00:29:39Z
dc.date.issued2025-10-17
dc.description.abstractRecurrent Neural Networks (RNNs) are foundational to numerous advances in artificial intelligence, yet their training has for decades predominantly relies on Backpropagation Through Time (BPTT), a paradigm that struggles with substantial computational and memory demands for long sequences. The inherently batch-oriented nature of BPTT further constrains RNNs’ ability to learn from streaming or online data. Earlier efforts toward online RNN training have been hindered by prohibitive costs in both computation and memory. Here, we introduce Streaming Propagation Through Time (SPTT), a new computational paradigm for RNN training. SPTT employs a streaming low-rank matrix decomposition to decouple gradient computation into two independent components: an optimization direction and an update magnitude. This incremental exploration of the gradient landscape enables efficient long-sequence processing while maintaining learning continuity.Across diverse sequence modeling benchmarks, SPTT outperforms BPTT, with stronger generalization and improved computational efficiency, thereby opening new possibilities for real-time and resource-constrained RNN applications.
dc.description.urihttps://www.researchsquare.com/article/rs-7583281/v1
dc.format.extent32 pages
dc.genrejournal articles
dc.genrepreprints
dc.identifierdoi:10.13016/m2bima-ly2a
dc.identifier.urihttps://doi.org/10.21203/rs.3.rs-7583281/v1
dc.identifier.urihttp://hdl.handle.net/11603/40773
dc.language.isoen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Mathematics and Statistics Department
dc.rightsAttribution 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectUMBC High Performance Computing Facility (HPCF)
dc.titleStreaming Propagation Through Time: A New Computational Paradigm for Recurrent Neural Networks
dc.typeText

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
v1coveredaa6291d350674525a1c96f91d4053c20.pdf
Size:
4.38 MB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
e6114ab9f3c602530d576639.pdf
Size:
2.37 MB
Format:
Adobe Portable Document Format