Long short-term memory - PubMed (original) (raw)
Long short-term memory
S Hochreiter et al. Neural Comput. 1997.
Abstract
Learning to store information over extended time intervals by recurrent backpropagation takes a very long time, mostly because of insufficient, decaying error backflow. We briefly review Hochreiter's (1991) analysis of this problem, then address it by introducing a novel, efficient, gradient-based method called long short-term memory (LSTM). Truncating the gradient where this does not do harm, LSTM can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units. Multiplicative gate units learn to open and close access to the constant error flow. LSTM is local in space and time; its computational complexity per time step and weight is O(1). Our experiments with artificial data involve local, distributed, real-valued, and noisy pattern representations. In comparisons with real-time recurrent learning, back propagation through time, recurrent cascade correlation, Elman nets, and neural sequence chunking, LSTM leads to many more successful runs, and learns much faster. LSTM also solves complex, artificial long-time-lag tasks that have never been solved by previous recurrent network algorithms.
Similar articles
- Learning to forget: continual prediction with LSTM.
Gers FA, Schmidhuber J, Cummins F. Gers FA, et al. Neural Comput. 2000 Oct;12(10):2451-71. doi: 10.1162/089976600300015015. Neural Comput. 2000. PMID: 11032042 - Framewise phoneme classification with bidirectional LSTM and other neural network architectures.
Graves A, Schmidhuber J. Graves A, et al. Neural Netw. 2005 Jun-Jul;18(5-6):602-10. doi: 10.1016/j.neunet.2005.06.042. Neural Netw. 2005. PMID: 16112549 - Kalman filters improve LSTM network performance in problems unsolvable by traditional recurrent nets.
Pérez-Ortiz JA, Gers FA, Eck D, Schmidhuber J. Pérez-Ortiz JA, et al. Neural Netw. 2003 Mar;16(2):241-50. doi: 10.1016/S0893-6080(02)00219-8. Neural Netw. 2003. PMID: 12628609 - A Review of Recurrent Neural Networks: LSTM Cells and Network Architectures.
Yu Y, Si X, Hu C, Zhang J. Yu Y, et al. Neural Comput. 2019 Jul;31(7):1235-1270. doi: 10.1162/neco_a_01199. Epub 2019 May 21. Neural Comput. 2019. PMID: 31113301 Review. - Working models of working memory.
Barak O, Tsodyks M. Barak O, et al. Curr Opin Neurobiol. 2014 Apr;25:20-4. doi: 10.1016/j.conb.2013.10.008. Epub 2013 Dec 4. Curr Opin Neurobiol. 2014. PMID: 24709596 Review.
Cited by
- Prediction of power conversion efficiency parameter of inverted organic solar cells using artificial intelligence techniques.
Marzouglal M, Souahlia A, Bessissa L, Mahi D, Rabehi A, Alharthi YZ, Bojer AK, Flah A, Alharthi MM, Ghoneim SSM. Marzouglal M, et al. Sci Rep. 2024 Oct 29;14(1):25931. doi: 10.1038/s41598-024-77112-3. Sci Rep. 2024. PMID: 39472726 Free PMC article. - Learning and diSentangling patient static information from time-series Electronic hEalth Records (STEER).
Liao W, Voldman J. Liao W, et al. PLOS Digit Health. 2024 Oct 21;3(10):e0000640. doi: 10.1371/journal.pdig.0000640. eCollection 2024 Oct. PLOS Digit Health. 2024. PMID: 39432484 Free PMC article. - Exploiting Temporal Features in Calculating Automated Morphological Properties of Spiky Nanoparticles Using Deep Learning.
Rafique MA. Rafique MA. Sensors (Basel). 2024 Oct 10;24(20):6541. doi: 10.3390/s24206541. Sensors (Basel). 2024. PMID: 39460021 Free PMC article. - From rumor to genetic mutation detection with explanations: a GAN approach.
Cheng M, Li Y, Nazarian S, Bogdan P. Cheng M, et al. Sci Rep. 2021 Mar 12;11(1):5861. doi: 10.1038/s41598-021-84993-1. Sci Rep. 2021. PMID: 33712675 Free PMC article. - Intelligent Health Care: Applications of Deep Learning in Computational Medicine.
Yang S, Zhu F, Ling X, Liu Q, Zhao P. Yang S, et al. Front Genet. 2021 Apr 12;12:607471. doi: 10.3389/fgene.2021.607471. eCollection 2021. Front Genet. 2021. PMID: 33912213 Free PMC article. Review.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical