
Self-refreshing memory in artificial neural networks: learning temporal sequences without catastrophic forgetting

Pages 71-99 | Published online: 21 Oct 2010

Figures & data

Figure 1. (a) The RFN architecture integrates an autoassociative processing constraint into a standard backpropagation network (large arrows represent full connectivity with modifiable weights). Here the emphasis is on the learning algorithm. The network is shown learning a pattern P: Input → Target. (b) An equivalent visualization of the RFN architecture emphasizing the input–hidden layer reverberation. It is crucial to note that the updating of the hidden-to-input weights depends not only on the autoassociative error between the original input and the reverberated input, but also on the difference between the network's actual output and the target. As above, the network is shown learning a pattern, P: Input → Target.
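
To make the caption's point about the two error signals concrete, here is a minimal numpy sketch of one RFN learning step, in which a single input–hidden–input reverberation is unfolded before the output is read out. Every specific here (layer sizes, sigmoid units, sum-of-squares error, learning rate, a single reverberation cycle) is an illustrative assumption, not the paper's implementation.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

rng = np.random.default_rng(0)
n_in, n_hid, n_out = 8, 16, 8
W_ih = rng.normal(0, 0.5, (n_in, n_hid))   # input -> hidden (reused on the reverberated input)
W_hi = rng.normal(0, 0.5, (n_hid, n_in))   # hidden -> input (reverberation)
W_ho = rng.normal(0, 0.5, (n_hid, n_out))  # hidden -> output
lr = 0.1

def rfn_step(x, target):
    """One learning step on a pattern P: Input -> Target."""
    global W_ih, W_hi, W_ho
    # Forward: reverberate the input once, then read out the output.
    h1 = sigmoid(x @ W_ih)
    x_rev = sigmoid(h1 @ W_hi)    # reverberated input
    h2 = sigmoid(x_rev @ W_ih)    # same input -> hidden weights reused
    y = sigmoid(h2 @ W_ho)        # network's actual output

    # Backward: gradients of 0.5*(||y - target||^2 + ||x_rev - x||^2).
    d_y = (y - target) * y * (1 - y)
    d_h2 = (d_y @ W_ho.T) * h2 * (1 - h2)
    # W_hi lies on the path to the output, so its update combines the
    # backpropagated output error with the autoassociative error
    # (x_rev - x): exactly the dependence stressed in the caption.
    d_xr = (d_h2 @ W_ih.T + (x_rev - x)) * x_rev * (1 - x_rev)
    d_h1 = (d_xr @ W_hi.T) * h1 * (1 - h1)

    W_ho -= lr * np.outer(h2, d_y)
    W_hi -= lr * np.outer(h1, d_xr)
    W_ih -= lr * (np.outer(x_rev, d_h2) + np.outer(x, d_h1))
    return y, x_rev
```

Repeatedly calling rfn_step(x, target) drives both the hetero-associative mapping and the input reconstruction toward their targets.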

Figure 2. A standard SRN designed to learn a sequence S(0), S(1), … , S(t), … , S(n). At each time t, the relation between item S(t) and the associated target item S(t + 1) is learned along with the context H(t − 1), a copy of the hidden layer activation from time t − 1, when the network was learning the previous association S(t − 1) → S(t).
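
The scheme in figure 2 is the classic Elman procedure: because the context is a frozen copy of the previous hidden state, plain backpropagation suffices and no backpropagation through time is needed. The sketch below reuses numpy, sigmoid, rng and lr from the previous block; the layer sizes and the 0.5 "neutral" starting context are assumptions.

```python
n_item, n_hid = 10, 20
W_sh = rng.normal(0, 0.5, (n_item, n_hid))  # item S(t) -> hidden
W_ch = rng.normal(0, 0.5, (n_hid, n_hid))   # context H(t-1) -> hidden
W_hy = rng.normal(0, 0.5, (n_hid, n_item))  # hidden -> predicted S(t+1)

def srn_epoch(seq):
    """One pass through seq (a list of 0/1 item vectors), learning S(t) -> S(t+1)."""
    global W_sh, W_ch, W_hy
    context = np.full(n_hid, 0.5)            # assumed 'neutral' context for S(0)
    for t in range(len(seq) - 1):
        s_t, s_next = seq[t], seq[t + 1]
        h = sigmoid(s_t @ W_sh + context @ W_ch)
        y = sigmoid(h @ W_hy)
        # The context is a frozen copy of H(t-1), so no gradient flows
        # back through time: plain backprop on this one association.
        d_y = (y - s_next) * y * (1 - y)
        d_h = (d_y @ W_hy.T) * h * (1 - h)
        W_hy -= lr * np.outer(h, d_y)
        W_sh -= lr * np.outer(s_t, d_h)
        W_ch -= lr * np.outer(context, d_h)
        context = h.copy()                   # H(t) becomes the context at t+1
```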

Figure 3. A reverberating SRN. This architecture can also be visualized as in figure 1(b) to emphasize the input reverberation between the input and hidden layer.
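
A reverberating SRN can be sketched by giving the SRN above an extra hidden-to-input head that reconstructs the current item, as in figure 1(b). Bundling each network's weights in a dict makes it easy to instantiate the two networks of the dual architecture used later; this is again a sketch under assumed sizes, not the paper's code.

```python
def init_rsrn(n_item=10, n_hid=20, seed=0):
    r = np.random.default_rng(seed)
    return {
        "W_sh": r.normal(0, 0.5, (n_item, n_hid)),  # item S(t) -> hidden
        "W_ch": r.normal(0, 0.5, (n_hid, n_hid)),   # context H(t-1) -> hidden
        "W_hy": r.normal(0, 0.5, (n_hid, n_item)),  # hidden -> predicted S(t+1)
        "W_hs": r.normal(0, 0.5, (n_hid, n_item)),  # hidden -> reverberated S(t)
    }

def rsrn_step(net, s_t, s_next, context, lr=0.1):
    """One training step: predict S(t+1), reconstruct S(t); returns the new context."""
    h = sigmoid(s_t @ net["W_sh"] + context @ net["W_ch"])
    y = sigmoid(h @ net["W_hy"])      # predicted next item
    s_rev = sigmoid(h @ net["W_hs"])  # reverberated copy of the input item
    d_y = (y - s_next) * y * (1 - y)
    d_r = (s_rev - s_t) * s_rev * (1 - s_rev)
    # The hidden error pools the prediction error and the autoassociative error.
    d_h = (d_y @ net["W_hy"].T + d_r @ net["W_hs"].T) * h * (1 - h)
    net["W_hy"] -= lr * np.outer(h, d_y)
    net["W_hs"] -= lr * np.outer(h, d_r)
    net["W_sh"] -= lr * np.outer(s_t, d_h)
    net["W_ch"] -= lr * np.outer(context, d_h)
    return h
```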

Figure 4. (a) Learning of sequence B (after having previously learned sequence A). By 450 epochs (an epoch corresponds to one pass through the entire sequence), sequence B has been completely learned. Note that the two ‘ambiguous’ target items, S(2) and S(6), are harder to learn. (b) The number of incorrect units for sequence A during learning of sequence B. After 450 epochs, the SRN has, for all intents and purposes, completely forgotten the previously learned sequence A. (For readability, learning epochs in the second graph increase from left to right, in the direction of the arrow.)

Figure 5. Recall performance for the first sequence A once the second sequence B has been completely learned in an SRN, with (a) and without (b) refreshing by pseudo-sequences. Whatever the learning criterion, it is clear that refreshing by pseudo-sequences in no way reduces catastrophic forgetting.

Figure 6. Recall performance for sequences B and A during learning of sequence B by a dual-network RSRN. (a) By 400 epochs, the second sequence B has been completely learned. Note that it is more difficult to learn the two ‘ambiguous’ target items. (b) The previously learned sequence A shows virtually no forgetting. Catastrophic forgetting of the previously learned sequence A has been completely overcome.
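
One way the dual-network scheme of figure 6 might be sketched on top of the RSRN step above: net1 learns the new sequence while, interleaved with it, it rehearses pseudo-sequences that net2 (the network embodying the old knowledge) generates from random seed inputs. The reverberation count, the random 0/1 seeds, the neutral context, and the 1:1 mixing ratio are all assumptions rather than the paper's settings.

```python
def rsrn_epoch(net, seq, n_hid=20):
    """One pass of rsrn_step over an item sequence, from a neutral context."""
    context = np.full(n_hid, 0.5)
    for t in range(len(seq) - 1):
        context = rsrn_step(net, seq[t], seq[t + 1], context)

def generate_pseudo_sequence(net, length, n_item=10, n_hid=20, n_reverb=5):
    """Draw a pseudo-sequence from a network's current weights."""
    item = (rng.random(n_item) > 0.5).astype(float)  # random seed input
    for _ in range(n_reverb):
        # Input <-> hidden reverberation to settle the seed item
        # (context input ignored during seeding, a simplification).
        item = sigmoid(sigmoid(item @ net["W_sh"]) @ net["W_hs"])
    context = np.full(n_hid, 0.5)
    seq = [item]
    for _ in range(length):                          # let the network unroll a sequence
        h = sigmoid(item @ net["W_sh"] + context @ net["W_ch"])
        item = sigmoid(h @ net["W_hy"])
        context = h
        seq.append(item)
    return seq

def learn_with_refresh(net1, net2, new_seq, n_epochs=400):
    """net1 learns new_seq while rehearsing pseudo-sequences drawn from net2."""
    for _ in range(n_epochs):
        rsrn_epoch(net1, new_seq)
        rsrn_epoch(net1, generate_pseudo_sequence(net2, len(new_seq) - 1))
```

On this reading, once net1 has learned the new sequence, the transfer presumably runs the other way as well: net2 is retrained on pseudo-sequences drawn from net1, so the second network consolidates old and new knowledge together.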

Figure 7. Recall performance for the whole sequence in the course of learning its second sub-sequence D within a dual-network RSRN architecture. By 300 epochs, the second sub-sequence D has been completely learned. The previously learned sub-sequence C shows no forgetting, and the whole sequence of 20 ordered items can be perfectly reproduced starting only from the initializing item S(0) and the neutral context. The two separately learned sub-sequences C and D were correctly linked.

Table 1. Forgetting of sequence A after complete learning of sequence B using different self-refreshing procedures.

Figure 8. Recall performance for the previously learned SOC1 sequence during learning of a second SOC2 sequence (completely learned by 450 epochs). The two SOCs are made up of 13 items and, as in previous simulations, the item in position 0 is not shown because it is used only to initialize sequence learning and recall. (a) Without self-refreshing, catastrophic forgetting is severe. (b) With self-refreshing, the previously learned SOC1 sequence does not show any catastrophic forgetting during SOC2 learning.

Figure 9. Recall performance of the new sequence and of the previously learned sequences during learning of the new sequence. Vertical bars denote standard errors.

Figure 10. Recall performance, with and without the self-refreshing mechanism at work, of the previously learned sequences during learning of the new sequence (which is completely learned after 150 presentations). Without refreshing, there is clearly catastrophic forgetting of the previously learned sequences. With refreshing, however, the learning curve exhibits, as in humans, an initial drop and subsequent rise in recall performance.
