Reservoir computing

From HandWiki
Revision as of 19:45, 6 February 2024 by BotanyGa (talk | contribs) (fixing)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Short description: Type of recurrent neural network with random and non-trainable internal structure

Reservoir computing is a framework for computation derived from recurrent neural network theory that maps input signals into higher dimensional computational spaces through the dynamics of a fixed, non-linear system called a reservoir.[1] After the input signal is fed into the reservoir, which is treated as a "black box," a simple readout mechanism is trained to read the state of the reservoir and map it to the desired output.[1] The first key benefit of this framework is that training is performed only at the readout stage, as the reservoir dynamics are fixed.[1] The second is that the computational power of naturally available systems, both classical and quantum mechanical, can be used to reduce the effective computational cost.[2]

History

The concept of reservoir computing stems from the use of recursive connections within neural networks to create a complex dynamical system.[3] It is a generalisation of earlier neural network architectures such as recurrent neural networks, liquid-state machines and echo-state networks. Reservoir computing also extends to physical systems that are not networks in the classical sense, but rather continuous systems in space and/or time: e.g. a literal "bucket of water" can serve as a reservoir that performs computations on inputs given as perturbations of the surface.[4] The resultant complexity of such recurrent neural networks was found to be useful in solving a variety of problems including language processing and dynamic system modeling.[3] However, training of recurrent neural networks is challenging and computationally expensive.[3] Reservoir computing reduces those training-related challenges by fixing the dynamics of the reservoir and only training the linear output layer.[3]

A large variety of nonlinear dynamical systems can serve as a reservoir that performs computations. In recent years semiconductor lasers have attracted considerable interest as computation can be fast and energy efficient compared to electrical components.

Recent advances in both AI and quantum information theory have given rise to the concept of quantum neural networks.[5] These hold promise in quantum information processing, which is challenging to classical networks, but can also find application in solving classical problems.[5][6] In 2018, a physical realization of a quantum reservoir computing architecture was demonstrated in the form of nuclear spins within a molecular solid.[6] However, the nuclear spin experiments in [6] did not demonstrate quantum reservoir computing per se as they did not involve processing of sequential data. Rather the data were vector inputs, which makes this more accurately a demonstration of quantum implementation of a random kitchen sink[7] algorithm (also going by the name of extreme learning machines in some communities). In 2019, another possible implementation of quantum reservoir processors was proposed in the form of two-dimensional fermionic lattices.[6] In 2020, realization of reservoir computing on gate-based quantum computers was proposed and demonstrated on cloud-based IBM superconducting near-term quantum computers.[8]

Reservoir computers have been used for time-series analysis purposes. In particular, some of their usages involve chaotic time-series prediction,[9][10] separation of chaotic signals,[11] and link inference of networks from their dynamics.[12]

Classical reservoir computing

Reservoir

The 'reservoir' in reservoir computing is the internal structure of the computer, and must have two properties: it must be made up of individual, non-linear units, and it must be capable of storing information. The non-linearity describes the response of each unit to input, which is what allows reservoir computers to solve complex problems. Reservoirs are able to store information by connecting the units in recurrent loops, where the previous input affects the next response. The change in reaction due to the past allows the computers to be trained to complete specific tasks.[13]

Reservoirs can be virtual or physical.[13] Virtual reservoirs are typically randomly generated and are designed like neural networks.[13][3] Virtual reservoirs can be designed to have non-linearity and recurrent loops, but, unlike neural networks, the connections between units are randomized and remain unchanged throughout computation.[13] Physical reservoirs are possible because of the inherent non-linearity of certain natural systems. The interaction between ripples on the surface of water contains the nonlinear dynamics required in reservoir creation, and a pattern recognition RC was developed by first inputting ripples with electric motors then recording and analyzing the ripples in the readout.[1]

Readout

The readout is a neural network layer that performs a linear transformation on the output of the reservoir.[1] The weights of the readout layer are trained by analyzing the spatiotemporal patterns of the reservoir after excitation by known inputs, and by utilizing a training method such as a linear regression or a Ridge regression.[1] As its implementation depends on spatiotemporal reservoir patterns, the details of readout methods are tailored to each type of reservoir.[1] For example, the readout for a reservoir computer using a container of liquid as its reservoir might entail observing spatiotemporal patterns on the surface of the liquid.[1]

Types

Context reverberation network

An early example of reservoir computing was the context reverberation network.[14] In this architecture, an input layer feeds into a high dimensional dynamical system which is read out by a trainable single-layer perceptron. Two kinds of dynamical system were described: a recurrent neural network with fixed random weights, and a continuous reaction–diffusion system inspired by Alan Turing’s model of morphogenesis. At the trainable layer, the perceptron associates current inputs with the signals that reverberate in the dynamical system; the latter were said to provide a dynamic "context" for the inputs. In the language of later work, the reaction–diffusion system served as the reservoir.

Echo state network

Liquid-state machine

Main page: Liquid-state machineChaotic Liquid State Machine

The liquid (i.e. reservoir) of a Chaotic Liquid State Machine (CLSM),[15][16] or chaotic reservoir, is made from chaotic spiking neurons but which stabilize their activity by settling to a single hypothesis that describes the trained inputs of the machine. This is in contrast to general types of reservoirs that don’t stabilize. The liquid stabilization occurs via synaptic plasticity and chaos control that govern neural connections inside the liquid. CLSM showed promising results in learning sensitive time series data.[15][16]

Nonlinear transient computation

This type of information processing is most relevant when time-dependent input signals depart from the mechanism’s internal dynamics.[17] These departures cause transients or temporary altercations which are represented in the device’s output.[17]

Deep reservoir computing

The extension of the reservoir computing framework towards Deep Learning, with the introduction of Deep Reservoir Computing and of the Deep Echo State Network (DeepESN) model[18][19][20][21] allows to develop efficiently trained models for hierarchical processing of temporal data, at the same time enabling the investigation on the inherent role of layered composition in recurrent neural networks.

Quantum reservoir computing

Quantum reservoir computing may use the nonlinear nature of quantum mechanical interactions or processes to form the characteristic nonlinear reservoirs[5][6][22][8] but may also be done with linear reservoirs when the injection of the input to the reservoir creates the nonlinearity.[23] The marriage of machine learning and quantum devices is leading to the emergence of quantum neuromorphic computing as a new research area.[24]

Types

Gaussian states of interacting quantum harmonic oscillators

Gaussian states are a paradigmatic class of states of continuous variable quantum systems.[25] Although they can nowadays be created and manipulated in, e.g, state-of-the-art optical platforms,[26] naturally robust to decoherence, it is well-known that they are not sufficient for, e.g., universal quantum computing because transformations that preserve the Gaussian nature of a state are linear.[27] Normally, linear dynamics would not be sufficient for nontrivial reservoir computing either. It is nevertheless possible to harness such dynamics for reservoir computing purposes by considering a network of interacting quantum harmonic oscillators and injecting the input by periodical state resets of a subset of the oscillators. With a suitable choice of how the states of this subset of oscillators depends on the input, the observables of the rest of the oscillators can become nonlinear functions of the input suitable for reservoir computing; indeed, thanks to the properties of these functions, even universal reservoir computing becomes possible by combining the observables with a polynomial readout function.[23] In principle, such reservoir computers could be implemented with controlled multimode optical parametric processes,[28] however efficient extraction of the output from the system is challenging especially in the quantum regime where measurement back-action must be taken into account.

2-D quantum dot lattices

In this architecture, randomized coupling between lattice sites grants the reservoir the “black box” property inherent to reservoir processors.[5] The reservoir is then excited, which acts as the input, by an incident optical field. Readout occurs in the form of occupational numbers of lattice sites, which are naturally nonlinear functions of the input.[5]

Nuclear spins in a molecular solid

In this architecture, quantum mechanical coupling between spins of neighboring atoms within the molecular solid provides the non-linearity required to create the higher-dimensional computational space.[6] The reservoir is then excited by radiofrequency electromagnetic radiation tuned to the resonance frequencies of relevant nuclear spins.[6] Readout occurs by measuring the nuclear spin states.[6]

Reservoir computing on gate-based near-term superconducting quantum computers

The most prevalent model of quantum computing is the gate-based model where quantum computation is performed by sequential applications of unitary quantum gates on qubits of a quantum computer.[29] A theory for the implementation of reservoir computing on a gate-based quantum computer with proof-of-principle demonstrations on a number of IBM superconducting noisy intermediate-scale quantum (NISQ) computers[30] has been reported in.[8]

See also

References

  1. 1.0 1.1 1.2 1.3 1.4 1.5 1.6 1.7 Tanaka, Gouhei; Yamane, Toshiyuki; Héroux, Jean Benoit; Nakane, Ryosho; Kanazawa, Naoki; Takeda, Seiji; Numata, Hidetoshi; Nakano, Daiju et al. (2019). "Recent advances in physical reservoir computing: A review". Neural Networks 115: 100–123. doi:10.1016/j.neunet.2019.03.005. ISSN 0893-6080. PMID 30981085. 
  2. Röhm, André; Lüdge, Kathy (2018-08-03). "Multiplexed networks: reservoir computing with virtual and real nodes". Journal of Physics Communications 2 (8): 085007. doi:10.1088/2399-6528/aad56d. ISSN 2399-6528. Bibcode2018JPhCo...2h5007R. 
  3. 3.0 3.1 3.2 3.3 3.4 Benjamin Schrauwen, David Verstraeten, and Jan Van Campenhout. "An overview of reservoir computing: theory, applications, and implementations." Proceedings of the European Symposium on Artificial Neural Networks ESANN 2007, pp. 471–482.
  4. Fernando, C.; Sojakka, Sampsa (2003). "Pattern Recognition in a Bucket". Advances in Artificial Life. Lecture Notes in Computer Science. 2801. pp. 588–597. doi:10.1007/978-3-540-39432-7_63. ISBN 978-3-540-20057-4. 
  5. 5.0 5.1 5.2 5.3 5.4 Ghosh, Sanjib; Opala, Andrzej; Matuszewski, Michał; Paterek, Tomasz; Liew, Timothy C. H. (December 2019). "Quantum reservoir processing". npj Quantum Information 5 (1): 35. doi:10.1038/s41534-019-0149-8. ISSN 2056-6387. Bibcode2019npjQI...5...35G. 
  6. 6.0 6.1 6.2 6.3 6.4 6.5 6.6 6.7 Negoro, Makoto; Mitarai, Kosuke; Fujii, Keisuke; Nakajima, Kohei; Kitagawa, Masahiro (2018-06-28). "Machine learning with controllable quantum dynamics of a nuclear spin ensemble in a solid". arXiv:1806.10910 [quant-ph].
  7. Rahimi, Ali; Recht, Benjamin (December 2008). "Weighted Sums of Random Kitchen Sinks: Replacing minimization with randomization in Learning". NIPS'08: Proceedings of the 21st International Conference on Neural Information Processing Systems: 1313–1320. http://papers.nips.cc/paper/3495-weighted-sums-of-random-kitchen-sinks-replacing-minimization-with-randomization-in-learning.pdf. 
  8. 8.0 8.1 8.2 Chen, Jiayin; Nurdin, Hendra; Yamamoto, Naoki (2020-08-24). "Temporal Information Processing on Noisy Quantum Computers". Physical Review Applied 14 (2): 024065. doi:10.1103/PhysRevApplied.14.024065. Bibcode2020PhRvP..14b4065C. https://doi.org/10.1103/PhysRevApplied.14.024065. 
  9. Pathak, Jaideep; Hunt, Brian; Girvan, Michelle; Lu, Zhixin; Ott, Edward (2018-01-12). "Model-Free Prediction of Large Spatiotemporally Chaotic Systems from Data: A Reservoir Computing Approach". Physical Review Letters 120 (2): 024102. doi:10.1103/PhysRevLett.120.024102. PMID 29376715. Bibcode2018PhRvL.120b4102P. 
  10. Vlachas, P.R.; Pathak, J.; Hunt, B.R.; Sapsis, T.P.; Girvan, M.; Ott, E.; Koumoutsakos, P. (2020-03-21). "Backpropagation algorithms and Reservoir Computing in Recurrent Neural Networks for the forecasting of complex spatiotemporal dynamics". Neural Networks 126: 191–217. doi:10.1016/j.neunet.2020.02.016. ISSN 0893-6080. PMID 32248008. http://dx.doi.org/10.1016/j.neunet.2020.02.016. 
  11. Krishnagopal, Sanjukta; Girvan, Michelle; Ott, Edward; Hunt, Brian R. (2020-02-01). "Separation of chaotic signals by reservoir computing". Chaos: An Interdisciplinary Journal of Nonlinear Science 30 (2): 023123. doi:10.1063/1.5132766. ISSN 1054-1500. PMID 32113243. Bibcode2020Chaos..30b3123K. https://aip.scitation.org/doi/10.1063/1.5132766. 
  12. Banerjee, Amitava; Hart, Joseph D.; Roy, Rajarshi; Ott, Edward (2021-07-20). "Machine Learning Link Inference of Noisy Delay-Coupled Networks with Optoelectronic Experimental Tests". Physical Review X 11 (3): 031014. doi:10.1103/PhysRevX.11.031014. Bibcode2021PhRvX..11c1014B. 
  13. 13.0 13.1 13.2 13.3 Soriano, Miguel C. (2017-02-06). "Viewpoint: Reservoir Computing Speeds Up" (in en). Physics 10: 12. doi:10.1103/Physics.10.12. https://physics.aps.org/articles/v10/12. 
  14. Kevin Kirby. "Context dynamics in neural sequential learning." Proceedings of the Florida Artificial Intelligence Research Symposium FLAIRS (1991), 66–70.
  15. 15.0 15.1 Aoun, Mario Antoine; Boukadoum, Mounir (2014). "Learning algorithm and neurocomputing architecture for NDS Neurons". 2014 IEEE 13th International Conference on Cognitive Informatics and Cognitive Computing. IEEE. pp. 126–132. doi:10.1109/icci-cc.2014.6921451. ISBN 978-1-4799-6081-1. http://dx.doi.org/10.1109/icci-cc.2014.6921451. 
  16. 16.0 16.1 Aoun, Mario Antoine; Boukadoum, Mounir (2015). "Chaotic Liquid State Machine". International Journal of Cognitive Informatics and Natural Intelligence 9 (4): 1–20. doi:10.4018/ijcini.2015100101. ISSN 1557-3958. http://dx.doi.org/10.4018/ijcini.2015100101. 
  17. 17.0 17.1 Crook, Nigel (2007). "Nonlinear Transient Computation". Neurocomputing 70 (7–9): 1167–1176. doi:10.1016/j.neucom.2006.10.148. 
  18. Pedrelli, Luca (2019). Deep Reservoir Computing: A Novel Class of Deep Recurrent Neural Networks (PhD thesis). Università di Pisa.
  19. Gallicchio, Claudio; Micheli, Alessio; Pedrelli, Luca (2017-12-13). "Deep reservoir computing: A critical experimental analysis". Neurocomputing 268: 87–99. doi:10.1016/j.neucom.2016.12.089. 
  20. Gallicchio, Claudio; Micheli, Alessio (2017-05-05). "Echo State Property of Deep Reservoir Computing Networks". Cognitive Computation 9 (3): 337–350. doi:10.1007/s12559-017-9461-9. ISSN 1866-9956. 
  21. Gallicchio, Claudio; Micheli, Alessio; Pedrelli, Luca (December 2018). "Design of deep echo state networks". Neural Networks 108: 33–47. doi:10.1016/j.neunet.2018.08.002. ISSN 0893-6080. PMID 30138751. 
  22. Chen, Jiayin; Nurdin, Hendra (2019-05-15). "Learning nonlinear input–output maps with dissipative quantum systems". Quantum Information Processing 18 (7): 198. doi:10.1007/s11128-019-2311-9. Bibcode2019QuIP...18..198C. https://link.springer.com/article/10.1007%2Fs11128-019-2311-9. 
  23. 23.0 23.1 Nokkala, Johannes; Martínez-Peña, Rodrigo; Giorgi, Gian Luca; Parigi, Valentina; Soriano, Miguel C.; Zambrini, Roberta (2021). "Gaussian states of continuous-variable quantum systems provide universal and versatile reservoir computing". Communications Physics 4 (1): 53. doi:10.1038/s42005-021-00556-w. Bibcode2021CmPhy...4...53N. 
  24. Marković, Danijela; Grollier, Julie (2020-10-13). "Quantum Neuromorphic Computing". Applied Physics Letters 117 (15): 150501. doi:10.1063/5.0020014. Bibcode2020ApPhL.117o0501M. https://doi.org/10.1063/5.0020014. 
  25. Ferraro, Alessandro; Olivares, Stefano; Paris, Matteo G. A. (2005-03-31). "Gaussian states in continuous variable quantum information". arXiv:quant-ph/0503237.
  26. Roslund, Jonathan; de Araújo, Renné Medeiros; Jiang, Shifeng; Fabre, Claude; Treps, Nicolas (2013-12-15). "Wavelength-multiplexed quantum networks with ultrafast frequency combs" (in en). Nature Photonics 8 (2): 109–112. doi:10.1038/nphoton.2013.340. ISSN 1749-4893. https://www.nature.com/articles/nphoton.2013.340. 
  27. Bartlett, Stephen D.; Sanders, Barry C.; Braunstein, Samuel L.; Nemoto, Kae (2002-02-14). "Efficient Classical Simulation of Continuous Variable Quantum Information Processes". Physical Review Letters 88 (9): 097904. doi:10.1103/PhysRevLett.88.097904. PMID 11864057. Bibcode2002PhRvL..88i7904B. https://link.aps.org/doi/10.1103/PhysRevLett.88.097904. 
  28. Nokkala, J.; Arzani, F.; Galve, F.; Zambrini, R.; Maniscalco, S.; Piilo, J.; Treps, N.; Parigi, V. (2018-05-09). "Reconfigurable optical implementation of quantum complex networks" (in en). New Journal of Physics 20 (5): 053024. doi:10.1088/1367-2630/aabc77. ISSN 1367-2630. Bibcode2018NJPh...20e3024N. https://doi.org/10.1088%2F1367-2630%2Faabc77. 
  29. Nielsen, Michael; Chuang, Isaac (2010), Quantum Computation and Quantum Information (2 ed.), Cambridge University Press Cambridge 
  30. John Preskill. "Quantum Computing in the NISQ era and beyond." Quantum 2,79 (2018)

Further reading