Attractor network

An attractor network is a type of recurrent dynamical network that evolves toward a stable pattern over time. Nodes in the attractor network converge toward a pattern that may be fixed-point (a single state), cyclic (with regularly recurring states), chaotic (locally but not globally unstable) or random (stochastic).[1] Attractor networks have largely been used in computational neuroscience to model neuronal processes such as associative memory[2] and motor behavior, as well as in biologically inspired methods of machine learning. An attractor network contains a set of n nodes, which can be represented as vectors in a d-dimensional space where n > d. Over time, the network state tends toward one of a set of predefined states on a d-manifold; these are the attractors.

Overview

In attractor networks, an attractor (or attracting set) is a closed subset of states A toward which the system of nodes evolves. A stationary attractor is a state or set of states where the global dynamics of the network stabilize. Cyclic attractors evolve the network toward a set of states in a limit cycle, which is repeatedly traversed. Chaotic attractors are non-repeating bounded attractors that are continuously traversed.

The network state space is the set of all possible node states. The attractor space is the set of nodes on the attractor. Attractor networks are initialized based on the input pattern. The dimensionality of the input pattern may differ from the dimensionality of the network nodes. The trajectory of the network consists of the set of states along the evolution path as the network converges toward the attractor state. The basin of attraction is the set of states that results in movement towards a certain attractor.[1]
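
To make these terms concrete, consider the following toy sketch (not drawn from the cited sources): a one-dimensional system with fixed-point attractors at -1 and +1, whose basins of attraction are separated by the unstable point 0. Each initial state traces a trajectory that ends on the attractor of the basin it started in.

  def step(x, dt=0.1):
      # Toy gradient dynamics dx/dt = x - x^3: fixed points at -1, 0, +1,
      # of which -1 and +1 are attractors and 0 separates their basins.
      return x + dt * (x - x ** 3)

  for x0 in (-1.5, -0.2, 0.3, 2.0):        # initial states in the state space
      x = x0
      for _ in range(200):                 # the visited states form the trajectory
          x = step(x)
      print(f"start {x0:+.1f} -> attractor {x:+.2f}")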

Types

Various types of attractors may be used to model different types of network dynamics. While fixed-point attractor networks are the most common (originating from Hopfield networks[3]), other types of networks are also examined.

Fixed point attractors

The fixed point attractor naturally follows from the Hopfield network. Conventionally, fixed points in this model represent encoded memories. These models have been used to explain associative memory, classification, and pattern completion. Hopfield nets contain an underlying energy function[4] that allows the network to asymptotically approach a stationary state. One class of point attractor network is initialized with an input, after which the input is removed and the network moves toward a stable state. Another class of attractor network features predefined weights that are probed by different types of input. If the stable state differs during and after the input, the network serves as a model of associative memory. However, if the states during and after input do not differ, the network can be used for pattern completion.

Other stationary attractors

Line attractors and plane attractors are used in the study of oculomotor control. These line attractors, or neural integrators, describe eye position in response to stimuli. Ring attractors have been used to model rodent head direction.
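
As an illustration of the integrator idea, the following toy linear network (a sketch only, not a model of the actual oculomotor circuitry) has a line of fixed points along the direction [1, 1]: along that direction the recurrent feedback cancels the leak, so the summed activity integrates a transient velocity-like input and then holds the resulting value.

  import numpy as np

  # Two-neuron line attractor (neural integrator): W has eigenvalue 1 along [1, 1],
  # so that component of the activity neither grows nor decays once the input stops.
  W = np.array([[0.5, 0.5],
                [0.5, 0.5]])
  x = np.zeros(2)
  dt = 0.01
  for t in np.arange(0.0, 3.0, dt):
      inp = np.array([1.0, 1.0]) if t < 1.0 else np.zeros(2)   # brief input pulse
      x = x + dt * (-x + W @ x + inp)                          # dx/dt = -x + Wx + input
  print(x)   # the activity holds the level reached when the input was switched off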

Cyclic attractors

Cyclic attractors are instrumental in modelling central pattern generators, neural circuits that govern oscillatory activity in animals, such as chewing, walking, and breathing.
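
A minimal example of a cyclic attractor is sketched below using the Stuart-Landau (Hopf normal form) oscillator, purely as an illustration of the concept rather than as a biological model of a central pattern generator: every trajectory except the origin spirals onto the unit circle and then traverses it periodically.

  import numpy as np

  def step(x, y, omega=2.0, dt=0.01):
      # dz/dt = (1 + i*omega)*z - |z|^2*z written in Cartesian coordinates:
      # a stable limit cycle of radius 1, traversed at angular frequency ~omega.
      r2 = x * x + y * y
      dx = x - omega * y - x * r2
      dy = omega * x + y - y * r2
      return x + dt * dx, y + dt * dy

  x, y = 0.1, 0.0                  # start well inside the cycle
  for _ in range(5000):
      x, y = step(x, y)
  print(np.hypot(x, y))            # radius is ~1: the state lies on the limit cycle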

Chaotic attractors

Chaotic attractors (also called strange attractors) have been hypothesized to reflect patterns in odor recognition. While chaotic attractors have the benefit of converging more quickly upon limit cycles, there is as yet no experimental evidence to support this theory.[5]

Continuous attractors

Neighboring stable states (fixed points) of continuous attractors (also called continuous attractor neural networks) code for neighboring values of a continuous variable such as head direction or actual position in space.

Ring attractors

Ring attractors are a subtype of continuous attractors with a particular topology of the neurons (a ring for 1-dimensional and a torus or twisted torus for 2-dimensional networks). The observed activity of grid cells is successfully explained by assuming the presence of ring attractors in the medial entorhinal cortex.[6] Recently, it has been proposed that similar ring attractors are present in the lateral portion of the entorhinal cortex and that their role extends to registering new episodic memories.[7]

Implementations

Attractor networks have mainly been implemented as memory models using fixed-point attractors. However, they have been largely impractical for computational purposes because of difficulties in designing the attractor landscape and network wiring, which result in spurious attractors and poorly conditioned basins of attraction. Furthermore, training attractor networks is generally computationally expensive compared to other methods such as k-nearest neighbor classifiers.[8] Nevertheless, their role in the general understanding of biological functions such as locomotion, memory, and decision-making makes them attractive as biologically realistic models.

Hopfield networks

Hopfield attractor networks are an early implementation of attractor networks with associative memory. These recurrent networks are initialized by the input, and tend toward a fixed-point attractor. The update function in discrete time is [math]\displaystyle{ x(t+1)=f(Wx(t)) }[/math], where [math]\displaystyle{ x }[/math] is a vector of nodes in the network and [math]\displaystyle{ W }[/math] is a symmetric matrix describing their connectivity. The continuous time update is [math]\displaystyle{ \frac{dx}{dt}=-\lambda x+f(Wx) }[/math].
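
A minimal sketch of the discrete-time update is given below; the Hebbian outer-product rule used to build [math]\displaystyle{ W }[/math] and the choice of [math]\displaystyle{ f }[/math] as the sign function are common conventions, not details fixed by this article.

  import numpy as np

  def hopfield_recall(W, x0, steps=50):
      # Iterate x(t+1) = f(W x(t)) with f = sign until the state stops
      # changing, i.e. until a fixed-point attractor is reached.
      x = x0.copy()
      for _ in range(steps):
          x_new = np.sign(W @ x)
          x_new[x_new == 0] = 1            # break ties arbitrarily
          if np.array_equal(x_new, x):
              break
          x = x_new
      return x

  # Store two +/-1 patterns with a Hebbian outer-product rule.
  patterns = np.array([[1, -1, 1, -1, 1, -1],
                       [1, 1, 1, -1, -1, -1]])
  n = patterns.shape[1]
  W = (patterns.T @ patterns) / n          # symmetric connectivity matrix
  np.fill_diagonal(W, 0)                   # no self-connections

  noisy = np.array([1, -1, 1, -1, 1, 1])   # corrupted copy of the first pattern
  print(hopfield_recall(W, noisy))         # recovers the first stored pattern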

Bidirectional networks are similar to Hopfield networks, with the special case that the matrix [math]\displaystyle{ W }[/math] is a block matrix.[4]
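
For a bidirectional associative memory linking two layers through a weight matrix [math]\displaystyle{ M }[/math], this block structure is conventionally written as [math]\displaystyle{ W=\begin{pmatrix}0 & M\\ M^T & 0\end{pmatrix} }[/math], so that each layer drives the other while the combined matrix remains symmetric (a standard construction; the exact form is not spelled out in this article).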

Localist attractor networks

Zemel and Mozer (2001)[8] proposed a method to reduce the number of spurious attractors that arise when each connection in the network encodes multiple attractors. Localist attractor networks encode knowledge locally by implementing an expectation-maximization algorithm on a mixture of Gaussians representing the attractors, minimizing the free energy in the network so that it converges on only the most relevant attractor. This results in the following update equations:

  1. Determine the activity of attractors: [math]\displaystyle{ q_i(t)=\frac{\pi_i g(y(t),w_i,\sigma(t))}{\sum_j\pi_jg(y(t),w_j,\sigma(t))} }[/math]
  2. Determine the next state of the network: [math]\displaystyle{ y(t+1)=\alpha(t)\xi+(1-\alpha(t))\sum_iq_i(t)w_i\,\! }[/math]
  3. Determine the attractor width through the network: [math]\displaystyle{ \sigma^2_y(t)=\frac{1}{n}\sum_iq_i(t)|y(t)-w_i|^2 }[/math]

([math]\displaystyle{ \pi_i }[/math] denotes the strength of the basin, [math]\displaystyle{ w_i }[/math] denotes the center of the basin, [math]\displaystyle{ \xi }[/math] denotes the input to the net, and [math]\displaystyle{ g }[/math] is an unnormalized Gaussian distribution centered at [math]\displaystyle{ y }[/math] with standard deviation [math]\displaystyle{ \sigma }[/math].)

The network is then re-observed, and the above steps repeat until convergence. The model also reflects two biologically relevant concepts. The change in [math]\displaystyle{ \alpha }[/math] models stimulus priming by allowing quicker convergence toward a recently visited attractor. Furthermore, the summed activity of attractors allows a gang effect in which two nearby attractors mutually reinforce each other's basins.
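
A schematic implementation of this update loop might look as follows. It is a sketch only: the Gaussian form of [math]\displaystyle{ g }[/math], the initial [math]\displaystyle{ \sigma }[/math], the decay schedule for [math]\displaystyle{ \alpha(t) }[/math], and the toy attractor values are assumptions, not quantities fixed by the equations above.

  import numpy as np

  def localist_attractor_step(y, xi, w, pi, sigma, alpha):
      # w: (k, n) attractor centers; pi: (k,) basin strengths; xi: input pattern.
      # g is taken as an unnormalized Gaussian of the distance from y to each center.
      g = np.exp(-np.sum((y - w) ** 2, axis=1) / (2 * sigma ** 2))
      q = pi * g
      q = q / q.sum()                                  # attractor activities q_i(t)
      y_next = alpha * xi + (1 - alpha) * (q @ w)      # next network state y(t+1)
      n = y.shape[0]
      sigma_next = np.sqrt(np.sum(q * np.sum((y - w) ** 2, axis=1)) / n)   # attractor width
      return y_next, sigma_next, q

  # Toy run with two attractors in a 4-dimensional state space (values are illustrative).
  w = np.array([[1.0, 1.0, -1.0, -1.0],
                [-1.0, -1.0, 1.0, 1.0]])
  pi = np.array([0.5, 0.5])
  xi = np.array([0.9, 0.8, -0.7, -1.1])                # input pattern
  y, sigma, alpha = xi.copy(), 1.0, 1.0
  for t in range(20):
      alpha *= 0.7                                     # assumed decay of alpha(t)
      y, sigma, q = localist_attractor_step(y, xi, w, pi, max(sigma, 1e-3), alpha)
  print(q)                                             # activity concentrates on one attractor

As [math]\displaystyle{ \alpha(t) }[/math] decays toward zero, the state is pulled increasingly by the active attractors rather than by the input, and the activity [math]\displaystyle{ q }[/math] concentrates on a single attractor.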

Reconsolidation attractor networks

Siegelmann (2008)[9] generalized the localist attractor network model to include the tuning of attractors themselves. This algorithm uses the EM method above, with the following modifications: (1) early termination of the algorithm when the attractor's activity is most distributed, or when high entropy suggests a need for additional memories, and (2) the ability to update the attractors themselves: [math]\displaystyle{ w_i(t+1)=vq_i(t)\cdot y(t)+[1-vq_i(t)]\cdot w_i(t)\,\! }[/math], where [math]\displaystyle{ v }[/math] is the step size parameter of the change of [math]\displaystyle{ w_i }[/math]. This model reflects memory reconsolidation in animals, and shows some of the same dynamics as those found in memory experiments.
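
Continuing the sketch above (same illustrative variable names; only the update equation itself comes from the model description), the attractor update could be written as:

  def reconsolidate(w, y, q, v=0.1):
      # w_i(t+1) = v*q_i(t)*y(t) + (1 - v*q_i(t))*w_i(t): each attractor center
      # drifts toward the current state in proportion to its activity q_i(t).
      gain = (v * q)[:, None]              # shape (k, 1) for broadcasting
      return gain * y + (1 - gain) * w

  # After each recall, the centers would be updated in place: w = reconsolidate(w, y, q)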

Further developments in attractor networks, such as kernel based attractor networks,[10] have improved the computational feasibility of attractor networks as a learning algorithm, while maintaining the high-level flexibility to perform pattern completion on complex compositional structures.

References

  1. Amit, D. J. (1989). Modeling brain function: The world of attractor neural networks. New York, NY: Cambridge University Press.
  2. Poucet, B.; Save, E. (2005). "Attractors in Memory". Science 308 (5723): 799–800. doi:10.1126/science.1112555. PMID 15879197.
  3. Hopfield, J. J. (1982). "Neural networks and physical systems with emergent collective computational abilities". Proceedings of the National Academy of Sciences 79 (8): 2554–2558. doi:10.1073/pnas.79.8.2554. PMID 6953413. Bibcode: 1982PNAS...79.2554H.
  4. John Hopfield, ed. "Hopfield network". http://www.scholarpedia.org/article/Hopfield_network.
  5. Chris Eliasmith, ed. "Attractor network". http://www.scholarpedia.org/article/Attractor_network.
  6. "Path integration and the neural basis of the 'cognitive map'". Nat. Rev. Neurosci. 7 (8): 663–678. August 2006. doi:10.1038/nrn1932. PMID 16858394. https://www.nature.com/articles/nrn1932.
  7. "Episodic Memories: How do the Hippocampus and the Entorhinal Ring Attractors Cooperate to Create Them?". Frontiers in Systems Neuroscience 14: 68. September 2020. doi:10.3389/fnsys.2020.559186. PMID 33013334.
  8. Zemel, R.; Mozer, M. (2001). "Localist attractor networks". Neural Computation 13 (5): 1045–1064. doi:10.1162/08997660151134325. PMID 11359644.
  9. Siegelmann, H. T. (2008). "Analog-symbolic memory that tracks via reconsolidation". Physica D 237 (9): 1207–1214. doi:10.1016/j.physd.2008.03.038. Bibcode: 2008PhyD..237.1207S.
  10. Nowicki, D.; Siegelmann, H. T. (2010). "Flexible Kernel Memory". PLOS ONE 5 (6): e10955. doi:10.1371/journal.pone.0010955. PMID 20552013. Bibcode: 2010PLoSO...510955N.