Prefrontal cortex basal ganglia working memory
Prefrontal cortex basal ganglia working memory (PBWM) is an algorithm that models working memory in the prefrontal cortex and the basal ganglia.[1] It can be compared to long short-term memory (LSTM) in functionality, but is more biologically explainable.[1][2]
It uses the primary value learned value model to train prefrontal cortex working-memory updating system, based on the biology of the prefrontal cortex and basal ganglia.[3]
It is used as part of the Leabra framework and was implemented in Emergent in 2019.
Abstract
The prefrontal cortex has long been thought to subserve both working memory (the holding of information online for processing) and "executive" functions (deciding how to manipulate working memory and perform processing). Although many computational models of working memory have been developed, the mechanistic basis of executive function remains elusive.
PBWM is a computational model of the prefrontal cortex to control both itself and other brain areas in a strategic, task-appropriate manner. These learning mechanisms are based on subcortical structures in the midbrain, basal ganglia and amygdala, which together form an actor/critic architecture. The critic system learns which prefrontal representations are task-relevant and trains the actor, which in turn provides a dynamic gating mechanism for controlling working memory updating. Computationally, the learning mechanism is designed to simultaneously solve the temporal and structural credit assignment problems.
The model's performance compares favorably with standard backpropagation-based temporal learning mechanisms on the challenging 1-2-AX working memory task, and other benchmark working memory tasks.[1][third-party source needed]
Model
First, there are multiple separate stripes (groups of units) in the prefrontal cortex and striatum layers. Each stripe can be independently updated, such that this system can remember several different things at the same time, each with a different "updating policy" of when memories are updated and maintained. The active maintenance of the memory is in prefrontal cortex (PFC), and the updating signals (and updating policy more generally) come from the striatum units (a subset of basal ganglia units).[3]
PVLV provides reinforcement learning signals to train up the dynamic gating system in the basal ganglia.
Sensory input and motor output
The sensory input is connected to the posterior cortex which is connected to the motor output. The sensory input is also linked to the PVLV system.
Posterior cortex
The posterior cortex form the hidden layers of the input/output mapping. The PFC is connected with the posterior cortex to contextualize this input/output mapping.
PFC
The PFC (for output gating) has a localist one-to-one representation of the input units for every stripe. Thus, you can look at these PFC representations and see directly what the network is maintaining. The PFC maintains the working memory needed to perform the task.
Striatum
This is the dynamic gating system representing the striatum units of the basal ganglia. Every even-index unit within a stripe represents "Go", while the odd-index units represent "NoGo." The Go units cause updating of the PFC, while the NoGo units cause the PFC to maintain its existing memory representation.
There are groups of units for every stripe.
In the PBWM model in Emergent, the matrices represent the striatum.
PVLV
All of these layers are part of PVLV system. The PVLV system controls the dopaminergic modulation of the basal ganglia (BG). Thus, BG/PVLV form an actor-critic architecture where the PVLV system learns when to update.[citation needed]
SNrThal
SNrThal represents the substantia nigra pars reticulata (SNr) and the associated area of the thalamus, which produce a competition among the Go/NoGo units within a given stripe and mediates competition using k-winners-take-all dynamics. If there is more overall Go activity in a given stripe, then the associated SNrThal unit gets activated, and it drives updating in PFC. For every stripe, there is one unit in SNrThal.[citation needed]
VTA and SNc
Ventral tegmental area (VTA) and substantia nigra pars compacta (SNc) are part of the dopamine layer. This layer models midbrain dopamine neurons. They control the dopaminergic modulation of the basal ganglia.[citation needed]
See also
- State–action–reward–state–action
- Sammon Mapping
- Constructing skill trees
References
- ↑ 1.0 1.1 1.2 O'Reilly, R.C; Frank, M.J. (2006). "Making Working Memory Work: A Computational Model of Learning in the Frontal Cortex and Basal Ganglia". Neural Computation 18 (2): 283–328. doi:10.1162/089976606775093909. PMID 16378516. http://psych.colorado.edu/~oreilly/pubs-abstr.html#OReillyFrank06.
- ↑ Jeevanandam, Nivash (2021-09-13). "Underrated But Fascinating ML Concepts #5 – CST, PBWM, SARSA, & Sammon Mapping" (in en). https://analyticsindiamag.com/underrated-but-fascinating-ml-concepts-5-cst-pbwm-sarsa-sammon-mapping/.
- ↑ 3.0 3.1 "Leabra PBWM". CCNLab. http://grey.colorado.edu/emergent/index.php/Leabra_PBWM.
Original source: https://en.wikipedia.org/wiki/Prefrontal cortex basal ganglia working memory.
Read more |