PVLV
The primary value learned value (PVLV) model is a proposed explanation for the reward-predictive firing properties of dopamine (DA) neurons.[1] It simulates behavioral and neural data on Pavlovian conditioning, including the midbrain dopaminergic neurons that fire in proportion to unexpected rewards. It is an alternative to the temporal-difference (TD) learning algorithm.[2] PVLV is used as part of the Leabra framework.
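The idea of a signal that fires in proportion to unexpected reward can be illustrated with a small sketch. The code below is not the PVLV model itself: it assumes a single Rescorla-Wagner-style delta rule of the kind associated with the primary value (PV) component, and it omits the learned value (LV) system and everything specific to Leabra. The function name, the learning rate, and the reward schedule are hypothetical choices made for illustration.

```python
def pv_prediction_errors(rewards, lrate=0.2):
    """Return the per-trial prediction error for a sequence of rewards.

    Assumption: one scalar expectation updated by a simple delta rule.
    This is a simplified stand-in, not the published PVLV equations.
    """
    expectation = 0.0  # current learned expectation of reward
    errors = []
    for r in rewards:
        delta = r - expectation        # unexpected portion of the reward
        errors.append(delta)
        expectation += lrate * delta   # move the expectation toward the reward
    return errors


# Hypothetical schedule: 30 rewarded trials, then one trial with reward omitted.
rewards = [1.0] * 30 + [0.0]
errors = pv_prediction_errors(rewards)

print("first rewarded trial:", errors[0])
print("last rewarded trial: ", errors[29])
print("omitted reward:      ", errors[30])
```

Under these assumptions the error is largest on the first rewarded trial, shrinks toward zero as the reward becomes expected, and turns negative when an expected reward is omitted, which is the qualitative pattern the model is meant to account for.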
References
- O'Reilly, R.C.; Frank, M.J.; Hazy, T.E.; Watz, B. (2007). "PVLV: The Primary Value and Learned Value Pavlovian Learning Algorithm". Behavioral Neuroscience 121 (1): 31–49. doi:10.1037/0735-7044.121.1.31. PMID 17324049. http://psych.colorado.edu/~oreilly/pubs-abstr.html#OReillyFrankHazyEtAl07.
- "Leabra PBWM". CCNLab. http://grey.colorado.edu/emergent/index.php/Leabra_PBWM.
Original source: https://en.wikipedia.org/wiki/PVLV.