Citation Relationships

Williams RJ (1992) Simple statistical gradient-following algorithms for connectionist reinforcement learning. Mach Learn 8:229-256

References and models cited by this paper

References and models that cite this paper

Fiete IR, Fee MS, Seung HS (2007) Model of birdsong learning based on gradient estimation by dynamic perturbation of neural conductances. J Neurophysiol 98:2038-2057
Florian RV (2007) Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity. Neural Comput 19:1468-1502
Fujita H, Ishii S (2007) Model-based reinforcement learning for partially observable games with sampling-based state estimation. Neural Comput 19:3051-3087
Richmond P, Buesing L, Giugliano M, Vasilaki E (2011) Democratic population decisions result in robust policy-gradient learning: a parametric study with GPU simulations. PLoS One 6:e18539
   Democratic population decisions result in robust policy-gradient learning (Richmond et al. 2011) [Model]
Roelfsema PR, van Ooyen A (2005) Attention-gated reinforcement learning of internal representations for classification. Neural Comput 17:2176-2214
Swinehart CD, Abbott LF (2005) Supervised learning through neuronal response modulation. Neural Comput 17:609-631
Werfel J, Xie X, Seung HS (2005) Learning curves for stochastic gradient descent in linear feedforward networks. Neural Comput 17:2699-2718
Whittington JCR, Bogacz R (2017) An Approximation of the Error Backpropagation Algorithm in a Predictive Coding Network with Local Hebbian Synaptic Plasticity. Neural Comput 29:1229-1262
   Supervised learning with predictive coding (Whittington & Bogacz 2017) [Model]
(8 refs)