Citation Relationships



Seung HS (2003) Learning in spiking neural networks by reinforcement of stochastic synaptic transmission. Neuron 40:1063-73 [PubMed]

References and models cited by this paper

References and models that cite this paper

Baras D, Meir R (2007) Reinforcement learning, spike-time-dependent plasticity, and the BCM rule. Neural Comput 19:2245-79 [Journal] [PubMed]

Chadderdon GL, Neymotin SA, Kerr CC, Lytton WW (2012) Reinforcement learning of targeted movement in a spiking neuronal model of motor cortex. PLoS One 7:e47251 [Journal] [PubMed]

   Reinforcement learning of targeted movement (Chadderdon et al. 2012) [Model]

Costa RP, Froemke RC, Sjöström PJ, van Rossum MC (2015) Unified pre- and postsynaptic long-term plasticity enables reliable and flexible learning. eLife [Journal] [PubMed]

   Memory savings through unified pre- and postsynaptic STDP (Costa et al. 2015) [Model]

Costa RP, Padamsey Z, D'Amour JA, Emptage NJ, Froemke RC, Vogels TP (2017) Synaptic Transmission Optimization Predicts Expression Loci of Long-Term Plasticity. Neuron 96:177-189.e7 [Journal] [PubMed]

   Statistical Long-term Synaptic Plasticity (statLTSP) (Costa et al. 2017) [Model]

Fiete IR, Fee MS, Seung HS (2007) Model of birdsong learning based on gradient estimation by dynamic perturbation of neural conductances. J Neurophysiol 98:2038-57 [Journal] [PubMed]

Florian RV (2007) Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity. Neural Comput 19:1468-502 [Journal] [PubMed]

Izhikevich EM (2007) Solving the distal reward problem through linkage of STDP and dopamine signaling. Cereb Cortex 17:2443-52 [Journal] [PubMed]

   Linking STDP and Dopamine action to solve the distal reward problem (Izhikevich 2007) [Model]

Nemenman I (2005) Fluctuation-dissipation theorem and models of learning. Neural Comput 17:2006-33 [Journal] [PubMed]

Neymotin SA, Chadderdon GL, Kerr CC, Francis JT, Lytton WW (2013) Reinforcement learning of two-joint virtual arm reaching in a computer model of sensorimotor cortex. Neural Comput 25:3263-93 [Journal] [PubMed]

   Sensorimotor cortex reinforcement learning of 2-joint virtual arm reaching (Neymotin et al. 2013) [Model]

Richmond P, Buesing L, Giugliano M, Vasilaki E (2011) Democratic population decisions result in robust policy-gradient learning: a parametric study with GPU simulations. PLoS One 6:e18539 [Journal] [PubMed]

   Democratic population decisions result in robust policy-gradient learning (Richmond et al. 2011) [Model]

Roelfsema PR, van Ooyen A (2005) Attention-gated reinforcement learning of internal representations for classification. Neural Comput 17:2176-214 [Journal] [PubMed]

Sakai Y, Fukai T (2008) The actor-critic learning is behind the matching law: matching versus optimal behaviors. Neural Comput 20:227-51 [Journal] [PubMed]

Soltani A, Wang XJ (2006) A biophysically based neural model of matching law behavior: melioration by stochastic synapses. J Neurosci 26:3731-44 [Journal] [PubMed]

Swinehart CD, Abbott LF (2005) Supervised learning through neuronal response modulation. Neural Comput 17:609-31 [Journal] [PubMed]

Toyoizumi T, Pfister JP, Aihara K, Gerstner W (2007) Optimality model of unsupervised spike-timing-dependent plasticity: synaptic memory and weight distribution. Neural Comput 19:639-71 [Journal] [PubMed]

Whittington JCR, Bogacz R (2017) An Approximation of the Error Backpropagation Algorithm in a Predictive Coding Network with Local Hebbian Synaptic Plasticity. Neural Comput 29:1229-1262 [Journal] [PubMed]

   Supervised learning with predictive coding (Whittington & Bogacz 2017) [Model]

(16 refs)