Citation Relationships



Baxter J, Weaver L, Bartlett PL (1999) Direct gradient-based reinforcement learning: II. Gradient ascent algorithms and experiments Tech Rep Australian National University, Research School of Information Sciences and Engineering

References and models cited by this paper

References and models that cite this paper

Florian RV (2007) Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity. Neural Comput 19:1468-502 [Journal] [PubMed]

Legenstein R, Pecevski D, Maass W (2008) A learning theory for reward-modulated spike-timing-dependent plasticity with application to biofeedback. PLoS Comput Biol 4:e1000180 [Journal] [PubMed]

   Reward modulated STDP (Legenstein et al. 2008) [Model]

(2 refs)