Citation Relationships



Bartlett PL, Baxter J (2000) Estimation and approximation bounds for gradient based reinforcement learning Proc 13th Ann Conf Comput Learn Theory :133-141

References and models cited by this paper

References and models that cite this paper

Florian RV (2007) Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity. Neural Comput 19:1468-502 [Journal] [PubMed]

(1 refs)