Citation Relationships



Bartlett P, Baxter J (2000) Stochastic optimization of controlled partially observable Markov decision processes Proc 39th IEEE Conf Decision and Control

References and models cited by this paper

References and models that cite this paper

Florian RV (2007) Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity. Neural Comput 19:1468-502 [Journal] [PubMed]

(1 refs)