Citation Relationships


Baxter J, Bartlett PL (2001) Infinite-horizon policy-gradient estimation. J Artif Intell Res 15:319-350

References and models cited by this paper

References and models that cite this paper

Baras D, Meir R (2007) Reinforcement learning, spike-time-dependent plasticity, and the BCM rule. Neural Comput 19:2245-79
Fiete IR, Fee MS, Seung HS (2007) Model of birdsong learning based on gradient estimation by dynamic perturbation of neural conductances. J Neurophysiol 98:2038-57
Florian RV (2007) Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity. Neural Comput 19:1468-502
Roelfsema PR, van Ooyen A (2005) Attention-gated reinforcement learning of internal representations for classification. Neural Comput 17:2176-214
(4 refs)