Citation Relationships

Legends: Link to a Model Reference cited by multiple papers


Watkins CJCH (1989) Learning from delayed rewards Unpublished doctoral dissertation

References and models cited by this paper

References and models that cite this paper

Kato A, Morita K (2016) Forgetting in Reinforcement Learning Links Sustained Dopamine Signals to Motivation. PLoS Comput Biol 12:e1005145 [Journal] [PubMed]
   Reinforcement Learning with Forgetting: Linking Sustained Dopamine to Motivation (Kato Morita 2016) [Model]
Morita K, Kato A (2014) Striatal dopamine ramping may indicate flexible reinforcement learning with forgetting in the cortico-basal ganglia circuits. Front Neural Circuits 8:36 [Journal] [PubMed]
   Striatal dopamine ramping: an explanation by reinforcement learning with decay (Morita & Kato, 2014) [Model]
Porr B, Wörgötter F (2006) Strongly improved stability and faster convergence of temporal sequence learning by using input correlations only. Neural Comput 18:1380-412 [Journal] [PubMed]
Richmond P, Buesing L, Giugliano M, Vasilaki E (2011) Democratic population decisions result in robust policy-gradient learning: a parametric study with GPU simulations. PLoS One 6:e18539 [Journal] [PubMed]
   Democratic population decisions result in robust policy-gradient learning (Richmond et al. 2011) [Model]
Wörgötter F, Porr B (2005) Temporal sequence learning, prediction, and control: a review of different models and their relation to biological mechanisms. Neural Comput 17:245-319 [Journal] [PubMed]
(5 refs)