Find models by
Find models for
Find models of
Link to a Model
Reference cited by multiple papers
Watkins CJCH (1989) Learning from delayed rewards
Unpublished doctoral dissertation
References and models cited by this paper
References and models that cite this paper
Kato A, Morita K (2016)
Forgetting in Reinforcement Learning Links Sustained Dopamine Signals to Motivation.
PLoS Comput Biol
Reinforcement Learning with Forgetting: Linking Sustained Dopamine to Motivation (Kato Morita 2016) [Model]
Morita K, Kato A (2014)
Striatal dopamine ramping may indicate flexible reinforcement learning with forgetting in the cortico-basal ganglia circuits.
Front Neural Circuits
Striatal dopamine ramping: an explanation by reinforcement learning with decay (Morita & Kato, 2014) [Model]
Porr B, Wörgötter F (2006)
Strongly improved stability and faster convergence of temporal sequence learning by using input correlations only.
Richmond P, Buesing L, Giugliano M, Vasilaki E (2011)
Democratic population decisions result in robust policy-gradient learning: a parametric study with GPU simulations.
Democratic population decisions result in robust policy-gradient learning (Richmond et al. 2011) [Model]
Wörgötter F, Porr B (2005)
Temporal sequence learning, prediction, and control: a review of different models and their relation to biological mechanisms.