Citation Relationships



Di Castro D, Volkinshtein S, Meir R (2009) Temporal difference based actor critic learning: convergence and neural implementation. NIPS 22:385-392

References and models cited by this paper

References and models that cite this paper

Richmond P, Buesing L, Giugliano M, Vasilaki E (2011) Democratic population decisions result in robust policy-gradient learning: a parametric study with GPU simulations. PLoS One 6:e18539

   Democratic population decisions result in robust policy-gradient learning (Richmond et al. 2011) [Model]

(1 ref)