Citation Relationships

Legends: Link to a Model Reference cited by multiple papers


Baras D, Meir R (2007) Reinforcement learning, spike-time-dependent plasticity, and the BCM rule. Neural Comput 19:2245-79 [PubMed]

References and models cited by this paper

References and models that cite this paper

Baras D (2006) Direct policy search in reinforcement learning and synaptic plasticity in biological neural networks Unpublished masters thesis Technion. Available onlineat http:--www.ee.technion.ac.il-rmeir-BarasThesis06.pdf
Bartlett P, Baxter J (1999) Hebbian synaptic modifications in spiking neurons that learn Tech Rep
Baxter J, Bartlett PL (2001) Infinite-horizon policy-gradient estimation J Artif Intell Res 15:319-350
Bertsekas D, Tsitsiklis J (1996) Neuro-dynamic programming
Bienenstock EL, Cooper LN, Munro PW (1982) Theory for the development of neuron selectivity: orientation specificity and binocular interaction in visual cortex. J Neurosci 2:32-48 [PubMed]
Gerstner W, Kistler WM (2002) Spiking neuron models
Haykin S (1999) Neural Networks: A Comprehensive Foundation (2nd Ed)
Izhikevich EM, Desai NS (2003) Relating STDP to BCM. Neural Comput 15:1511-23 [Journal] [PubMed]
Koch C (1999) Biophysics Of Computation: Information Processing in Single Neurons
Konda VR, Tsitsiklis JN (2003) On actor-critic algorithms SIAM J Control Optim 42:1143-1166
Rao RP, Sejnowski TJ (2001) Spike-timing-dependent Hebbian plasticity as temporal difference learning. Neural Comput 13:2221-37 [Journal] [PubMed]
Richardson MJ, Melamed O, Silberberg G, Gerstner W, Markram H (2005) Short-term synaptic plasticity orchestrates the response of pyramidal cells and interneurons to population bursts. J Comput Neurosci 18:323-31 [Journal] [PubMed]
Schultz W (2002) Getting formal with dopamine and reward. Neuron 36:241-63 [PubMed]
Seung HS (2003) Learning in spiking neural networks by reinforcement of stochastic synaptic transmission. Neuron 40:1063-73 [PubMed]
Toyoizumi T, Pfister JP, Aihara K, Gerstner W (2005) Generalized Bienenstock-Cooper-Munro rule for spiking neurons that maximizes information transmission. Proc Natl Acad Sci U S A 102:5239-44 [Journal] [PubMed]
Wörgötter F, Porr B (2005) Temporal sequence learning, prediction, and control: a review of different models and their relation to biological mechanisms. Neural Comput 17:245-319 [Journal] [PubMed]
Xie X, Seung HS (2004) Learning in neural networks by reinforcement of irregular spiking. Phys Rev E Stat Nonlin Soft Matter Phys 69:041909 [Journal] [PubMed]
Legenstein R, Pecevski D, Maass W (2008) A learning theory for reward-modulated spike-timing-dependent plasticity with application to biofeedback. PLoS Comput Biol 4:e1000180 [Journal] [PubMed]
   Reward modulated STDP (Legenstein et al. 2008) [Model]
Richmond P, Buesing L, Giugliano M, Vasilaki E (2011) Democratic population decisions result in robust policy-gradient learning: a parametric study with GPU simulations. PLoS One 6:e18539 [Journal] [PubMed]
   Democratic population decisions result in robust policy-gradient learning (Richmond et al. 2011) [Model]
(21 refs)