Citation Relationships

Florian RV (2007) Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity. Neural Comput 19:1468-502 [PubMed]

References and models cited by this paper

References and models that cite this paper

Abbott LF, Gerstner W (2005) Homeostasis and learning through spike-timing dependent plasticity Methods and models in neurophysics: Proceedings of the Les Houches SummerSchool 2003, Gutkin B:Hansel D:Meunier C:Dalibard J:Chow C, ed.

Abbott LF, Nelson SB (2000) Synaptic plasticity: taming the beast. Nat Neurosci 3 Suppl:1178-83 [Journal] [PubMed]

Aizenman CD, Linden DJ (2000) Rapid, synaptically driven increases in the intrinsic excitability of cerebellar deep nuclear neurons. Nat Neurosci 3:109-11 [Journal] [PubMed]

Alstrom P, Stassinopoulos D (1995) Versatility and adaptive performance. Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Topics 51:5027-5032 [PubMed]

Bartlett P, Baxter J (1999) Hebbian synaptic modifications in spiking neurons that learn Tech Rep

Bartlett P, Baxter J (2000) Stochastic optimization of controlled partially observable Markov decision processes Proc 39th IEEE Conf Decision and Control

Bartlett PL, Baxter J (1999) Direct gradient-based reinforcement learning: I. Gradient estimation algorithms Tech Rep Australian National University, Research School of Information Sciences and Engineering

Bartlett PL, Baxter J (2000) A biologically plausible and locally optimal learning algorithm for spiking neurons Available online at

Bartlett PL, Baxter J (2000) Estimation and approximation bounds for gradient based reinforcement learning Proc 13th Ann Conf Comput Learn Theory :133-141

Barto AG (1985) Learning by statistical cooperation of self-interested neuron-like computing elements. Hum Neurobiol 4:229-56 [PubMed]

Barto AG, Anandan P (1985) Pattern-recognizing stochastic learning automata IEEE Trans Syst Man Cybernet 15:360-375

Barto AG, Anderson CW (1985) Structural learning in connectionist systems Proc 7th Ann Conf Cogn Sci Soc

Barto AG, Jordan MI (1987) Gradient following without back-propagation in layered networks Proc 1st IEEE Ann Conf Neural Networks, Caudill M:Butler C, ed. pp.629

Baxter J, Bartlett PL (2001) Infinite-horizon policy-gradient estimation J Artif Intell Res 15:319-350

Baxter J, Bartlett PL, Weaver L (2001) Experiments with infinite-horizon, policy-gradient estimation J Artif Intel Res 15:351-381

Baxter J, Weaver L, Bartlett PL (1999) Direct gradient-based reinforcement learning: II. Gradient ascent algorithms and experiments Tech Rep Australian National University, Research School of Information Sciences and Engineering

Bell AJ, Parra LC (2005) Maximising sensitivity in a spiking network Advances in neural information processing systems, Saul LK:Weiss Y:Bottou L, ed. pp.121

Bell CC, Han VZ, Sugawara Y, Grant K (1997) Synaptic plasticity in a cerebellum-like structure depends on temporal order. Nature 387:278-81 [Journal] [PubMed]

Bi GQ, Poo MM (1998) Synaptic modifications in cultured hippocampal neurons: dependence on spike timing, synaptic strength, and postsynaptic cell type. J Neurosci 18:10464-72 [PubMed]

Bohte SM (2004) The evidence for neural information processing with precise spike-times: A survey Natural Comput 3:195-206

Bohte SM, Mozer MC (2005) Reducing spike train variability: A computational theory of spike-timing dependent plasticity Advances in neural information processing systems, Saul LK:Weiss Y:Bottou L, ed. pp.201

Chechik G (2003) Spike-timing-dependent plasticity and relevant mutual information maximization. Neural Comput 15:1481-510 [Journal] [PubMed]

Cudmore RH, Turrigiano GG (2004) Long-term potentiation of intrinsic excitability in LV visual cortical neurons. J Neurophysiol 92:341-8 [Journal] [PubMed]

Dan Y, Poo MM (1992) Hebbian depression of isolated neuromuscular synapses in vitro. Science 256:1570-3 [PubMed]

Dan Y, Poo MM (2004) Spike timing-dependent plasticity of neural circuits. Neuron 44:23-30 [Journal] [PubMed]

Daoudal G, Debanne D (2003) Long-term plasticity of intrinsic excitability: learning rules and mechanisms. Learn Mem 10:456-65 [Journal] [PubMed]

Dauce E, Soula H, Beslon G (2005) Learning methods for dynamic neural networks Proc 2005 Intl Symp Nonlinear Theory and Its Applications

Egger V, Feldmeyer D, Sakmann B (1999) Coincidence detection and changes of synaptic efficacy in spiny stellate neurons in rat barrel cortex. Nat Neurosci 2:1098-105 [Journal] [PubMed]

Farries MA, Fairhall AL (2005) Reinforcement learning with modulated spike timing-dependent plasticity Poster presented at the Computational and Systems Neuroscience Conference (COSYNE 2005). Available online at Abstracts.pdf

Farries MA, Fairhall AL (2005) Reinforcement learning with modulated spike timing-dependent plasticity Program No. 384.3. 2005 Abstract Viewer-Itinerary Planner

Florian RV (2005) A reinforcement learning algorithm for spiking neural networks Proc 7th Intl Symp Symbolic and Numeric Algorithms for Scientific Computing, Zaharie D:Petcu D:Negru V:Jebelean T:Biobanu G:Cicortas A:Abraham A:Paprzycki M, ed. pp.299

Florian RV, Muresan RC (2006) Phase precession and recession with STDP and anti-STDP Proc 16th Intl Conf Artif Neural Networks

Froemke RC, Dan Y (2002) Spike-timing-dependent synaptic modification induced by natural spike trains. Nature 416:433-8 [Journal] [PubMed]

Ganguly K, Kiss L, Poo M (2000) Enhancement of presynaptic neuronal excitability by correlated presynaptic and postsynaptic spiking. Nat Neurosci 3:1018-26 [Journal] [PubMed]

Gerstner W (2001) A framework for spiking neuron models: The spike response method The Handbook of Biological Physics, Gielen S et al, ed. pp.469

Gerstner W, Kistler WM (2002) Spiking neuron models

Gütig R, Aharonov R, Rotter S, Sompolinsky H (2003) Learning input correlations through nonlinear temporally asymmetric Hebbian plasticity. J Neurosci 23:3697-714 [PubMed]

Han VZ, Grant K, Bell CC (2000) Reversible associative depression and nonassociative potentiation at a parallel fiber synapse. Neuron 27:611-22 [PubMed]

Hopfield JJ, Brody CD (2004) Learning rules and network repair in spike-timing-based computation networks. Proc Natl Acad Sci U S A 101:337-42 [Journal] [PubMed]

Huang YY, Simpson E, Kellendonk C, Kandel ER (2004) Genetic evidence for the bidirectional modulation of synaptic plasticity in the prefrontal cortex by D1 receptors. Proc Natl Acad Sci U S A 101:3236-41 [Journal] [PubMed]

Kempter R, Gerstner W, van Hemmen JL (2001) Intrinsic stabilization of output rates by spike-based Hebbian learning. Neural Comput 13:2709-41 [Journal] [PubMed]

Kempter R, Gerstner W, van_Hemmen JL (1999) Hebbian learning and spiking neurons Physical Review E 59:4498-4514 [Journal]

Legenstein R, Naeger C, Maass W (2005) What can a neuron learn with spike-timing-dependent plasticity? Neural Comput 17:2337-82 [Journal] [PubMed]

Li CY, Lu JT, Wu CP, Duan SM, Poo MM (2004) Bidirectional modification of presynaptic neuronal excitability accompanying spike timing-dependent synaptic plasticity. Neuron 41:257-68 [PubMed]

Lin YW, Min MY, Chiu TH, Yang HW (2003) Enhancement of associative long-term potentiation by activation of beta-adrenergic receptors at CA1 synapses in rat hippocampal slices. J Neurosci 23:4173-81 [PubMed]

Marbach P, Tsitsiklis JN (1999) Simulation-based optimization of Markov reward processes: Implementation issues Proc 38th Conf Decision and Control

Marbach P, Tsitsiklis JN (2000) Approximate gradient methods in policy-space optimization of Markov reward processes Discrete Event Dynamic Systems: Theory and Applications 13:111-148

Markram H, Lübke J, Frotscher M, Sakmann B (1997) Regulation of synaptic efficacy by coincidence of postsynaptic APs and EPSPs. Science 275:213-5 [PubMed]

Mazzoni P, Andersen RA, Jordan MI (1991) A more biologically plausible learning rule for neural networks. Proc Natl Acad Sci U S A 88:4433-7 [PubMed]

Neutze R (1995) Ring interferometer with angular acceleration. Phys Rev A 51:5039-5042 [PubMed]

Nick TA, Ribera AB (2000) Synaptic activity modulates presynaptic excitability. Nat Neurosci 3:142-9 [Journal] [PubMed]

Pfister JP, Toyoizumi T, Barber D, Gerstner W (2006) Optimal spike-timing-dependent plasticity for precise action potential firing in supervised learning. Neural Comput 18:1318-48 [Journal] [PubMed]

Pouget A, Deffayet C, Sejnowski TJ (1995) Reinforcement learning predicts the site of plasticity for auditory remapping in the barn owl Advances in neural information processing systems, Fritzke B:Tesauro G:Touretzky DS:Leen TK, ed.

Press WH, Teukolsky SA, Vellerling WT, Flannery BP (1992) Numerical Recipes In C: The Art Of Scientific Computing

Rao RP, Sejnowski TJ (2001) Spike-timing-dependent Hebbian plasticity as temporal difference learning. Neural Comput 13:2221-37 [Journal] [PubMed]

Roberts PD (1999) Computational consequences of temporally asymmetric learning rules: I. Differential hebbian learning. J Comput Neurosci 7:235-46 [Journal] [PubMed]

Roberts PD, Bell CC (2002) Spike timing dependent synaptic plasticity in biological systems. Biol Cybern 87:392-403 [Journal] [PubMed]

Schultz W (2002) Getting formal with dopamine and reward. Neuron 36:241-63 [PubMed]

Seamans JK, Yang CR (2004) The principal features and mechanisms of dopamine modulation in the prefrontal cortex. Prog Neurobiol 74:1-58 [Journal] [PubMed]

Seung HS (2003) Learning in spiking neural networks by reinforcement of stochastic synaptic transmission. Neuron 40:1063-73 [PubMed]

Song S, Miller KD, Abbott LF (2000) Competitive Hebbian learning through spike-timing-dependent synaptic plasticity. Nat Neurosci 3:919-26 [Journal] [PubMed]

Soula H, Alwan A, Beslon G (2004) Obstacle avoidance learning in a spiking neural network Poster presented at Last Minute Results of Simulation of Adaptive Behavior

Soula H, Alwan A, Beslon G (2005) Learning at the edge of chaos: Temporal coupling of spiking neuron controller of autonomous robotic Proc AAAI Spring Symposia on Developmental Robotics

Stassinopoulos D, Bak P (1996) Democratic reinforcement: Learning via self organization Available online at

Strosslin T, Gerstner W (2003) Reinforcement learning in continuous state and action space Available online at

Sutton RS (1988) Learning to predict by the method of temporal diferences Machine Learning 3:9-44

Sutton RS, Barto AG (1998) Reinforcement learning: an introduction [Journal]

   A reinforcement learning example (Sutton and Barto 1998) [Model]

Takita K, Hagiwara M (2002) A pulse neural network learning algorithm for POMDP environment Proc 2002 Intl Joint Conf Neural Networks :1643-1648

Takita K, Hagiwara M (2005) A pulse neural network reinforcement learning algorithm for partially observable Markov decision processes Systems And Computers In Japan 36:42-52

Takita K, Osana Y, Hagiwara M (2001) Reinforcement learning algorithm with network extension for pulse neural network Trans Institute Electrical Engineers Of Japan 121:1634-1640

Thiel CM, Friston KJ, Dolan RJ (2002) Cholinergic modulation of experience-dependent plasticity in human auditory cortex. Neuron 35:567-74 [PubMed]

Toyoizumi T, Pfister JP, Aihara K, Gerstner W (2005) Spike-timing dependent plasticity and mutual information maximization for a spiking neuron model Advances in neural information processing systems, Saul LK:Weiss Y:Bottou L, ed. pp.1409

Turrigiano GG, Nelson SB (2004) Homeostatic plasticity in the developing nervous system. Nat Rev Neurosci 5:97-107 [Journal] [PubMed]

Williams RJ (1992) Simple statistical gradient-following algorithms for connectionist reinforcement learning Mach Learn 8:229-256

Xie X, Seung HS (2004) Learning in neural networks by reinforcement of irregular spiking. Phys Rev E Stat Nonlin Soft Matter Phys 69:041909 [Journal] [PubMed]

Xie X, Seung S (2000) Spike-based learning rules and stabilization of persistent neural activity. Advances in neural information processing systems, Solla SA, Leen T, Mu ller K-R, ed. pp.199

Zhang W, Linden DJ (2003) The other side of the engram: experience-driven changes in neuronal intrinsic excitability. Nat Rev Neurosci 4:885-900 [Journal] [PubMed]

Brzosko Z, Zannone S, Schultz W, Clopath C, Paulsen O (2017) Sequential neuromodulation of Hebbian plasticity offers mechanism for effective reward-based navigation. Elife [Journal] [PubMed]

   Sequential neuromodulation of Hebbian plasticity in reward-based navigation (Brzosko et al 2017) [Model]

Chadderdon GL, Neymotin SA, Kerr CC, Lytton WW (2012) Reinforcement learning of targeted movement in a spiking neuronal model of motor cortex. PLoS One 7:e47251 [Journal] [PubMed]

   Reinforcement learning of targeted movement (Chadderdon et al. 2012) [Model]

Legenstein R, Pecevski D, Maass W (2008) A learning theory for reward-modulated spike-timing-dependent plasticity with application to biofeedback. PLoS Comput Biol 4:e1000180 [Journal] [PubMed]

   Reward modulated STDP (Legenstein et al. 2008) [Model]

Mozafari M, Kheradpisheh SR, Masquelier T, Nowzari-Dalini A, Ganjtabesh M (2018) First-Spike-Based Visual Categorization Using Reward-Modulated STDP IEEE Transactions on Neural Networks and Learning Systems :1-13 [Journal]

   First-Spike-Based Visual Categorization Using Reward-Modulated STDP (Mozafari et al. 2018) [Model]

Nakano T, Otsuka M, Yoshimoto J, Doya K (2015) A spiking neural network model of model-free reinforcement learning with high-dimensional sensory input and perceptual ambiguity. PLoS One 10:e0115620 [Journal] [PubMed]

   A spiking neural network model of model-free reinforcement learning (Nakano et al 2015) [Model]

Neymotin SA, Chadderdon GL, Kerr CC, Francis JT, Lytton WW (2013) Reinforcement learning of two-joint virtual arm reaching in a computer model of sensorimotor cortex. Neural Comput 25:3263-93 [Journal] [PubMed]

   Sensorimotor cortex reinforcement learning of 2-joint virtual arm reaching (Neymotin et al. 2013) [Model]

Richmond P, Buesing L, Giugliano M, Vasilaki E (2011) Democratic population decisions result in robust policy-gradient learning: a parametric study with GPU simulations. PLoS One 6:e18539 [Journal] [PubMed]

   Democratic population decisions result in robust policy-gradient learning (Richmond et al. 2011) [Model]

Rivest F, Kalaska JF, Bengio Y (2010) Alternative time representation in dopamine models. J Comput Neurosci 28:107-30 [Journal] [PubMed]

   Alternative time representation in dopamine models (Rivest et al. 2009) [Model]

Zannone S, Brzosko Z, Paulsen O, Clopath C (2018) Acetylcholine-modulated plasticity in reward-driven navigation: a computational study. Sci Rep 8:9486 [Journal] [PubMed]

   Acetylcholine-modulated plasticity in reward-driven navigation (Zannone et al 2018) [Model]

(86 refs)