Citation Relationships

Legends: Link to a Model Reference cited by multiple papers


Nakano T, Otsuka M, Yoshimoto J, Doya K (2015) A spiking neural network model of model-free reinforcement learning with high-dimensional sensory input and perceptual ambiguity. PLoS One 10:e0115620 [PubMed]

   A spiking neural network model of model-free reinforcement learning (Nakano et al 2015)

References and models cited by this paper

References and models that cite this paper

Bakker B (2002) Reinforcement learning with long short-term memory Neural information processing systems, Dietterich T:Becker S:Ghahramani Z, ed. pp.1475
Belavkin RV, Huyck CR (2011) Conflict resolution and learning probability matching in a neural cell-assembly architecture Cognitive Systems Research 12:93-101
Boerlin M, Denève S (2011) Spike-based population coding and working memory. PLoS Comput Biol 7:e1001080 [Journal] [PubMed]
Doya K (2002) Metalearning and neuromodulation. Neural Netw 15:495-506 [PubMed]
Elfwing S, Otsuka M, Uchibe E, Doya K (2010) Free-energy based reinforcement learning for visionbased navigation with high-dimensional sensory inputs Neural Information Processing. Theory and Algorithms :215-222
Florian RV (2007) Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity. Neural Comput 19:1468-502 [Journal] [PubMed]
Freedman DJ, Assad JA (2006) Experience-dependent representation of visual categories in parietal cortex. Nature 443:85-8 [Journal] [PubMed]
Freedman DJ, Riesenhuber M, Poggio T, Miller EK (2001) Categorical representation of visual stimuli in the primate prefrontal cortex. Science 291:312-6 [Journal] [PubMed]
Gerstner W, Kistler WM (2002) Spiking neuron models
Hinton GE (2002) Training products of experts by minimizing contrastive divergence. Neural Comput 14:1771-800 [Journal] [PubMed]
Hollensen P, Hartono P, Trappenberg T (2011) Topographic RBM as robot controller. The 21st Annual Conference of the Japanese Neural Network Society
Izhikevich EM (2007) Solving the distal reward problem through linkage of STDP and dopamine signaling. Cereb Cortex 17:2443-52 [Journal] [PubMed]
   Linking STDP and Dopamine action to solve the distal reward problem (Izhikevich 2007) [Model]
Jimenez Rezende D, Gerstner W (2014) Stochastic variational learning in recurrent spiking networks. Front Comput Neurosci 8:38 [Journal] [PubMed]
Kwee I, Hutter M (2001) Market-based reinforcement learning in partially observable worlds. Proceedings of the International Conference on Arti Neural Networks (ICANN) :865-873
Matsuda W, Furuta T, Nakamura KC, Hioki H, Fujiyama F, Arai R, Kaneko T (2009) Single nigrostriatal dopaminergic neurons form widely spread and highly dense axonal arborizations in the neostriatum. J Neurosci 29:444-53 [Journal] [PubMed]
Miller EK, Freedman DJ, Wallis JD (2002) The prefrontal cortex: categories, concepts and cognition. Philos Trans R Soc Lond B Biol Sci 357:1123-36 [Journal] [PubMed]
Otsuka M, Yoshimoto J, Doya K (2008) Robust population coding in free-energy-based reinforcement learning Proceedings of the International Conference on Arti Neural Networks (ICANN) :377-386
Otsuka M, Yoshimoto J, Doya K (2010) Free-energy-based reinforcement learning in a partially observable environment European Symposium on Artificial Neural Networks (ESANN) :541-545
Potjans W, Morrison A, Diesmann M (2009) A spiking neural network model of an actor-critic learning agent. Neural Comput 21:301-39 [Journal] [PubMed]
Reynolds JN, Hyland BI, Wickens JR (2001) A cellular mechanism of reward-related learning. Nature 413:67-70 [Journal] [PubMed]
Roberts PD, Santiago RA, Lafferriere G (2008) An implementation of reinforcement learning based on spike timing dependent plasticity. Biol Cybern 99:517-23 [Journal] [PubMed]
Saeb S, Weber C, Triesch J (2009) Goal-directed learning of features and forward models. Neural Netw 22:586-92 [Journal] [PubMed]
Sallans B, Hinton GE (2004) Reinforcement learning with factored states and actions Journal Of Machine Learning Research 5:1063-1088
Samejima K, Ueda Y, Doya K, Kimura M (2005) Representation of action-specific reward values in the striatum. Science 310:1337-40 [Journal] [PubMed]
Schmidhuber J (2014) Deep learning in neural networks: An overview. CoRR abs-1404.7828
Schultz W, Dayan P, Montague PR (1997) A neural substrate of prediction and reward. Science 275:1593-9 [PubMed]
Szatmary B, Izhikevich EM (2010) () Spike-timing theory of working memory PLoS Computational Biology
Whitehead SD, Lin LJ (1995) Reinforcement learning of non-Markov decision processes Artificial Intel 73:271-306
Rössert C, Dean P, Porrill J (2015) At the Edge of Chaos: How Cerebellar Granular Layer Network Dynamics Can Provide the Basis for Temporal Filters. PLoS Comput Biol 11:e1004515 [Journal] [PubMed]
   Basis for temporal filters in the cerebellar granular layer (Roessert et al. 2015) [Model]
Wilson CJ, Beverlin B, Netoff T (2011) Chaotic desynchronization as the therapeutic mechanism of deep brain stimulation. Front Syst Neurosci 5:50 [Journal] [PubMed]
(31 refs)