Citation Relationships


Montague PR, Dayan P, Sejnowski TJ (1996) A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J Neurosci 16:1936-47 [PubMed]

References and models cited by this paper

References and models that cite this paper

Acquas E, Carboni E, Di Chiara G (1991) Profound depression of mesolimbic dopamine release after morphine withdrawal in dependent rats. Eur J Pharmacol 193:133-4 [PubMed]
Bechara A, Damasio AR, Damasio H, Anderson SW (1994) Insensitivity to future consequences following damage to human prefrontal cortex. Cognition 50:7-15 [PubMed]
Bernheimer H, Birkmayer W, Hornykiewicz O, Jellinger K, Seitelberger F (1973) Brain dopamine and the syndromes of Parkinson and Huntington. Clinical, morphological and neurochemical correlations. J Neurol Sci 20:415-55 [PubMed]
Bush RR, Mosteller F (1955) Stochastic models for learning
Crippens D, Robinson TE (1994) Withdrawal from morphine or amphetamine: different effects on dopamine in the ventral-medial striatum studied with microdialysis. Brain Res 650:56-62 [PubMed]
DeLong MR, Crutcher MD, Georgopoulos AP (1983) Relations between movement and single cell discharge in the substantia nigra of the behaving monkey. J Neurosci 3:1599-606 [PubMed]
Diana M, Pistis M, Carboni S, Gessa GL, Rossetti ZL (1993) Profound decrement of mesolimbic dopaminergic neuronal activity during ethanol withdrawal syndrome in rats: electrophysiological and biochemical evidence. Proc Natl Acad Sci U S A 90:7966-9 [PubMed]
Dickinson A (1980) Contemporary animal learning theory
Doya K, Sejnowski TJ (1995) A novel reinforcement model of birdsong vocalization learning Advances in neural information processing systems, Tesauro G:Touretzky D:Alspector J, ed.
Egelman DM, Person C, Montague PR (1995) A predictive model for diffuse systems matches human choices in a simple decision-making task Soc Neurosci Abstr 21:2087
Fibiger HC, Phillips AG (1986) Reward, motivation, cognition: psychobiology of mesotelencephalic dopamine systems Handbook of physiology. The nervous system. Intrinsic regulatory systems of the brain 14:647-675
Freeman AS, Bunney BS (1987) Activity of A9 and A10 dopaminergic neurons in unrestrained rats: further characterization and effects of apomorphine and cholecystokinin. Brain Res 405:46-55 [PubMed]
Gallagher M, Holland PC (1994) The amygdala complex: multiple roles in associative learning and attention. Proc Natl Acad Sci U S A 91:11771-6 [PubMed]
Gallistel CR (1990) The Organization of Learning
Grossberg S, Levine DS (1987) Neural dynamics of attentionally modulated Pavlovian conditioning: blocking, interstimulus interval, and secondary reinforcement. Appl Opt 26:5015-30 [Journal] [PubMed]
Grossberg S, Schmajuk NA (1989) Neural dynamics of adaptive timing and temporal discrimination during associative learning Neural Netw 2:79-102
Harder LD, Real LA (1987) Why are bumble bees risk averse? Ecology 68:1104-1108
Herrnstein RJ (1961) Relative and absolute strength of response as a function of frequency of reinforcement. J Exp Anal Behav 4:267-72 [Journal] [PubMed]
Herrnstein RJ (1991) Experiments on stable suboptimality in individual behavior Am Econ Rev Papers Proc 83:360-364
Houk JC, Adams JL, Barto AG (1995) A model of how the basal ganglia generate and use neural signals that predict reinforcement. Models Of Information Processing In The Basal Ganglia, Houk JC:Davis JL:Beiser DG, ed. pp.249
Kalman RE (1960) A new approach to linear filtering and prediction problems Trans ASME J Basic Eng 82:35-45
Klopf AH (1982) The hedonistic neuron
Koob GF, Bloom FE (1988) Cellular and molecular mechanisms of drug dependence. Science 242:715-23 [PubMed]
Ljungberg T, Apicella P, Schultz W (1992) Responses of monkey dopamine neurons during learning of behavioral reactions. J Neurophysiol 67:145-63 [Journal] [PubMed]
Luce RD, Raiffa H (1957) Games and decisions: introduction and critical survey
Mackintosh NJ (1983) Conditioning and associative learning
Montague PR (1996) Biological substrates of predictive mechanisms in learning and action choice Neural-network approaches to cognition: biobehavioral foundations, Donahoe PP, ed.
Montague PR, Dayan P, Nowlan SJ, Sejnowski TJ (1993) Using aperiodic reinforcement for directed self-organization Advances in neural information processing systems, Giles CL:Hanson SJ:Cowan JD, ed.
Montague PR, Dayan P, Person C, Sejnowski TJ (1995) Bee foraging in uncertain environments using predictive hebbian learning. Nature 377:725-8 [Journal] [PubMed]
Montague PR, Sejnowski TJ (1994) The predictive brain: temporal coincidence and temporal order in synaptic learning mechanisms. Learn Mem 1:1-33 [PubMed]
Oades RD, Halliday GM (1987) Ventral tegmental (A10) system: neurobiology. 1. Anatomy and connectivity. Brain Res 434:117-65 [PubMed]
Parsons LH, Smith AD, Justice JB (1991) Basal extracellular dopamine is decreased in the rat nucleus accumbens during abstinence from chronic cocaine. Synapse 9:60-5 [Journal] [PubMed]
Quartz SR, Dayan P, Montague PR, Sejnowski TJ (1992) Expectation learning in the brain using diffuse ascending projections Soc Neurosci Abstr 18:1210
Real LA (1991) Animal choice behavior and the evolution of cognitive architecture. Science 253:980-6 [PubMed]
Rescorla R, Wagner A (1972) A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and non-reinforcement Classical Conditioning II: Current Research and Theory, Black A:Prokasy W, ed. pp.64
Romo R, Schultz W (1990) Dopamine neurons of the monkey midbrain: contingencies of responses to active touch during self-initiated arm movements. J Neurophysiol 63:592-606 [Journal] [PubMed]
Rossetti ZL, Hmaidan Y, Gessa GL (1992) Marked inhibition of mesolimbic dopamine release: a common feature of ethanol, morphine, cocaine and amphetamine abstinence in rats. Eur J Pharmacol 221:227-34 [PubMed]
Sawaguchi T, Goldman-Rakic PS (1991) D1 dopamine receptors in prefrontal cortex: involvement in working memory. Science 251:947-50 [PubMed]
Schultz W (1992) Activity of dopamine neurons in the behaving primate Semin Neurosci 4:129-138
Schultz W, Apicella P, Ljungberg T (1993) Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task. J Neurosci 13:900-13 [PubMed]
Schultz W, Romo R (1990) Dopamine neurons of the monkey midbrain: contingencies of responses to stimuli eliciting immediate behavioral reactions. J Neurophysiol 63:607-24 [Journal] [PubMed]
Sutton RS (1988) Learning to predict by the method of temporal differences Machine Learning 3:9-44
Sutton RS, Barto A (1987) A temporal-difference model of classical conditioning Proceedings of the Ninth Annual Conference of the Cognitive Science Society :355-378
Sutton RS, Barto AG (1981) Toward a modern theory of adaptive networks: expectation and prediction. Psychol Rev 88:135-70 [PubMed]
Sutton RS, Barto AG (1990) Time-derivative models of Pavlovian reinforcement Learning and computational neuroscience: Foundations of adaptive networks, Gabriel M:Moore J, ed. pp.497
Tranel D, Damasio AR (1985) Knowledge without awareness: an autonomic index of facial recognition by prosopagnosics. Science 228:1453-4 [PubMed]
von Neumann J, Morgenstern O (1947) Theory of games and economic behavior
Wickens J, Kotter R (1995) Cellular models of reinforcement Models of information processing in the basal ganglia, Houk JC:Davis JL:Beiser DG, ed.
Widrow B, Stearns SD (1985) Adaptive signal processing
Wise RA (1982) Neuroleptics and operant behavior: the anhedonia hypothesis Behav Brain Sci 5:39
Wise RA, Bozarth MA (1984) Brain reward circuitry: four circuit elements "wired" in apparent series. Brain Res Bull 12:203-8 [PubMed]

References and models that cite this paper

Bogacz R, Gurney K (2007) The basal ganglia and cortex implement optimal decision making between alternative actions. Neural Comput 19:442-77 [Journal] [PubMed]
Daw ND, Courville AC, Touretzky DS (2006) Representation and timing in theories of the dopamine system. Neural Comput 18:1637-77 [Journal] [PubMed]
Gruber AJ, Solla SA, Surmeier DJ, Houk JC (2003) Modulation of striatal single units by expected reward: a spiny neuron model displaying dopamine-induced bistability. J Neurophysiol 90:1095-114 [Journal] [PubMed]
   Spiny neuron model with dopamine-induced bistability (Gruber et al 2003) [Model]
Gurney KN, Humphries MD, Redgrave P (2015) A new framework for cortico-striatal plasticity: behavioural theory meets in vitro data at the reinforcement-action interface. PLoS Biol 13:e1002034 [Journal] [PubMed]
   Cortico-striatal plasticity in medium spiny neurons (Gurney et al 2015) [Model]
Gutkin BS, Dehaene S, Changeux JP (2006) A neurocomputational hypothesis for nicotine addiction. Proc Natl Acad Sci U S A 103:1106-11 [Journal] [PubMed]
Hasselmo ME (2005) A model of prefrontal cortical mechanisms for goal-directed behavior. J Cogn Neurosci 17:1115-29 [Journal] [PubMed]
   Prefrontal cortical mechanisms for goal-directed behavior (Hasselmo 2005) [Model]
Hazy TE, Frank MJ, O'Reilly RC (2007) Towards an executive without a homunculus: computational models of the prefrontal cortex/basal ganglia system. Philos Trans R Soc Lond B Biol Sci 362:1601-13 [Journal] [PubMed]
Izhikevich EM (2007) Solving the distal reward problem through linkage of STDP and dopamine signaling. Cereb Cortex 17:2443-52 [Journal] [PubMed]
   Linking STDP and Dopamine action to solve the distal reward problem (Izhikevich 2007) [Model]
Kato A, Morita K (2016) Forgetting in Reinforcement Learning Links Sustained Dopamine Signals to Motivation. PLoS Comput Biol 12:e1005145 [Journal] [PubMed]
   Reinforcement Learning with Forgetting: Linking Sustained Dopamine to Motivation (Kato Morita 2016) [Model]
Keramati M, Dezfouli A, Piray P (2011) Speed/accuracy trade-off between the habitual and the goal-directed processes. PLoS Comput Biol 7:e1002055 [Journal] [PubMed]
   Speed/accuracy trade-off between the habitual and the goal-directed processes (Keramati et al. 2011) [Model]
Morita K, Kato A (2014) Striatal dopamine ramping may indicate flexible reinforcement learning with forgetting in the cortico-basal ganglia circuits. Front Neural Circuits 8:36 [Journal] [PubMed]
   Striatal dopamine ramping: an explanation by reinforcement learning with decay (Morita & Kato, 2014) [Model]
Moustafa AA, Cohen MX, Sherman SJ, Frank MJ (2008) A role for dopamine in temporal decision making and reward maximization in parkinsonism. J Neurosci 28:12294-304 [Journal] [PubMed]
O'Reilly RC, Frank MJ (2005) Making Working Memory Work: A Computational Model of Learning in the Prefrontal Cortex and Basal Ganglia Neural Comput 18:283-328
O'Reilly RC, Frank MJ (2006) Making working memory work: a computational model of learning in the prefrontal cortex and basal ganglia. Neural Comput 18:283-328 [Journal] [PubMed]
Rivest F, Kalaska JF, Bengio Y (2010) Alternative time representation in dopamine models. J Comput Neurosci 28:107-30 [Journal] [PubMed]
   Alternative time representation in dopamine models (Rivest et al. 2009) [Model]
Sakai Y, Fukai T (2008) The actor-critic learning is behind the matching law: matching versus optimal behaviors. Neural Comput 20:227-51 [Journal] [PubMed]
Sejnowski TJ, Destexhe A (2000) Why do we sleep? Brain Res 886:208-223 [PubMed]
Sharp PE, Blair HT, Brown M (1996) Neural network modeling of the hippocampal formation spatial signals and their possible role in navigation: a modular approach. Hippocampus 6:720-34 [Journal] [PubMed]
Smith AJ, Becker S, Kapur S (2005) A computational model of the functional role of the ventral-striatal D2 receptor in the expression of previously acquired behaviors. Neural Comput 17:361-95 [Journal] [PubMed]
Szita I, Lorincz A (2004) Kalman filter control embedded into the reinforcement learning framework. Neural Comput 16:491-9 [PubMed]
Zannone S, Brzosko Z, Paulsen O, Clopath C (2018) Acetylcholine-modulated plasticity in reward-driven navigation: a computational study. Sci Rep 8:9486 [Journal] [PubMed]
   Acetylcholine-modulated plasticity in reward-driven navigation (Zannone et al 2018) [Model]
(75 refs)