Citation Relationships

Szita I, Lorincz A (2004) Kalman filter control embedded into the reinforcement learning framework. Neural Comput 16:491-9 [PubMed]

References and models cited by this paper

References and models that cite this paper

Baird LC (1995) Residual algorithms: Reinforcement learning with function approximation Proc. 12th International Conference on Machine Learning (ICML-95) :30-37

Bousquet O, Balakrishnan K, Honavar V (1998) Is the hippocampus a Kalman filter? Proceedings of the Pacific Symposium on Biocomputing :655-666

Bradtke SJ (1993) Reinforcement learning applied to linear quadratic regulation Advances in neural information processing systems, Giles CL:Hanson SJ:Cowan JD, ed. pp.295

Daw ND, Courville AC, Touretzky DS (2004) Timing and partial observability in the dopamine system Advances in neural information processing systems, Still S:Bialek W:Botlou L, ed.

Egorov AV, Hamam BN, Fransén E, Hasselmo ME, Alonso AA (2002) Graded persistent activity in entorhinal cortex neurons. Nature 420:173-8 [Journal] [PubMed]

Gordon GJ (2001) Reinforcement learning with function approximation converges to a region Advances in neural information processing systems, Leen TK:Dietterich TG:Tresp V, ed. pp.1040

Kakade S, Dayan P (2000) Acquisition in autoshaping Advances in neural information processing systems, Solla SA:Leen TK:Muller KR, ed.

Kéri S, Janka Z, Benedek G, Aszalós P, Szatmáry B, Szirtes G, Lörincz A (2002) Categories, prototypes and memory systems in Alzheimer's disease. Trends Cogn Sci 6:132-136 [PubMed]

Landelius T, Knutsson H (1996) Greedy adaptive critics for LQR problems: Convergence proofs Tech. Rep. No. LiTH-ISY-R-1896

Lörincz A, Buzsáki G (2000) Two-phase computational model training long-term memories in the entorhinal-hippocampal region. Ann N Y Acad Sci 911:83-111 [PubMed]

Lorincz A, Szatmary B, Szirtes G (2004) The mystery of structure and function of sensory processing areas of the neocortex: a resolution. J Comput Neurosci 13:187-205 [Journal]

Lorincz A, Szirtes G, Takacs B, Biederman I, Vogels R (2004) Relating priming and repetition suppression. Int J Neural Syst 12:187-201

Montague PR, Dayan P, Sejnowski TJ (1996) A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J Neurosci 16:1936-47 [PubMed]

Murphy KP (2000) A survey of POMDP solution techniques (Available on-line at:

Nádasdy Z, Hirase H, Czurkó A, Csicsvari J, Buzsáki G (1999) Replay and time compression of recurring spike sequences in the hippocampus. J Neurosci 19:9497-507 [PubMed]

Rao RP, Ballard DH (1997) Dynamic model of visual recognition predicts neural response properties in the visual cortex. Neural Comput 9:721-63 [PubMed]

Rao RP, Ballard DH (1999) Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. Nat Neurosci 2:79-87 [Journal] [PubMed]

Schultz W, Dayan P, Montague PR (1997) A neural substrate of prediction and reward. Science 275:1593-9 [PubMed]

Skaggs WE, McNaughton BL (1996) Replay of neuronal firing sequences in rat hippocampus during sleep following spatial experience. Science 271:1870-3 [PubMed]

ten_Hagen S, Krose B (1998) Linear quadratic regulation using reinforcement learning Proceedings of the 8th Belgian-Dutch Conf. on Machine Learning, Verdenius F:van den Broek W, ed. pp.39

Todorov E, Jordan M (2002) Supplementary notes for optimal feedback control as a theory of motor coordination Available on-line at:

Tsitsiklis JN, Van_Roy B (1996) An analysis of temporal-difference learning with function approximation Tech. Rep. No. LIDS-P-2322

Daw ND, Courville AC, Tourtezky DS, Touretzky DS (2006) Representation and timing in theories of the dopamine system. Neural Comput 18:1637-77 [Journal] [PubMed]

(23 refs)