
## References and models cited by this paper

- Akaike H (1980) Likelihood and the Bayes procedure. Bayesian statistics, Bernardo NJ, DeGroot MH, Lindley DV, Smith AFM, ed. pp.1411
- Amari S, Park H, Ozeki T (2006) Singularities affect dynamics of learning in neuromanifolds. Neural Comput 18:1007-65
- Aoyagi M, Watanabe S (2004) Stochastic complexities of reduced rank regression in Bayesian estimation. Neural Networks 18:924-933
- Attias H (1999) Inferring parameters and structure of latent variable models by variational Bayes. In Proceedings of Fifteenth Conference on Uncertainty in Artificial Intelligence
- Baldi PF, Hornik K (1995) Learning in linear neural networks: a survey. IEEE Trans Neural Netw 6:837-58
- Bickel P, Chernoff H (1993) Asymptotic distribution of the likelihood ratio statistic in a prototypical non regular problem. Statistics and Probability: A Raghu Raj Bahadur Festschrift, Ghosh JK, Mitra SK, Parthasarathy KR, Prakasa BL, ed. pp.83
- Cramer H (1951) Mathematical methods of statistics.
- Dacunha-Castelle D, Gassiat E (1997) Testing in locally conic models, and application to mixture models. Probability And Statistics 1:285-317
- Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc B 39:1-38
- Efron B, Morris C (1973) Stein's estimation rule and its competitors: an empirical Bayes approach. J Am Stat Assoc 68:117-130
- Fukumizu K (1999) Generalization error of linear neural networks in unidentifiable cases. Algorithmic learning theory: Proceedings of the 10th International Conference on Algorithmic Learning Theory (ALT99), Watanabe O, Yokomori T, ed. pp.51
- Fukumizu K (2003) Likelihood ratio of unidentifiable models and multilayer neural networks. Annals Of Statistics 31:833-851
- Ghahramani Z, Beal MJ (2001) Graphical models and variational methods. Advanced mean field methods, Opper M, Saad D, ed. pp.161
- Hagiwara K (2002) On the problem in model selection of neural network regression in overrealizable scenario. Neural Comput 14:1979-2002
- Hartigan JA (1985) A failure of likelihood asymptotics for normal mixtures. Proc Berkeley Conf in Honor of J Neyman and J Kiefer 2:807-810
- Hinton GE, van Camp D (1993) Keeping neural networks simple by minimizing the description length of the weights. Proc Conf Computational Learning Theory :5-13
- Hosino T, Watanabe K, Watanabe S (2005) Stochastic complexity of variational Bayesian hidden Markov models. Proc Intl Joint Conf Neural Networks
- Jaakkola TS, Jordan MI (2000) Bayesian parameter estimation via variational methods. Statistics And Computing 10:25-37
- James W, Stein C (1961) Estimation with quadratic loss. Proc Fourth Berkeley Symposium Mathematical Statistics And Probability, Neyman J, ed. pp.361
- Kuriki S, Takemura A (2001) Tail probabilities of the maxima of multilinear forms and their applications. Ann Stat 29:328-371
- Levin E, Tishby N, Solla SA (1990) A statistical approach to learning and generalization in layered neural networks. Proc IEEE 78:1568-1674
- Mackay DJC (1995) Developments in probabilistic modeling with neural networks: ensemble learning. Proc 3rd Ann Symp Neural Networks :191-198
- Nakajima S, Watanabe S (2005) Generalization error of linear neural networks in an empirical Bayes approach. Proc IJCAI :804-810
- Nakajima S, Watanabe S (2006) Generalization performance of subspace Bayes approach in linear neural networks. IEICE Trans 89:1128-1138
- Nakano N, Watanabe S (2005) Stochastic complexity of layered neural networks in mean field approximation. Proc ICONIP
- Reinsel GC, Velu RP (1998) Multivariate reduced-rank regression
- Rusakov D, Geiger D (2002) Asymptotic model selection for naive Bayesian networks. Proc Conf Uncertainty Artif Intel :438-445
- Stein C (1956) Inadmissibility of the usual estimator for the mean of a multivariate normal distribution. Proceedings Of The 3rd Berkeley Symposium On Mathematical Statistics And Probability 1:197-206
- Takemura A, Kuriki S (1997) Weights of chi-bar square distribution for smooth or piecewise smooth cone alternatives. Ann Stat 25:2368-2387
- Wang B, Titterington DM (2004) Convergence and asymptotic normality of variational Bayesian approximations for exponential family models with missing values. Proc Conf Uncertainty Artif Intel :577-584
- Watanabe K, Watanabe S (2006) Stochastic complexities of Gaussian mixtures in variational Bayesian approximation. J Mach Learn Res 7:625-644
- Watanabe S (1995) A generalized Bayesian framework for neural networks with singular Fisher information matrices. Proc Intl Symposium Nonlinear Theory And Its Applications 2:207-210
- Watanabe S (2001) Algebraic analysis for nonidentifiable learning machines. Neural Comput 13:899-933
- Watanabe S (2001) Algebraic information geometry for learning machines with singularities. Advances in neural information processing systems, Leen TK, Dietterich TG, Tresp V, ed. pp.329
- Watanabe S, Amari S (2003) Learning coefficients of layered models when the true distribution mismatches the singularities. Neural Comput 15:1013-1033
- Wachter KW (1978) The strong limits of random matrix spectra for sample matrices of independent elements. Ann Prob 6:1-18
- Yamazaki K, Nagata K, Watanabe S (2005) A new method of model selection based on learning coefficient. Proc Intl Symposium Nonlinear Theory and Its Applications :389-392
- Yamazaki K, Watanabe S (2002) Resolution of singularities in mixture models and its stochastic complexity. Proc 9th Intl Conf Neural Information Process :1355-1359
- Yamazaki K, Watanabe S (2003) Stochastic complexities of hidden Markov models. Proc Neural Networks Signal Process :179-188
- Yamazaki K, Watanabe S (2003) Stochastic complexity of Bayesian networks. Proc 19th Conf Uncertainty Artif Intell :592-599