Chap5.7-BayesianNeuralNetworks

Chap5.7-BayesianNeuralNetworks - Machine Learning Srihari...

Info iconThis preview shows pages 1–6. Sign up to view the full content.

View Full Document Right Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
This is the end of the preview. Sign up to access the rest of the document.

Unformatted text preview: Machine Learning Srihari Bayesian Neural Networks Sargur Srihari [email protected] 1 Machine Learning Srihari Topics discussed here 1. Why Bayesian? 2. DifFculty of exact Bayesian treatment and need for approximation 3. Two approximate approaches • Variational • Laplace (one discussed here) 4. Bayesian neural network for regression • Posterior parameter distribution • Hyper-parameter optimization 5. Bayesian neural network for classiFcation 2 Machine Learning Srihari Why Bayesian? • More complex models ft data better but generalize poorly • Linear with two Free parameters, quadratic with three, cubic with Four? • Occam ʼ s razor says that unnecessarily complex models should not be preFerred to simpler ones • Neural networks are popular but notoriously lack objective grounding • Bayesian approach allows diFFerent models to be compared (no oF hidden units) 3 Machine Learning Srihari Classical and Bayesian neural networks • Classical neural networks use maximum likelihood • To determine network parameters (weights and biases) • Regularized maximum likelihood is MAP (maximum a posteriori) • Regularizer is the logarithm of prior parameter distribution • Bayesian treatment marginalizes over distribution of parameters in order to make prediction 4 Machine Learning Srihari Need for Approximation in Bayesian treatment • In simple linear regression problem, under assumption of Gaussian noise • Posterior is Gaussian and evaluated exactly • Predictive distribution found in closed form...
View Full Document

This document was uploaded on 02/25/2012.

Page1 / 14

Chap5.7-BayesianNeuralNetworks - Machine Learning Srihari...

This preview shows document pages 1 - 6. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online