Why Deep Learning Ensembles Outperform Bayesian Neural Networks

Recently I came across an interesting Paper named, “Deep Ensembles: A Loss Landscape Perspective” by a Laxshminarayan et al.In this article, I will break down the paper, summarise it’s findings and delve into some of the techniques and strategies they used that will be useful for delving into understanding models and their learning process. It will also go over some possible extensions to the paper. You can also find my annotations on the paper down below.

The Theory

The authors conjectured (correctly) that Deep Ensembles (an ensemble of Deep learning models) outperform Bayesian Neural Networks because “popular scalable variational Bayesian methods tend to focus on a single mode, whereas deep ensembles tend to explore diverse modes in function space.”

#ai #machine-learning #deep-learning #ensemble-learning #bayesian-machine-learning

The Theory

medium.com

Why Deep Learning Ensembles Outperform Bayesian Neural Networks