ICCV 2015 Tutorial: From Image Statistics to Deep Learning

Full-day tutorial in conjunction with ICCV 2015.
Friday, December 11, 2015, Santiago, Chile

The tutorial's program and slides are available here.

Description

Over the last few years, deep learning has delivered state-of-the-art results on a host of problems in computer vision, and in machine learning more generally. Even though the basic techniques, e.g. classification with deep convolutional networks trained by loss minimization through back-propagation, are fairly straightforward, the wealth of broader techniques that pertain to probabilistic deep modeling may be less accessible on a first encounter.

The goal of this tutorial is to elaborate on the ties between probabilistic approaches to deep learning and traditional methods from statistical signal processing and statistical data modeling. Our aim is to provide the background necessary to make a larger part of probabilistic deep learning accessible to beginning graduate students with an electrical engineering and/or computer science background.

In particular, flat, single-layer probabilistic models have been extensively studied in image processing, computer vision, and machine learning, and in several cases they come equipped with exact methods for inference and learning: for instance, linear models with or without sparsity constraints often admit analytic Maximum-a-Posteriori (MAP) solutions, while Maximum Entropy parameter estimation for the exponential family leads to convex optimization problems with unique solutions. Such models are therefore easier to understand, and they provide a solid foundation on which to develop intuition for deeper, less tractable models.
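As a concrete illustration of such an analytic MAP solution (our own worked example, not taken from the tutorial materials): for a linear Gaussian model $y = Xw + \varepsilon$ with noise $\varepsilon \sim \mathcal{N}(0, \sigma^2 I)$ and prior $w \sim \mathcal{N}(0, \tau^2 I)$, maximizing the posterior reduces to a regularized least-squares problem with a closed-form solution,

\[
\hat{w}_{\mathrm{MAP}}
= \arg\min_w \; \frac{1}{2\sigma^2}\,\|y - Xw\|_2^2 + \frac{1}{2\tau^2}\,\|w\|_2^2
= \left(X^\top X + \frac{\sigma^2}{\tau^2}\, I\right)^{-1} X^\top y ,
\]

i.e. ridge regression; swapping in a sparsity-inducing Laplace prior turns the same objective into the convex $\ell_1$-regularized Lasso problem.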

To this end, we will approach deep generative models from a statistical image modeling perspective, starting from linear models, moving on to Boltzmann-Gibbs distributions, Markov Random Fields, and Restricted Boltzmann Machines, and eventually covering Deep Belief Networks and Deep Boltzmann Machines. Finally, we will present non-probabilistic deep relaxations such as convolutional neural networks and stacked auto-encoders, exploiting the intuitions developed through probabilistic modeling.
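To fix notation for the models above (a standard textbook form, not a summary of the tutorial slides): a binary Restricted Boltzmann Machine defines a Boltzmann-Gibbs distribution over visible units $v$ and hidden units $h$,

\[
p(v, h) = \frac{1}{Z} \exp\bigl(-E(v, h)\bigr),
\qquad
E(v, h) = -v^\top W h - b^\top v - c^\top h ,
\]

where $Z$ is the partition function. The bipartite structure makes the conditionals $p(h \mid v)$ and $p(v \mid h)$ factorize into independent sigmoid units, which is what renders block Gibbs sampling and contrastive-divergence learning practical; Deep Belief Networks and Deep Boltzmann Machines are then built by stacking such layers.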

Organizers