Organizers
Time
Monday 14th, June 2010, Afternoon
Duration
4 hours
Short Description
Sparse coding calls for modelling data vectors as a linear combination of a few
elements from a dictionary. Whereas the question of -designing- the best
dictionary adapted to natural signals has been the topic of much research in
the past, this tutorial focuses on recent techniques that -learn-
the basis set from training data. This has already proven very
effective for signal reconstruction, leading to state-of-the-art results for
many signal or image processing tasks, as well as advances in computer vision
tasks such as object recognition.
A critical component in this approach is how to sparsely encode a signal
given the dictionary. The tutorial will describe the state of the art in this
area, ranging from greedy algorithms and l1-optimization to simultaneous sparse
coding of collections of signals. The actual dictionary plays a critical role,
and it has been shown once and again that learned dictionaries significantly
outperforms off-the-shelf ones such as wavelets. The second part of this
tutorial will present the dictionary learning formulation and its links
with existing matrix factorization techniques, as well as state-of-the-art applications
to image processing tasks.
The third part of the tutorial will be devoted to the most recent
advances in the field, starting from learning dictionaries of image
descriptors, a strategy used by the latest PASCAL VOC'09 challenge
winners, to task-driven dictionaries and structured sparsity.
A similar tutorial was given at ICCV'09 in Kyoto, and its success
prompts us to hold a second tutorial at CVPR'2010, with updates and new material.
Preliminary Syllabus
- Introduction: What sparse coding and dictionary learning are about and why one should care?
- Part I:Sparse Models in Machine Learning.
- Sparse Models
- Dictionary Learning, Vector Quantization and Matrix Factorization
- Grouped Sparsity
- Part II:Applications in Image Processing
- Dictionary Learning for Image Denoising and Inpainting
- Learning Dictionaries for Compressed Sensing
- Alternative Models
- Part III:Optimization Techniques for Sparse Coding
- Greedy Approaches
- Convex Optimization
- Optimization for Dictionary Learning
- Part IV: Recent advances in Computer Vision and New Models
- Discriminative Dictionaries for Patch Classification
- Learning Codebooks for Image Classification
- Structured Sparse Models
- Open questions and discussion
Course Material and Software
A Matlab toolbox for sparse decomposition and dictionary learning is available here.
The slides will all be available soon.
Relevant References
Among the vast literature on sparse coding, here are a few selected publications. The list is preliminary and subject to modifications:
- B. Efron, T. Hastie, I. Johnstone, and R. Tibshirani. Least angle regression. Annals of
statistics, 32(2):407--499, 2004.
- J. Friedman, T. Hastie, H. Hölfling, and R. Tibshirani. Pathwise coordinate optimization.
Annals of statistics, 1(2):302--332, 2007.
- B. A. Olshausen and D. J. Field. Sparse coding with an overcomplete basis set: A strategy
employed by V1? Vision Research, 37:3311--3325, 1997.
- M. Elad and M. Aharon. Image denoising via sparse and redundant representations over
learned dictionaries.IEEE Transactions on Image Processing, 54(12):3736--3745,
2006.
- R Jenatton, JY Audibert, F Bach. Structured variable selection with sparsity-inducing norms. arXiv:0904.3523 . 2009.
- J. Mairal, F. Bach, J. Ponce and G. Sapiro. Online Learning for Matrix Factorization and Sparse Coding. Journal of Machine Learning Research, 11:19--60, 2010
- J. Mairal, F. Bach, J. Ponce, G. Sapiro and A. Zisserman. Non-Local Sparse Models for Image Restoration. International Conference on Computer Vision. 2009.
- J. Mairal, F. Bach, J. Ponce, G. Sapiro and A. Zisserman. Discriminative Learned Dictionaries for Local Image Analysis. IEEE Conference on Computer Vision and Pattern Recognition, 2008.
- J. Yang, K. Yu, Y. Gong, and T. Huang. Linear Spatial Pyramid Matching Using Sparse Coding for Image Classification. IEEE Conference on Computer Vision and Pattern Recognition. 2009.
Biographies
Francis Bach is a researcher in the Willow INRIA project-team, in the
Computer Science Department of the Ecole Normale Supérieure, Paris,
France. He graduated from the Ecole Polytechnique, Palaiseau, France,
in 1997, and earned his PhD in 2005 from the Computer Science division
at the University of California, Berkeley. His research interests
include machine learning, statistics, optimization, graphical models, kernel
methods, sparse methods and statistical signal processing.
Julien Mairal received the graduate degree from Ecole
Polytechnique and Ecole Nationale Supérieure des Téléecommunications,
Paris, and the MS degree from the Ecole Normale Supérieure, Cachan. He is
currently pursuing the Ph.D. degree under the supervision of Jean Ponce and
Francis Bach at Ecole Normale Supérieure, Paris. His research interests
include machine learning, computer vision and image processing.
Jean Ponce is a computer science professor at Ecole Normale
Supérieure (ENS) in Paris, France, where he heads the joint ENS/INRIA/CNRS
oject-team WILLOW. Before joining ENS, he spent most of his career in the
US, with positions at MIT, Stanford, and the University of Illinois at
Urbana-Champaign, where he was a full professor until 2005. Jean Ponce is the
author of over 120 technical publications in computer vision and robotics,
including the textbook ``Computer Vision: A Modern Approach'', which has been
translated in Chinese, Japanese, and Russian. He is an IEEE Fellow, served as
editor-in-chief for the International Journal of Computer Vision from 2003 to
2008, and chaired the IEEE Conference on Computer Vision and Pattern
Recognition in 1997 and 2000, and the European Conference on Computer Vision
in 2008.

Guillermo Sapiro was born in Montevideo, Uruguay, on April 3,
1966. He received his B.Sc. (summa cum laude), M.Sc., and Ph.D. from the
Department of Electrical Engineering at the Technion, Israel Institute of
Technology, in 1989, 1991, and 1993 respectively. After post-doctoral research
at MIT, Dr. Sapiro became Member of Technical Staff at the research facilities
of HP Labs in Palo Alto, California. He is currently with the Department of
Electrical and Computer Engineering at the University of Minnesota, where he
holds the position of Distinguished McKnight University Professor and
Vincentine Hermes-Luh Chair in Electrical and Computer Engineering. He works on
differential geometry and geometric partial differential equations, both in
theory and applications in computer vision, computer graphics, medical imaging,
and image analysis. He recently co-edited a special issue of IEEE Image
Processing in this topic and a second one in the Journal of Visual
Communication and Image Representation. He has authored and co-authored
numerous papers in this area and has written a book published by Cambridge
University Press, January 2001. He was awarded the Gutwirth Scholarship for
Special Excellence in Graduate Studies in 1991, the Ollendorff Fellowship for
Excellence in Vision and Image Understanding Work in 1992, the Rothschild
Fellowship for Post-Doctoral Studies in 1993, the Office of Naval Research
Young Investigator Award in 1998, the Presidential Early Career Awards for
Scientist and Engineers (PECASE) in 1998, and the National Science Foundation
Career Award in 1999. He is the funding Editor-in-Chief of the SIAM Journal on
Imaging Sciences.