Modeling visual knowledge from large-scale data
Thoth is a joint team of Inria and the Laboratoire Jean Kuntzmann that
started in January 2016. It is the follow-up to the LEAR team (2003-2015).
Thoth is motivated by today's context in which the quantity of digital images
and videos available on-line continues to grow at a phenomenal speed. The main
objectives of the team are: (i) designing and learning structured models
capable of representing this visual information; (ii) learning visual models
from minimal supervision or unstructured meta-data; and (iii) large-scale
learning and optimization. An additional focus of Thoth is the collection of
appropriate datasets and the design of accompanying evaluation protocols.
For more information, see our research description page and our annual reports for
2019, 2018, 2017, 2016, 2015, 2014, 2013, 2012, 2011, 2010, 2009, 2008, 2007, 2006, 2005, 2004, 2003.
Highlights
2020
Our recent papers in major conferences (3 NeurIPS'20, 3 ICML'20, 1 ICLR'20, 5 ECCV'20, 2 CVPR'20 papers) are available on our publications page.
Alberto Bietti received the best PhD award of the University Grenoble Alpes!
2019
Our recent papers in major conferences (4 NeurIPS'19, 4 ICML'19, 2 ICLR'19, 5 ICCV'19, 2 CVPR'19 papers) are available on our publications page.
Cordelia Schmid received the Royal Society Milner Award, 2019.
Julien Mairal received the test-of-time award at the International Conference on Machine Learning (ICML), 2019.
2018
Our recent papers in major conferences (2 NeurIPS'18, 1 ICML'18, 6 ECCV'18, 5 CVPR'18 papers) are available on our publications page.
Thoth is co-organizing an AI summer school from July 2 to July 6, 2018.
Alberto Bietti received the Dodu award during the Journees SMAI-MODE at Autrans, 2018.
2017
2016
2015
We obtained top-ranked results in the VOT-TIR track of the visual object tracking challenge 2015. For more details see the competition results summary.
INRIA Grenoble has been selected as an NVIDIA GPU Research Center. For more details see NVIDIA academic collaboration.
Ramazan Gokberk Cinbis (PhD, 2014) was awarded the 2014 AFRIF thesis prize for his thesis entitled "Fisher kernel based models for image classification and object localization". He was supervised by Jakob Verbeek and Cordelia Schmid. More details at AFRIF laureats.
Navneet Dalal (PhD, 2006) and Bill Triggs, two former members of the team, were awarded the Longuet-Higgins Prize for their paper entitled "Histograms of Oriented Gradients for Human Detection" (CVPR 2005 paper). More details at awards CVPR'15.
We organized a 2-day workshop at Inria Grenoble. The program and the slides from the talks are available online.
Our recent papers in major conferences (3 CVPR'15 papers, 8 ICCV'15 papers, 1 COLT'15 paper and 2 NIPS'15 papers) are available on our publications page.
2014
We obtained top-ranked results in the localization track of the Thumos 2014 Action Recognition Challenge. The goal of the challenge is to evaluate large-scale action recognition in natural settings. The dataset used is the UCF101 dataset, which is currently the largest action dataset both in terms of number of categories and clips, with more than 13,000 clips drawn from 101 action classes. This year special attention was paid to the classification of uncropped videos, where the action of interest appears in videos that also contain non-relevant sections.
Cordelia Schmid was awarded the Longuet-Higgins Prize (for the 2nd time) in 2014 for her CVPR paper co-authored with Krystian Mikolajczyk entitled "A performance evaluation of local descriptors" (extended TPAMI version). More details at awards CVPR'14.
We organized a 2-day workshop on "Weakly Supervised Learning and Video Recognition" at Inria Grenoble. The program and the slides from the talks are available online.
Our recent papers in major conferences (5 CVPR'14 papers, 5 ECCV'14 papers, 2 ICML'14 papers and 1 NIPS'14 paper) are available on our publications page.
2013
We obtained top-ranked results in the Thumos 2013 Action Recognition Challenge. The goal of the challenge is to evaluate large-scale action recognition in natural settings. The dataset used is the newly released UCF101 dataset, which is currently the largest action dataset both in terms of number of categories and clips, with more than 13,000 clips drawn from 101 action classes.
Our recent papers in major conferences (3 CVPR'13 papers, 9 ICCV'13 papers, 1 NIPS'13 paper, and 1 ICML'13 paper) are available on our publications page.
LEAR participated in the TRECVID MED 2013 challenge together with the AXES project and finished in first position. The Multimedia Event Detection (MED) evaluation track is part of the TRECVID Evaluation. The goal of MED is to assemble core detection technologies into a system that can search multimedia recordings for user-defined events based on pre-computed metadata.
We are co-organizing the workshop on Greedy Algorithms, Frank-Wolfe and Friends - A modern perspective as part of NIPS 2013, Lake Tahoe, Nevada, USA, December 10, 2013.
The fourth INRIA Visual Recognition and Machine Learning Summer School took place at the Ecole Normale Superieure campus in Paris, from July 22 to 26, 2013.
2012
Cordelia Schmid was awarded one of the ERC advanced grants 2012.
LEAR participated in the TRECVID MED challenge together with the AXES project and finished first and second. The Multimedia Event Detection (MED) evaluation track is part of the TRECVID Evaluation. The goal of MED is to assemble core detection technologies into a system that can search multimedia recordings for user-defined events based on pre-computed metadata.
The third INRIA Visual Recognition and Machine Learning Summer School took place at the INRIA Grenoble campus, from July 9 to July 13, 2012.
Our recent publications in major computer vision conferences: 5 CVPR'12 papers, 2 ECCV'12 papers, and 1 BMVC'12 paper. See our publications page for downloads.
2011
2010
We co-organized the workshop on machine learning for next generation computer vision challenges at NIPS 2010, December 10, Whistler BC, Canada. Papers and slides of the talks are now available online.
In PASCAL VOC 2010, our work on human action recognition achieved the best results on three out of nine action classes. At the ECCV'10 International Workshop on Sign, Gesture, and Activity, our paper Human Focused Action Localization in Video was awarded the best paper prize.
For the Photo Annotation task of ImageCLEF 2010, our joint submissions with Xerox Research Centre Europe achieved the best results on 56 of the 93 annotation concepts. For 88 concepts, our runs were among the 6 best runs out of the 63 submitted ones. See this paper for details.
The first INRIA Visual Recognition and Machine Learning Summer School took place at our institute in Grenoble, from July 26 to July 30. The school had 150 international attendees. Most lecture slides are now available.
Our recent publications in major computer vision conferences: 5 CVPR'10 papers (2 orals), 2 ECCV'10 papers, and 1 BMVC'10 paper. See our publications page for details.
2009
In each of the Photo Annotation and Image Retrieval tasks of ImageCLEF'09, LEAR placed second among the 19 participating teams. The methods that were used are described in this paper.
Recent publications in major computer vision conferences: 4 ICCV'09 papers (2 orals), 3 BMVC'09 papers (2 orals), and 3 CVPR'09 papers. See publications web page for details.
2008
LEAR obtained excellent results in TRECVID 2008. The method used is described in this paper.
In PASCAL VOC 2008, LEAR won the detection contest for 11 out of 20 classes (see example detections here) and the classification contest for 7 out of 20 classes.
Recent publications in major computer vision conferences: 4 ECCV'08 and 4 CVPR'08 papers. See publications web page for details.
We developed an image indexing system that searches in real time for similar images in very large databases. It is currently being transferred to and tested by the start-up MilPix. Our image search demo on 10,000,000 images: Bigimbaz.
We organized an International Workshop on Object Recognition, Como, May 2008.
2007