DS0708 - Données massives, connaissances, décision, calcul haute performance et simulation numérique

Uncertainty in Machine Learning Network – UML-NET

Uncertainty in Machine Learning Network

This project aims at building a European network centered around the problem of learning under severe undertainty. It tackles two main issues:<br /><br />* Learning from scarce and imperfect (imprecise, noisy) data<br /><br />* Discussing how to assess and learn cautious models, where cautious means considering sets of possible models.

Building a European network

The main objectives of the project are the following:<br /><br />* Allow researchers from different fields and using different approaches to discuss about their respective pros and cons. <br /><br />* Federate a research community around the problem of learning in case of severe uncertainties.

* The organisation of workshops dedicated to the project topic, including researchers within and outside the network

* Research stays (long and short) between members of the network

* Organisation of two Workshops on Uncertainty in Machine Learning (WUML), in 2015 and 2017

* Best paper award at SMPS 2016 conference («Technical Gestures Recognition by Set-Valued Hidden Markov Models with Prior Knowledge«)

* COST project submission

* Continuation of the network through additional national and european fundings

* Y. Soullard, A. Antonucci, S. Destercke, «Technical Gestures Recognition by Set-Valued Hidden Markov Models with Prior Knowledge«

Modern information systems become more and more complex, and so do the data they have to handle. This is the case, for example, of protein functions prediction, information retrieval from big data bases of documents, recommending items or actions from few preferences, diagnosing and monitoring the state of complex systems such as planes, energy networks, … This means that accurate data and models are respectively harder to obtain and harder to learn. Hence, available data are often noisy and incomplete (e.g., pairwise instead of full preferences).

Dealing with such data while preserving the computational efficiency of the learning methods often means a drop in the accuracy of model predictions. While some loss of accuracy for a gain of efficiency is affordable in some applications such as recommender systems or web crawlers, such a loss is a tremendous drawback in other applications such as medical diagnosis, risk analysis or autonomous vehicles. For these latter applications, recent and very promising trends of research consist in fully acknowledging this uncertainty in the learning process, possibly sacrificing some efficiency for more reliable results, for instance by allowing models to make partial and cautious but more accurate predictions, or by treating data uncertainty in a conservative way. The goal of this project is to foster collaborations between French laboratories and European research groups following this trend from different perspectives, to exchange views, establish a common understanding of this emerging field and develop a European network focusing on this specific topic. The aim of this project if to lay down the first corner stones of this long-term goal, notably through the orgnization of events (workshops, special sessions or journal special issues) and the exchange of researchers and students between the different involved groups and the Heudiasyc Laboratory.

The network will focus on two main topics. The first topic addressed by the network the one learning from uncertain data, either in the input features or the observed (structured) output. In particular, it will explore the recent trend recommending to learn sets of models from the uncertain data, thereby producing partial but more reliable prediction, and will compare it to the more usual requirement of learning one model giving a precise prediction.

The second explored topic will concern cautiousness in models and in decision-making, more specifically when cautiousness is understood as the fact of producing sets of models or sets of predictions when uncertainty is too important. Two problems related to this we expect to discuss are how this cautiousness should be exploited in learning methods, and how to handle this cautiousness in a computationally efficient way.

Project coordination

Sébastien Destercke (Laboratoire Heuristique et Diagnostic des Systèmes Complexes)

The author of this summary is the project coordinator, who is responsible for the content of this summary. The ANR declines any responsibility as for its contents.

Partner

IDSIA Dalle Molle Institute for Artificial Intelligence - Imprecise probability group
University of Paderborn - Intelligent Systems Group
Université d'Oviedo - Equipe Métrologie et modèles
BGE Unité Biologie à grande échelle
Heudiasyc Laboratoire Heuristique et Diagnostic des Systèmes Complexes

Help of the ANR 52,000 euros
Beginning and duration of the scientific project: January 2015 - 24 Months

Useful links

Explorez notre base de projets financés

 

 

ANR makes available its datasets on funded projects, click here to find more.

Sign up for the latest news:
Subscribe to our newsletter