CE48 - Fondements du numérique : informatique, automatique, traitement du signal 2023

Personalized Optimization with Human Feedback – Human-O

Making optimisation work for humans

Often in cyber-physical systems, like smart grids and transportation networks, humans are an after thought. We want to design online optimisation algorithms that consider humans from the start. This is hard to do mathematically, but it will open the way for transparent and flexible human-systems.

Data-driven and flexible optimization algorithms

In this project, I propose to explore and design new algorithms at the intersection of optimiza- tion, dynamical systems, and learning of the user’s costs and constraints, allowing end users to have a choice between possible decisions, and learn from it. [S1] Learning to optimize for dynamical systems. We will study how to go beyond the state of the art by learning and predicting the optimizer trajectory from data. Also, we will see how to incorporate prior knowledge from historical data. Complementary to the dynamical system, we will extend the learning to the cost function and the constraints of the problem, both to better predict how they change in time. [S2] Learning to personalize. We will use machine learning (kernel) methods to learn the human satisfaction, and therefore their costs and (possibly) constraints via their feedback, in a setting that could resemble and go beyond a bandit framework. The first key challenge that we will tackle is the combination of the needs of optimization and the properties of traditional machine learning techniques. In particular, we will study how to impose functional/shape constraints for the estimated cost and constraints. [S3] Learning to be flexible. We will go beyond learning a cost or constraints, and model the problem in a way that the result of the optimization is a set of possible solutions and we will give to the users the choice to pick the solution they prefer. This brings in novel perspectives on how to model flexible and time-varying optimization problems, and how to learn from the choice the users make. In particular, for this objective, humans play a more active role, and I expect their preferences (in terms of choices) to give rise to non-convexities and non-idealities.

Methods

We will be used a blend of time-varying optimisation, machine learning, as well as robust optimisation.

Results

The main results will be algorithms that can be run online to make humans part of the decision-making process.

For much more details, please refer to the project website and Hal for current up-to-date publications.

Prospects

The focus is building human-centred algorithms. Our aim is to unlock flexibility and transparency in how humans interact with technology.

Submission summary

The Human-O project deals with upgrading current optimization algorithms to the needs of cyber-physical and social system, namely information streams and human presence. The aim is to design novel algorithms that learns how optimization problems evolve in time, due to time-varying conditions, and learns human-specific objectives and constraints to be incorporated in the problem itself.

The research steps of the project pertain learning the dynamical system underlying the optimality conditions, incorporating human feedback by designing and learning human-specific costs and constraints, and giving users a choice among possible decisions, which gives humans an active part in the optimization process.

The ultimate vision is to shape the emerging field of cyber physical and social systems, where humans are an integral part of a complex, ever-changing, environment, which nevertheless requires optimization algorithms to be able to give hard guarantees on safety, reliability, and performance. This will change how humans experience technology and interact with it.

Andrea SIMONETTO (Unité de Mathématiques Appliquées)

The author of this summary is the project coordinator, who is responsible for the content of this summary. The ANR declines any responsibility as for its contents.

University of Colorado Boulder
UMA Unité de Mathématiques Appliquées

Help of the ANR 264,228 euros
Beginning and duration of the scientific project: December 2023 - 48 Months

Explorez notre base de projets financés

ANR makes available its datasets on funded projects, click here to find more.