TSIA - Giga-modèles - Thématiques Spécifiques en Intelligence Artificielle (Giga-modèles pour le traitement automatique du langage naturel et des données multimodales) 2023

Large adaptable and sovereign language models for the French medical field – MALADES

Submission summary

The recent arrival of Large Language Models (LLMs) and their associated tools for the general public reveal major challenges for society. Among the many fields that are, or will be, impacted by these generative models, the biomedical field is one of those that currently attract the attention of industrialists, researchers, but also the general public. Indeed, the need for tools and potential applications seems immense, whether, for example, at the level of the processing of textual documents, medical imaging, or even voice interaction. Due to the sensitive nature of the personal data handled and the fears of society associated with decision support tools, work in natural language processing (NLP) must innovate by addressing the issues inherent in this field. As part of the SALADES project, we presented innovative approaches for the integration of LLM in health centers. The aim is to equip these centers with NLP tools derived from LLMs and adapted for the biomedical field while maintaining sovereignty of the models and complete control of their health data. The work we carry out focuses on four areas of research: 1) the study of the legal and ethical aspects in France of LLMs for the biomedical field, 2) the integration of an interaction by the speech of LLMs by means of end-to-end approaches, including the massive collection of speech data, 3) The collection of new original case studies oriented for the evaluation of generative language models, and 4) the integration of dynamic and sovereign LLMs for the biomedical field, deployed on constrained material resources, and integrating original approaches providing LLMs with additional capabilities by means of mastered and verified knowledge bases.

Project coordination

Richard Dufour (Laboratoire des Sciences du Numérique de Nantes)

The author of this summary is the project coordinator, who is responsible for the content of this summary. The ANR declines any responsibility as for its contents.

Partnership

LS2N Laboratoire des Sciences du Numérique de Nantes
LIS Laboratoire d'Informatique et Systèmes
LIA Laboratoire Informatique d'Avignon
CHUN CHU de Nantes

Help of the ANR 674,060 euros
Beginning and duration of the scientific project: September 2023 - 48 Months

Useful links

Explorez notre base de projets financés

 

 

ANR makes available its datasets on funded projects, click here to find more.

Sign up for the latest news:
Subscribe to our newsletter