Search
Up-to-date LLM for all – LLM4all
Large Language Models (LLM) of sufficient size exhibit outstanding emergent abilities, such as learning from their input context and decomposing a complex problem into a chain of simpler steps. Both these emergent abilities and the performances that have been published on many tasks tend to prove th
Foundation INtegrated models for Libraries, Archives and Museum – FINLAM
The digital transformation of libraries, which has been based on OCR (Optical Character Recognition) technology for more than 20 years, faces certain limitations both in terms of quality, due to the diversity of the collections and the limitations of OCR technology, and in terms of added value, due
Translating with Large Language Models – TRaLaLaM
Within the span of five short years (2017-2022), the field of Natural Language Processing (NLP) has been deeply transformed by the advances of general-purpose neural architectures, which are both used to learn deep representations for linguistic units and to generate high-quality textual content. Th
Large adaptable and sovereign language models for the French medical field – MALADES
The recent arrival of Large Language Models (LLMs) and their associated tools for the general public reveal major challenges for society. Among the many fields that are, or will be, impacted by these generative models, the biomedical field is one of those that currently attract the attention of indu
Intrinsic and Extrinsic evaluation of biases in large language models – InExtenso
Large Language Models (LLM) are the Swiss army knife of today’s Natural Language Processing (NLP). They often outperform the state-of-the-art on benchmarks commonly used in the field for tasks such as part-of-speech tagging, text classification and named-entity recognition, thus paving the way to a
Construction and evaluation of multimodal and inclusive large language models (written, oral, pictograms) for general and clinical French – Pantagruel
The Pantagruel project is an ambitious initiative that aims to develop and evaluate multimodal (written, spoken, pictograms) and inclusive linguistic models for French. The project draws on the expertise of researchers from different disciplines, including computer science, signal processing, sociol
General pUrpose dIalogue-based DigitAl iNformation acCEss – GUIDANCE
- How to design new LLMs or re-use LLMs for DbIA; - How to leverage retrieval-Enhanced Machine Learning (ReML) techniques to improve the accuracy and efficiency of information retrieval systems; - Adapt LLMs and develop new architectures (for DbIA models) to deal with low resource and domain adapt
Observation de la Terre généralisée avec la télédétection et le texte – GEO ReSeT
In recent years, remote sensing images have become more available than ever thanks to important efforts coming from the public and private sectors. An emblematic example are the Sentinel satellites launched from 2014 in the frame of the Copernicus program. This mission provides a wide coverage of im