TSIA - Giga-modèles - Thématiques Spécifiques en Intelligence Artificielle (Giga-modèles pour le traitement automatique du langage naturel et des données multimodales) 2023

Translating with Large Language Models – TRaLaLaM

Submission summary

In the span of five short years (2017-2022), the field of Natural Language Processing (NLP) was deeply transformed by advances in general-purpose neural architectures, which are used both to learn deep representations of linguistic units and to generate high-quality textual content. These architectures are now ubiquitous in NLP applications; trained at scale, these “large language models” (LLMs) offer multiple services (summarization, writing aids, translation) in a single model through human-like conversations and prompting techniques.
In this project, we analyze this new state of play from the perspective of the machine translation (MT) task and ask two main questions. (a) Since LLMs can be trained without any parallel data, they open the prospect of improved MT for the many language pairs for which such resources are scarce, if they exist at all. Can this promise be kept, especially for low-resource dialects or regional languages? (b) Prompting techniques make it straightforward to inject various types of contextual information that could help an MT system take context into account, for example to adapt to a domain, a genre, a style, a client’s translation memory, or the readers’ language proficiency. Is prompting equally effective in all these situations, assuming good prompts can be generated, or is it hopeless to expect improvements without (instruction) fine-tuning? To address these two questions, the TRaLaLaM project will also (a) collect data for low-resource languages and use them to extend existing LLMs, and (b) develop new test corpora and associated evaluation strategies.

Project coordination

Josep CREGO (SYSTRAN)

The author of this summary is the project coordinator, who is responsible for its content. The ANR declines any responsibility for its contents.

Partnership

ISIR Institut des Systèmes Intelligents et de Robotique
SYSTRAN
INRIA Paris Centre de Recherche Inria de Paris

ANR funding: 595,348 euros
Beginning and duration of the scientific project: September 2023 - 36 Months
