platform; pooling; resources; tools; French; languages of France

Outils et Ressources pour un Traitement Optimisé des LANGues

Equipements d'excellence (EQPX)


General information

Project reference: 11-EQPX-0032
Coordinating institution: CNRS Centre Est (Vandoeuvre)
Project region: Grand Est
Discipline: 6 - SHS (humanities and social sciences)

ANR funding: 2,600,000 euros
Investment covering the period from August 2012 to December 2019

Submission summary

ORTOLANG (Open Resources and Tools for Language) is a French infrastructure implemented within the framework of the “Programme d’Investissement d’Avenir” (PIA), the French Government's “Investissements d’Avenir” funding program.

The main purpose of ORTOLANG is to offer a repository of readily available, well-documented linguistic data (corpora, lexicons, dictionaries, etc.) and language processing tools that will:


enable research on the analysis, modelling and automatic processing of our language to reach the best international level through a genuine pooling of resources;

ease the use and transfer of resources and tools created by public laboratories to industrial partners, in particular SMEs, which often cannot develop such language processing resources and tools themselves because of the creation costs;

promote the French language and local languages of France by sharing the knowledge acquired by public laboratories.


The ORTOLANG platform is intended to be a shared infrastructure for the management, long-term preservation and dissemination of language resources such as corpora, lexicons, terminologies and language processing tools.

This project also aims to extend and sustain the efforts of two digital language resource centres set up by the CNRS: the CNRTL (Centre National de Ressources Textuelles et Lexicales) and the SLDR (Speech and Language Data Repository).

It aims to serve as the French node of, and a major support for France’s commitment to, the CLARIN infrastructure (Common Language Resources and Technology Infrastructure), in which we have been partners from the very beginning.

It also serves as a technical platform for written and oral language forms, supporting the coordination actions of the TGIR HumaNum.



The author of this summary is the project coordinator, who is responsible for its content. The ANR therefore declines any responsibility for its content.
