Blanc - SIMI 2 - Science informatique et applications 2012

Exploitation of context for proper names recognition in the diachronic audio documents – ContNomina

In diachronic data (data that change over time), new names appear constantly, requiring dynamic updates of the lexicons and language models used by the speech recognition system. The ContNomina project therefore focuses on the problem of proper names in automatic audio processing systems, making the most efficient use possible of the context of the processed documents.

The project will address:
• the statistical modeling of contexts and of relationships between contexts and proper names;
• the contextualisation of the recognition module through the dynamic adjustment of the lexicon and of the language model, in order to make them more accurate and more relevant in terms of lexical coverage, particularly with respect to proper names;
• the detection of proper names, on the one hand in text documents, for building lists of proper names, and on the other hand in the output of the recognition system, to identify spoken proper names in the audio / video data.

Results

in progress

Prospects

in progress

Scientific productions and patents

D. Fohr, O. Mella, "Combination of Random Indexing based Language Model and N-gram Language Model for Speech recognition", Interspeech 2013
A. Lorenzo, C. Cerisara, "Weakly supervised joint SRL and Dependency Parsing", submitted to EMNLP 2013

Submission summary

The technologies involved in information retrieval in large audio/video databases are often based on the analysis of large, but closed, corpora, and on machine learning techniques and statistical modeling of written and spoken language. The effectiveness of these approaches is now widely acknowledged, but they nevertheless have major flaws, particularly concerning new words and proper names, two types of input that are crucial for interpreting the content but extremely difficult to model from the analysis of closed corpora.
In diachronic data (data that change over time), new names appear constantly, requiring dynamic updates of the lexicons and language models used by the speech recognition system.
The ContNomina project therefore focuses on the problem of proper names in automatic audio processing systems, making the most efficient use possible of the context of the processed documents. To do this, the project will address:
• the statistical modeling of contexts and of relationships between contexts and proper names;
• the contextualisation of the recognition module through the dynamic adjustment of the lexicon and of the language model, in order to make them more accurate and more relevant in terms of lexical coverage, particularly with respect to proper names;
• the detection of proper names, on the one hand, in text documents for building lists of proper names, and on the other hand, in the output of the recognition system to identify spoken proper names in the audio / video data.
Resources developed during this project will be made accessible to the scientific community. They will comprise a lexicon of phonetized proper names (no such lexicon is currently available for French) and annotations of an audio / video corpus.
A web demonstrator will be implemented to validate the scientific developments achieved in the project.
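
To make the second objective above more concrete, here is a minimal Python sketch of one possible way to adjust a lexicon from context. It is an illustration only, not the project's actual method: the bag-of-words context vectors, the cosine ranking, the function names (`context_vectors`, `select_names`) and the toy data with invented names are all assumptions introduced for this example.

```python
from collections import Counter
import math


def context_vectors(documents):
    """Map each proper name to counts of the words it co-occurs with.

    `documents` is a list of (words, names) pairs, e.g. dated text
    documents in which proper names have already been detected.
    """
    vectors = {}
    for words, names in documents:
        for name in names:
            vectors.setdefault(name, Counter()).update(w.lower() for w in words)
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[word] for word, count in a.items() if word in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def select_names(transcript_words, vectors, lexicon, top_k=2):
    """Rank out-of-lexicon names by similarity to a first-pass transcript."""
    query = Counter(w.lower() for w in transcript_words)
    scored = [
        (cosine(query, vec), name)
        for name, vec in vectors.items()
        if name not in lexicon
    ]
    # Highest score first; ties broken alphabetically for determinism.
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [name for score, name in scored[:top_k] if score > 0]


# Toy text corpus (hypothetical names and contexts).
documents = [
    (["election", "president", "vote", "campaign"], ["Durand"]),
    (["football", "match", "goal", "league"], ["Kowalski"]),
    (["vote", "parliament", "election"], ["Nakamura"]),
]
vectors = context_vectors(documents)

# Hypothetical first-pass recognition output and current lexicon.
transcript = ["the", "election", "vote", "was", "close"]
lexicon = {"Nakamura"}

candidates = select_names(transcript, vectors, lexicon)
print(candidates)
```

In this toy setting, the names returned would be added to the recognition lexicon before a second recognition pass. A real system would also need pronunciations for the new names and a matching language model update, which the sketch leaves out.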

Irina ILLINA (LABORATOIRE LORRAIN DE RECHERCHE EN INFORMATIQUE ET SES APPLICATIONS)

This summary was written by the project coordinator, who is responsible for its content. The ANR accepts no responsibility for its contents.

LORIA - LABORATOIRE LORRAIN DE RECHERCHE EN INFORMATIQUE ET SES APPLICATIONS
LIA

ANR funding: 317,117 euros
Start and duration of the scientific project: January 2013, 42 months
