Microscopic Intelligibility Modeling – MIM
Our project has two goals: a) make machines recognise speech more like humans do, and b) validate our understanding about human speech perception through the use of data-driven techniques. MIM aims at proposing computational models that predict human speech recognition at a fine resolution. Current approaches to intelligibility prediction provide macroscopic estimates consisting of aggregates over many stimuli and listeners. By leveraging recent developments in the field of Artificial Intelligence, models could predict recognition at a sub-lexical level. Deep learning (DL) has improved automatic speech recognition performance significantly, achieving super-human transcription in conversational tasks. We plan to build DL models to predict human listening tests responses aiming at improving individualization of hearing solutions. Scarcity and variability of human listening data, and the interpretation problem in DL are two of the main issues that we will tackle.
Project coordination
Ricard Marxer (Laboratoire d'Informatique et Systèmes)
The author of this summary is the project coordinator, who is responsible for the content of this summary. The ANR declines any responsibility as for its contents.
Partnership
LIS Laboratoire d'Informatique et Systèmes
Help of the ANR 286,934 euros
Beginning and duration of the scientific project:
August 2021
- 48 Months