CE23 - Intelligence Artificielle

Analysing Discourse Automatically with Multiple Objectives – AnDiAMO

Submission summary

Documents are not random sequences of sentences, but rather spans of texts are linked together into a discourse structure to make coherent and meaningful documents. Developing systems able to interpret documents and to extract structured information is a challenge in natural language processing that requires high-performing discourse parsers. However, performance are still low and current work does not make it clear where the problem lies. Moreover, studies focus on English monologues from news, which limits their use and their robustness. Finally, empirical studies rarely try to inform theoretical models for discourse. We propose a general framework here called multi-objective learning to tackle these issues. We will build systems seeking to achieve several goals as a way to provide robust systems for multiple languages, domains and modalities, to investigate data representation and problem modelling, to improve evaluation, and to shed some light on the theoretical divergences.

Project coordination

Chloé Braud (Institut de Recherche en Informatique de Toulouse)

The author of this summary is the project coordinator, who is responsible for the content of this summary. The ANR declines any responsibility as for its contents.

Partner

IRIT Institut de Recherche en Informatique de Toulouse

Help of the ANR 239,891 euros
Beginning and duration of the scientific project: January 2022 - 36 Months

Useful links

Explorez notre base de projets financés

 

 

ANR makes available its datasets on funded projects, click here to find more.

Sign up for the latest news:
Subscribe to our newsletter