CE23 - Intelligence artificielle et science des données 2024

Facilitated Exploration: Interactive Constraint-Driven Data Mining – FIDD

Submission summary

The FIDD project addresses contemporary challenges in data mining and AI. The rise in data generation across various domains has led to a demand for interactive, user-centric approaches to efficiently extract valuable insights from complex datasets. Traditional data mining methods face limitations in keeping up with evolving user requirements and dynamic data environments. These methods employ three main strategies for handling user constraints: pre-processing, which restricts the dataset but becomes impractical with numerous constraints; integrating constraint filtering directly into the mining process, requiring new algorithms for each constraint; and post-processing, a brute-force method impractical for problems with many patterns. Constraint programming (CP) has emerged as a declarative solution to data mining, allowing the incorporation of user constraints without added complexity. However, CP can be challenging to model and requires manual construction of constraints. An emerging field, Constraint Acquisition (CA), focuses on acquiring a constraint network from data, addressing these challenges. Interactive pattern mining has transformed traditional methods, introducing an iterative approach. Instead of static constraints, it involves pattern extraction, user interaction to shape the process, and preference learning for future iterations. Current methods often rely on independent descriptors, limiting their ability to capture complex data relationships. Users struggle to articulate precise search criteria, compounded by data heterogeneity, graph complexities, and dynamic data. The FIDD project integrates advanced techniques from constraint-driven data mining, constraint acquisition, machine learning, and interactive data mining, aiming to provide a unified framework capable of adapting to evolving user needs, efficiently extracting patterns, and enhancing performance through user feedback and model revision.

Project coordination

Nadjib Lazaar (Laboratoire Interdisciplinaire des Sciences du Numérique)

The author of this summary is the project coordinator, who is responsible for the content of this summary. The ANR declines any responsibility as for its contents.

Partnership

DLM DEEPLINK MEDICAL
LS2N Ecole nationale supérieure Mines-Télécom Atlantique Bretagne Pays de la Loire
CRIL Centre de Recherche en Informatique de Lens
LISN Laboratoire Interdisciplinaire des Sciences du Numérique
GREYC Université de Caen Normandie
LIFO EA 4022 LABORATOIRE D'INFORMATIQUE FONDAMENTALE D'ORLÉANS

Help of the ANR 729,463 euros
Beginning and duration of the scientific project: December 2024 - 48 Months

Useful links

Explorez notre base de projets financés

 

 

ANR makes available its datasets on funded projects, click here to find more.

Sign up for the latest news:
Subscribe to our newsletter