CE48 - Fondements du numérique: informatique, automatique, traitement du signal 2020

Approximation and Randomised String Processing – PARSe

Submission summary

In this project we aim to study the foundations of processing large-scale, noisy string data. Our goal is to understand the limit of computations, and to provide new ultra-efficient algorithms and data structures for processing such data, inspired by approaches in hashing and high-dimensional geometry. We will focus on three research directions: streaming pattern matching, probabilistic text indexing, and sketching-based sting comparison. Algorithms and data structures on strings have traditionally been exploited in such fields as Bioinformatics, Information Retrieval, and Digital Security, and we expect our project to have a significant impact on these fields.

Project coordination

Tatiana Starikovskaya (Département d'Informatique de l'Ecole Normale Supérieure)

The author of this summary is the project coordinator, who is responsible for the content of this summary. The ANR declines any responsibility as for its contents.

Partnership

University of Wroclaw / Institute of Informatics
DI ENS Département d'Informatique de l'Ecole Normale Supérieure
IRISA Institut de Recherche en Informatique et Systèmes Aléatoires
LIRMM Laboratoire d'Informatique, de Robotique et de Microélectronique de Montpellier

Help of the ANR 202,340 euros
Beginning and duration of the scientific project: December 2020 - 48 Months

Useful links

Explorez notre base de projets financés

 

 

ANR makes available its datasets on funded projects, click here to find more.

Sign up for the latest news:
Subscribe to our newsletter