CE46 - Modèles numériques, simulation, applications

Dataflow Algorithm aRchitecture co-design of SKA pipeline for Exascale RadioAstronomy – DARK-ERA

Dataflow Algorithm aRchitecture co-design of SKA pipeline for Exascale Radio Astronomy

The exascale radio telescope Square Kilometre Array (SKA) [1] will require supercomputers with high technical demands. The Science Data Processor (SDP) pipeline in charge of producing the multidimen- sional images of the sky will have to execute in realtime a complex algorithm chain with data coming from telescopes at an incredible rate of several Tb/s and limited storage possibilities. The SDP will also have to be as green as possible with an energy budget of only 1 MWatt for 250 Petaflops.<br /><br />The SDP supercomputer will be based on a standard HPC system combined with FPGA or application-specific architectures like GPU or the manycore Kalray Massively Parallel Processor Array (MPPA). One crucial challenge is to assess the performance both in time and energy of new complex scientific dataflow algorithms on not-yet-existing complex computing infrastructures. It will be hardly possible without efficient co-design methods and rapid prototyping tools.

Dark-Era Goals

1- Building SimSDP, a rapid prototyping tool providing exascale simulations from dataflow algorithm description.<br />2 - Exploring low power accelerators like FPGA or Kalray MPPA as alternatives to mainstream GPU architecture.<br />3 - Being source of proposals for SKA computing and promoting french contributions such as ddfacet

SimSDP purpose is to provide early analyses in terms of mem- ory usage, latency, throughput, and energy consumption through an original mixed approach based on execution and simulation. Following an Algorithm Architecture Matching (AAM) approach, SimSDP will rely on a dataflow model of the algorithm and a model of the target architecture. SimSDP will be based on two existing tools: PREESM and SimGrid. PREESM accurately evaluates heterogeneous single node performance; SimGrid accurately simulates inter-node communications. Their association will allow for reliable simulations of large scale heterogeneous HPC systems.

The project started in 2021

The project started in 2021

The project started in 2021

SKA computing will be a pioneer HPC challenge tackled by Dark-Era. The exascale radio telescope Square Kilometer Array (SKA) will require supercomputers with high technical constraints. The Science Data Processor (SDP) pipeline in charge of producing the multidimensional images of the sky will have to execute in realtime a complex algorithm chain from data coming from telescopes at an incredible rate of several Tb/s and without any storage capabilities. The SDP will also have to be as green as possible with an energy budget of only 1 MWatt for 250 Petaflops. Such energy and computation requirements imply the SDP to be an innovative dataflow oriented and heterogeneous architecture. This supercomputer will be based on a standard HPC system combined with Field Programmable Gate Array (FPGA) or application-specific architectures like Graphical Processing Unit (GPU) or the manycore Kalray Massively Parallel Processor Array (MPPA). One crucial challenge is to assess the performance both in time and energy of new complex scientific dataflow algorithms on not-yet-existing complex computing infrastructures. It will be hardly possible without efficient co-design methods and rapid prototyping tools.

SimSDP is a rapid prototyping tool for SKA-like dataflow applications developed by the Dark-Era project. Through an original mixed approach based on execution and simulation, SimSDP purpose is to provide early analyses in terms of memory usage, latency, throughput, and energy consumption. Following an Algorithm Architecture Matching (AAM) approach, SimSDP will rely on a dataflow model of the algorithm and a model of the target architecture. SimSDP will be based on two existing tools: PREESM and SimGrid. PREESM accurately evaluates heterogeneous single node performance; SimGrid accurately simulates inter-node communications. Then, the association of PREESM and SimGrid will allow for reliable simulations of large scale heterogeneous HPC systems.

Thanks to SimSDP, algorithm and architecture spaces will be explored in the SKA context. The new generation of radio astronomy imaging pipelines like ddfacet will be described at a high-level of abstraction suitable for targeting any heterogeneous multinode HPC system SKA may choose in the future. Then, several SDP architecture configurations (number of nodes, kind of accelerators) and several SDP algorithm configurations will be explored together through the large scale simulations offered by SimSDP. Besides, SDP prototypes on MPPA and FPGA designed through High-Level Synthesis (HLS) tools and set up at the NenuFAR radio telescope will be developed and profiled on small scale datasets. It will allow to evaluating the potential of low power accelerators as an alternative to the mainstream GPU architecture. These SDP profiling feedbacks on GPU/MPPA/FPGA will fill out SimSDP with annotations on the dataflow graph. SDP prototypes will also be compared with SimSDP simulations on medium scale datasets to evaluate SimSDP new features.

Dark-Era will be a consortium gathering complementary skills in computer science, signal processing, and astronomy with twelve permanent members from the SimGrid development Team at IRISA, the PREESM development team at IETR, the inverse problem team at L2S, and two radio astronomy teams at Observatories of Paris and Cˆote d’Azur. Preliminary results obtained by this consortium in collaboration with Atos-Bull during the CNRS SKALLAS project will be pursued. Two PhD students will work on the association of PREESM and SimGrid tools in SimSDP, and two post-docs will be hired. With the support of SKA-France, this 4-year project aims to promote french contributions to SKA such as ddfacet and to be a force of proposal for SKA computing. Finally, Dark-Era intends to be the breeding ground for new international collaborations notably through the Rising STARS European and International network.

Project coordinator

Monsieur Nicolas Gac (Laboratoire des Signaux et Systèmes)

The author of this summary is the project coordinator, who is responsible for the content of this summary. The ANR declines any responsibility as for its contents.

Partner

IRISA Institut de Recherche en Informatique et Systèmes Aléatoires
L2S Laboratoire des Signaux et Systèmes
NANCAY Station de radioastronomie de Nançay
LAGRANGE (OCA/CNRS/UCA) Laboratoire J-L. Lagrange (OCA/CNRS/UCA)
IETR INSTITUT D'ELECTRONIQUE ET DE TELECOMMUNICATION DE RENNES (IETR)

Help of the ANR 498,960 euros
Beginning and duration of the scientific project: - 48 Months

Useful links

Explorez notre base de projets financés

 

 

ANR makes available its datasets on funded projects, click here to find more.

Sign up for the latest news:
Subscribe to our newsletter