The Paris-Saclay Center for Data Science: an interdisciplinary project around data

Latest News


Citizen science « à la française »: from Vigie-nature to « 65 millions d’observateurs », toward a national framework for citizen science.

Data Engineering Position

Paris-Saclay Center for Data Science (CDS)

Data Scientist Position

Paris-Saclay Center for Data Science (CDS)

The Paris-Saclay Center for Data Science is a "LIDEX" project initiated by the Université Paris-Saclay.

Data science

The subject of data science is the design of automated methods to analyze massive and complex data in order to extract useful information. Data science projects require expertise from a vast spectrum of scientific fields, ranging from research on methods (statistics, signal processing, machine learning, data mining, data visualization) through software building and maintenance to mastery of the scientific domain from which the data originate.

The objectives of the CDS

The goal of this initiative is to establish an institutionalized agora in which these scientists can find each other, exchange ideas, initiate and nurture interdisciplinary projects, and share their experience from past data science projects. To foster synergy between data analysts and data producers, we propose to provide initial resources to help collaborations get off the ground, to mitigate the non-negligible risk taken by researchers venturing into interdisciplinary data science projects, and to encourage the unconventional forms of information transmission and dissemination that are essential in this communication-intensive research area. The CDS fits within the recent surge of similar initiatives at both the international and the national level, and it has the potential to make the University one of the international forerunners of data science.

Data science in human, natural, and engineering sciences

More than 250 permanent researchers in 35 laboratories participate in the CDS. On the mathematics/computer science side, the major research themes are

At the same time, we focus on data coming from


Balázs Kégl, CNRS, Laboratoire de l’Accélérateur Linéaire (LAL),

Arnak Dalalyan, ENSAE, Laboratoire de Statistique (LS), ENSAE-CREST,