Jilles is Faculty (W3) at the CISPA Helmholtz Center for Information Security, where he leads the Exploratory Data Analysis group. He is Honorary Professor of the Department of Computer Science of Saarland University, as well as ELLIS Fellow of the ELLIS Unit Saarbrücken on Artificial Intelligene and Machine Learning.
My research is mainly concerned with causality and unsupervised learning. In particular, I enjoy developing theory and algorithms for answering exploratory questions, such as 'what is going on in my data?' or 'what is going on in my model?' without having to make unnecessary or unjustified assumptions. To identify what is worth knowing, I often employ well-founded statistical methods based on information theory, and then proceed to develop efficient algorithms for extracting useful interpretable results. I like all data types equally much.
Currently I'm investigating techniques for identifying informative and ideally causal structures in large collections of complex data; how to efficiently mine easily interpretable summaries from data; how to determine and discover causal dependencies from observational data; the theoretical and practical foundations of interactive exploration of very large data, discovering things by serendipity; how to mine large relational databases; how to mine very large graphs, including characterising influence propagation in social networks; as well as to study well-founded approaches for meaningfully comparing between, and validation of, explorative results.
Christina is faculty office, i.e. the secretarial support, for Jilles and the EDA group.
Lénaïg Cornanguer is a post-doctoral researcher. Her main research interests are causal inference, discovery of temporal logic from observational data, and explainable anomaly detection.
Lénaïg pursued her PhD at IRISA in Rennes, obtaining her degree from the Université de Rennes 1 for her dissertation titled 'Timed Automata Learning from Time Series' in November 2023. Prior to that, she obtained her Master's of Science in Data Science from the Agrocampus Ouest Engineering School in Rennes, France.
Joscha Cüppers is a postdoctoral researcher working on novel methods for mining (causal) patterns from data. He is particularly interested in developing methods for discovering interactions and abstractions from sequential data.
Joscha obtained his Bachelor of Science in Computer Science in 2016 from Ulm University and his Master of Science from Saarland University in 2019. He joined the EDA group in 2019 to write his Master's thesis, and stayed to do his PhD. He defended his PhD thesis 'Discovering Actionable Insights from Event Sequences' on September 11th 2025.
Janis Kalofolias is a post-doctoral researcher affiliated with the EDA group. His research interests include many things, ranging from optimistic estimators for subgroup discovery, kernel-based methods for measuring similarities between graphs, to information theoretic methods for subjectively interesting structure from complex data.
Janis obtained his Bachelor of Science in 2011 from the University of Patras, Greece. In 2012 he joined Saarland University to pursue a Master of Science in Computer Science, and was a Research Assistant at the Max Planck Institute for Informatics. He joined the EDA group as a PhD student in November 2016, and defended his dissertation titled 'Subgroup Discovery for Structure Targets' on December 8th 2022. He subsequently was a Postdoc in the group until June 2024.
David Kaltenpoth is a postdoc who works on causal inference under realistic conditions, such as confounding, selection bias, or non-i.i.d. data.
David obtained his Ph.D. in Computer Science from Saarland University on November 25th, 2024. His thesis, 'Don't Confound Yourself: Causality from Biased Data,' was awarded the Helmholtz AI Dissertation Award 2024.
David obtained his Master of Science in Mathematics from the Ludwigs-Maximilian Universität München in 2016. He joined the EDA group in June 2016 for a Research Immersion lab, stayed for his Ph.D., and is now a postdoc.
Jawad Al Rahwanji is a PhD student in the EDA group. He works on explainable neuro-symbolic methods and is particularly interested in applications in the biomedical domain. He is a member of the Research Training Group on Neuro-Explicit Models.
Jawad obtained his Master's of Science in Data Science and Artificial Intelligence from Saarland University in 2025. He wrote his Master's thesis with us on differentiable subgroup discovery in time-to-event (survival) data. Previously, he obtained his Bachelor of Engineering in Information Technology Engineering in 2021 from Damascus University, specializing in artificial intelligence.
Sarah Mameche is a PhD student who is interested in exploratory causal analysis, such as discovering of invariant causal mechanisms from data.
Sarah did both her Bachelor's and Master's degree in Computer Science at Saarland University. She joined the EDA group in 2020 to write her Master's thesis with us on the topic of discovering invariant causal mechanisms between different environments, such as between the populations analyzed by different hospitals.
Praharsh Nanavati is a PhD student working on causality. He is a member of the Research Training Group on Neuro-Explicit Models.
Praharsh worked as a Predoctoral Researcher in Causal Fairness at the CSA Department at the Indian Institute of Science. Previously, he received a BS-MS degree in Data Science and Engineering from IISER Bhopal, India in 2024, where he worked on causal representation learning with minimal DAG knowledge as part of his Master's thesis.
Luis Paulus is a PhD student working on neuro-symbolic knowledge discovery and machine learning.
Luis obtained both his Master's and Bachelor's in Computer Science from Saarland University. He wrote his Master's thesis on how to efficiently discover high-quality rule sets from massive binary datasets using continuous optimization, and his Bachelor's thesis on reinforcement curriculum learning. He was a tutor for Programming 1 and the Math Preparatory Course, part of the CS Bachelor Honor's Program, did an Erasmus semester in Bergen, Norway, and likes rowing.
Hendrik will join us as a PhD student from March 1st. Hendrik is a PhD student interested in exploratory data analysis in general, be it pattern mining, anomaly detection, and/or causal analysis.
Hendrik obtained his Master's in Computer Science and his Bachelor's in Mathematics and Computer Science from Saarland University. He wrote his Master's thesis on how to detect and characterize rare exceptions in data. He was a student assistant at the Saarland University Competence Center, worked at retailSolutions, and likes traveling and playing football.
I am a Ph.D. student at CISPA Helmholtz Center for Information Security, supervised by Jilles Vreeken. I am broadly interested in robust and explainable machine learning for large-scale real-world applications. In my Ph.D, I intend to develop new approaches that are at the same time descriptive and predictive. That is the models not only offer predictive capabilities but also facilitate practitioners to gain deeper insights into the problems they are addressing.
I obtained my Bachelor's and Master's degrees in Computer Science from Saarland University. Before joining CISPA, I was a research assistant in the goup of Bernt Schiele at the Max-Planck-Institut for Informatics, supervised by David Stutz . My research focused on adversarial and out-of-distribution robustness of Quantized Neural Networks. I also worked on the influence of Batch Normalization on the vulnerability and generalization capabilities of neural networks.
Sascha Xiaguang Xu is a Ph.D. student who works on the intercept between causality and explainability.
Sascha obtained his Bachelor's and Master's degrees from Saarland University, respectively in 2019 and 2022. During this time he worked with us a student research assistant on the topic of bivariate causal inference in the presence of heteroscedastic noise, which led to an ICML paper in 2022.
Ben – or, if you want to be formal, Benedict – is pursuing his Bachelor of Science degree in Data Science and AI at Saarland University. He is currently writing his Bachelor's thesis with us on the topic of multi-modal subgroup discovery.
Teodora is pursuing a Bachelor's degree in, and part of the Honor programme of, Computer Science at Saarland University. She is broadly interested in gaining understanding of how large models reach their conclusions. She is writing her Bachelor's thesis with us on mechanistic interpretability, and is, in parallel, also a Research Assistant working on tokenization.
Previously, she tutored Programming 1 and the Mathematics Preparatory Course, and worked with Prof. Vera Demberg on LLMs and natural language processing.
Florian holds a Master's scholarship from the Konrad-Zuse School ELIZA and is pursuing a degree in Computer Science at Saarland University. He is a research assistant in the EDA group, working on interpretability and explainability.
Florian obtained his Bachelor's in Computational Linguistics from Saarland University. He was a tutor for six different lectures, among which on Statistics, NLP Algorithms, and Semantics.
Saanvi is pursuing her Bachelor's of Computer Science at Saarland University. She is currently writing her Bachelor's thesis with us on the topic of Komogorov-Arnold Networks and Causality.
Tim is pursuing his Master of Science in Data Science and AI at Saarland University. He is currently writing his Master's thesis with us on mechanistic interpretability.
Tim obtained his Bachelor's in Data Science and AI from Saarland University. He wrote his thesis on how to discover and characterize anomalies identified by deep vision models.
Ferdinand is pursuing a Master's of Science in Computer Science at Saarland University. He is currently working on his Master's thesis.
Tymur is pursuing his Master's of Science in Computer Science at Saarland University. He is currently exploring possible topics for his Master's thesis.
Ghada is pursuing a Master's of Science in Data Science and AI at Saarland University. She is currently working on her Master's thesis on continuous-optimization-based approaches for mechanistic interpretability