Synthetic data as a method for increasing reproducibility and transparency in educational research

Journal article › Research › Peer reviewed

Publication data

By	Simon Grund, Oliver Lüdtke, Alexander Robitzsch
Original language	English
Published in	Zeitschrift für Erziehungswissenschaft
Pages	25
Editor (Publisher)	VS Verlag für Sozialwissenschaften
ISSN	1434-663X, 1862-5215
DOI/Link	https://doi.org/10.1007/s11618-026-01396-6
Publication status	Advance online publication – 02.2026
Keywords	Synthetic data, Reproducibility, International large-scale assessments, Transparency, Open science

Open data are often regarded as an important step towards improving the reproducibility and transparency of educational science. Yet, data sharing remains rare, and without open data, statistical analyses often remain irreproducible. In this article, we provide an introduction to synthetic data, a statistical technique based on multiple imputation (MI) that can be used to create simulated copies of the data that can be shared even when the original data cannot. To this end, we discuss reproducibility-related challenges of synthetic data and outline different approaches for generating synthetic data, including conventional and data-augmented MI (DA-MI) approaches to synthetic data. Furthermore, we conducted a case study using data from the PISA 2018 study, in which we aimed to address several challenges with synthetic data in educational research, such as missing data, multilevel data, and complex sampling designs. Our results indicate that these challenges can be addressed with relatively simple tools and that synthetic data can reproduce the results in a variety of statistical analyses. Finally, we discuss remaining challenges and directions for future research.

IPN - Leibniz Institute for Science and Mathematics Education