Assessing the replicability of RCTs in RWE emulations

Abstract Background The standard regulatory approach to assess replication success is the two-trials rule, requiring both the original and the replication study to be significant with effect estimates in the same direction. The sceptical p-value was recently presented as an alternative method for th...

Full description

Saved in:

Bibliographic Details
Main Authors:	Jeanette Köppe, Charlotte Micheloud, Stella Erdmann, Rachel Heyard, Leonhard Held
Format:	Article
Language:	English
Published:	BMC 2025-05-01
Series:	BMC Medical Research Methodology
Subjects:	Randomized clinical trial Real-world evidence Replication Sceptical p-value Two-trials rule
Online Access:	https://doi.org/10.1186/s12874-025-02589-z
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1850269185284767744
author	Jeanette Köppe Charlotte Micheloud Stella Erdmann Rachel Heyard Leonhard Held
author_facet	Jeanette Köppe Charlotte Micheloud Stella Erdmann Rachel Heyard Leonhard Held
author_sort	Jeanette Köppe
collection	DOAJ
description	Abstract Background The standard regulatory approach to assess replication success is the two-trials rule, requiring both the original and the replication study to be significant with effect estimates in the same direction. The sceptical p-value was recently presented as an alternative method for the statistical assessment of the replicability of study results. Methods We review the statistical properties of the sceptical p-value and compare those to the two-trials rule. We extend the methodology to non-inferiority trials and describe how to invert the sceptical p-value to obtain confidence intervals. We illustrate the performance of the different methods using real-world evidence emulations of randomized controlled trials (RCTs) conducted within the RCT DUPLICATE initiative. Results The sceptical p-value depends not only on the two p-values, but also on sample size and effect size of the two studies. It can be calibrated to have the same Type-I error rate as the two-trials rule, but has larger power to detect an existing effect. In the application to the results from the RCT DUPLICATE initiative, the sceptical p-value leads to qualitatively similar results than the two-trials rule, but tends to show more evidence for treatment effects compared to the two-trials rule. Conclusion The sceptical p-value represents a valid statistical measure to assess the replicability of study results and is useful in the context of real-world evidence emulations.
format	Article
id	doaj-art-2a6b3cfa016e4ab8ad61b4eac9433f72
institution	OA Journals
issn	1471-2288
language	English
publishDate	2025-05-01
publisher	BMC
record_format	Article
series	BMC Medical Research Methodology
spelling	doaj-art-2a6b3cfa016e4ab8ad61b4eac9433f722025-08-20T01:53:14ZengBMCBMC Medical Research Methodology1471-22882025-05-0125111510.1186/s12874-025-02589-zAssessing the replicability of RCTs in RWE emulationsJeanette Köppe0Charlotte Micheloud1Stella Erdmann2Rachel Heyard3Leonhard Held4Institute of Biostatistics and Clinical Research, University of MuensterEpidemiology, Biostatistics and Prevention Institute, University of ZurichInstitute of Medical Biometry, University of HeidelbergEpidemiology, Biostatistics and Prevention Institute, University of ZurichEpidemiology, Biostatistics and Prevention Institute, University of ZurichAbstract Background The standard regulatory approach to assess replication success is the two-trials rule, requiring both the original and the replication study to be significant with effect estimates in the same direction. The sceptical p-value was recently presented as an alternative method for the statistical assessment of the replicability of study results. Methods We review the statistical properties of the sceptical p-value and compare those to the two-trials rule. We extend the methodology to non-inferiority trials and describe how to invert the sceptical p-value to obtain confidence intervals. We illustrate the performance of the different methods using real-world evidence emulations of randomized controlled trials (RCTs) conducted within the RCT DUPLICATE initiative. Results The sceptical p-value depends not only on the two p-values, but also on sample size and effect size of the two studies. It can be calibrated to have the same Type-I error rate as the two-trials rule, but has larger power to detect an existing effect. In the application to the results from the RCT DUPLICATE initiative, the sceptical p-value leads to qualitatively similar results than the two-trials rule, but tends to show more evidence for treatment effects compared to the two-trials rule. Conclusion The sceptical p-value represents a valid statistical measure to assess the replicability of study results and is useful in the context of real-world evidence emulations.https://doi.org/10.1186/s12874-025-02589-zRandomized clinical trialReal-world evidenceReplicationSceptical p-valueTwo-trials rule
spellingShingle	Jeanette Köppe Charlotte Micheloud Stella Erdmann Rachel Heyard Leonhard Held Assessing the replicability of RCTs in RWE emulations BMC Medical Research Methodology Randomized clinical trial Real-world evidence Replication Sceptical p-value Two-trials rule
title	Assessing the replicability of RCTs in RWE emulations
title_full	Assessing the replicability of RCTs in RWE emulations
title_fullStr	Assessing the replicability of RCTs in RWE emulations
title_full_unstemmed	Assessing the replicability of RCTs in RWE emulations
title_short	Assessing the replicability of RCTs in RWE emulations
title_sort	assessing the replicability of rcts in rwe emulations
topic	Randomized clinical trial Real-world evidence Replication Sceptical p-value Two-trials rule
url	https://doi.org/10.1186/s12874-025-02589-z
work_keys_str_mv	AT jeanettekoppe assessingthereplicabilityofrctsinrweemulations AT charlottemicheloud assessingthereplicabilityofrctsinrweemulations AT stellaerdmann assessingthereplicabilityofrctsinrweemulations AT rachelheyard assessingthereplicabilityofrctsinrweemulations AT leonhardheld assessingthereplicabilityofrctsinrweemulations

Assessing the replicability of RCTs in RWE emulations

Similar Items