One-sample missing DNA-methylation value imputation

Abstract Background Currently, the most popular methods for missing DNA-methylation value imputation rely on exploiting methylation patterns across multiple samples from the same population. However, if there is significant variability between individuals or limited data available, these methods mig...

Full description

Saved in:
Bibliographic Details
Main Authors: Christelle Kemda Ngueda, Julia Palm, Flavia Remo, André Scherag, Lutz Leistritz
Format: Article
Language:English
Published: BMC 2025-05-01
Series:BMC Bioinformatics
Subjects:
Online Access:https://doi.org/10.1186/s12859-025-06154-9
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Background Currently, the most popular methods for missing DNA-methylation value imputation rely on exploiting methylation patterns across multiple samples from the same population. However, if there is significant variability between individuals or limited data available, these methods might produce biased results. This situation has prompted researchers to seek alternative approaches for handling single-sample data, particularly in the context of personalized medicine. Accordingly, we propose One-Sample Methyl Imputation (OSMI), an imputation method that can also be used in single-sample applications. Results The proposed method in single-subject cases yielded an average imputation accuracy of RMSE = 0.2713 (95%-CI from 0.2696 to 0.2730) in β-value units (range: 0–1) based on real 450 K BeadChip data sets of 3,402 individuals. It is possible to take the affiliation of individual CpGs to CpG islands into account during the imputation of missing methylation values. This improves the imputation accuracy. In addition, the accuracy of imputation depends in general on the density of CpG sites on DNA-methylation microarrays and increases as the CpG site density increases. OSMI has low memory and computational requirements. Conclusions OSMI uses a single methylome to impute missing values quickly at very low memory constraints. Its imputation accuracy is inferior to other methods if multiple samples are available and these samples are reasonably similar, but OSMI represents a useful addition to the imputation toolbox for the case of single-sample applications.
ISSN:1471-2105