Motivation: Both β-value and M-value have been used as metrics to measure methylation levels. The M-value is more statistically valid for the differential analysis of methylation levels. However, the β-value is much more biologically interpretable and needs to be reported when M-value method is used for conducting differential methylation analysis. There is an urgent need to know how to interpret the degree of differential methylation from the M-value. In M-value linear regression model, differential methylation M-value ΔM can be easily obtained from the coefficient estimate, but it is not straightforward to get the differential methylation β-value, Δβ since it cannot be obtained from the coefficient alone.
Results: To fill the gap, we have built a bridge to connect the statistically sound M-value linear regression model and the biologically interpretable Δβ. In this article, three methods were proposed to calculate differential methylation values, Δβ from M-value linear regression model and compared with the Δβ directly obtained from β-value linear regression model. We showed that under the condition that M-value linear regression model is correct, the method M-model-coef is the best among the four methods. M-model-M-mean method works very well too. If the coefficients α0, α2,…αp are not given (as 'MethLAB' package), the M-model-M-mean method should be used. The Δβ directly obtained from β-value linear regression model can give very biased results, especially when M-values are not in (-2, 2) or β-values are not in (0.2, 0.8).
Availability and implementation: The dataset for example is available at the National Center for Biotechnology Information Gene Expression Omnibus repository, GSE104778.
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author(s) 2018. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: [email protected].