A comprehensive review of computational prediction of genome-wide features

Brief Bioinform. 2020 Jan 17;21(1):120-134. doi: 10.1093/bib/bby110.

Abstract

There are significant correlations among different types of genetic, genomic and epigenomic features within the genome. These correlations make the in silico feature prediction possible through statistical or machine learning models. With the accumulation of a vast amount of high-throughput data, feature prediction has gained significant interest lately, and a plethora of papers have been published in the past few years. Here we provide a comprehensive review on these published works, categorized by the prediction targets, including protein binding site, enhancer, DNA methylation, chromatin structure and gene expression. We also provide discussions on some important points and possible future directions.

Keywords: genomic features; machine learning; prediction model.