FGFICA: Independent Component Analysis of Fusion Genomic Features for Mining Epi-Transcriptome Profiling Data

IEEE/ACM Trans Comput Biol Bioinform. 2023 May-Jun;20(3):1842-1853. doi: 10.1109/TCBB.2022.3220552. Epub 2023 Jun 5.

Abstract

Existing studies indicate that in-depth studies of the N6-methyladenosine (m6A) co-methylation patterns in epi-transcriptome profiling data may contribute to understanding its complex regulatory mechanisms. In order to fully utilize the potential features of epi-transcriptome data and consider the advantages of independent component analysis (ICA) in local pattern mining tasks, we propose an ICA algorithm that fuses genomic features (FGFICA) to discover potential functional patterns. FGFICA first extracts and fuses the confidence information, homologous information, and genomic features implied in epi-transcriptome profiling data and then solves the model based on negative entropy maximization. Finally, to mine m6A co-methylation patterns, the probability density of the extracted independent components is estimated. In the experiment, FGFICA extracted 64 m6A co-methylation patterns from our collected MeRIP-seq high-throughput data. Further analysis of some selected patterns revealed that the m6A sites involved in these patterns were highly correlated with four m6A methylases, and these patterns were significantly enriched in some pathways known to be regulated by m6A.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Gene Expression Profiling*
  • Genomics
  • Methylation
  • Transcriptome* / genetics