RNA splicing dysfunctions are more widespread than what is believed by only estimating the effects resulting by splicing factor mutations (SFMT) in myeloid neoplasia (MN). The genetic complexity of MN is amenable to machine learning (ML) strategies. We applied an integrative ML approach to identify co-varying features by combining genomic lesions (mutations, deletions, and copy number), exon-inclusion ratio as measure of RNA splicing (percent spliced in, PSI), and gene expression (GE) of 1,258 MN and 63 normal controls. We identified 15 clusters based on mutations, GE, and PSI. Different PSI levels were present at various extents regardless of SFMT suggesting that changes in RNA splicing were not strictly related to SFMT. Combination of PSI and GE further distinguished the features and identified PSI similarities and differences, common pathways, and expression signatures across clusters. Thus, multimodal features can resolve the complex architecture of MN and help identifying convergent molecular and transcriptomic pathways amenable to therapies.
Keywords: Bioinformatics; Cancer; Omics.
© 2023 The Author(s).