Peptidomics and Machine Learning-based Evaluation of Noncoding RNA-Derived Micropeptides in Breast Cancer: Expression Patterns and Functional/Therapeutic Insights

Lab Invest. 2024 Dec;104(12):102150. doi: 10.1016/j.labinv.2024.102150. Epub 2024 Oct 10.

Abstract

Breast cancer is a highly heterogeneous disease comprising subtypes that arise from molecular alterations and differ in phenotype, clinical behavior, and prognosis. Noncoding RNA (ncRNA)-derived micropeptides (MPs) add a novel layer of complexity to cancer research, since they can be biologically active and have potential as biomarkers and as therapeutics. However, few large-scale studies address the expression of these peptides at the peptidomics level or evaluate their functions and potential in peptide-based therapeutics for breast cancer. In this study, we aimed to characterize the landscape of ncRNA-derived MPs across breast cancer subtypes in greater depth and to advance the understanding of their relevance to the disease. First, we constructed a data set of 16,349 unique putative MP sequences by integrating 2 previously published lists of predicted ncRNA-derived MPs. We evaluated their expression in high-throughput mass spectrometry data from breast tumor samples of different subtypes. Next, we applied several machine and deep learning tools, such as AntiCP 2.0, MULocDeep, PEPstrMOD, Peptipedia, and PreAIP, to predict their functions, cellular localization, tertiary structure, physicochemical features, and other properties related to therapeutics. We identified 58 peptides expressed in breast tissue, including 27 MPs differentially expressed in tumor compared with nontumor samples and MPs exhibiting tumor or subtype specificity. These peptides presented physicochemical features compatible with the canonical proteome and were predicted to influence the tumor immune environment and participate in cell communication, metabolism, and signaling processes. In addition, some MPs showed potential as anticancer, antiinflammatory, and antiangiogenic molecules.
Our data demonstrate that MPs derived from ncRNAs show tumor-specific and subtype-associated expression patterns, highlighting their potential as biomarkers for molecular classification. We also reinforce the relevance of MPs as biologically active molecules that play a role in breast tumorigenesis, in addition to their potential in peptide-based therapeutics.
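
The data set construction step described in the abstract (integrating 2 published lists into one set of unique putative MP sequences) can be illustrated with a minimal sketch. The abstract does not state how the lists were merged, so everything here is an assumption for illustration: the function names, the (identifier, sequence) input format, the normalization rules, and the toy sequences are all hypothetical and are not the authors' pipeline.

```python
def normalize(seq: str) -> str:
    """Remove whitespace, uppercase, and drop a trailing stop symbol ('*').

    These normalization rules are an assumption, not taken from the paper.
    """
    return "".join(seq.split()).upper().rstrip("*")


def merge_unique(*lists):
    """Merge peptide lists into a mapping of unique sequences.

    Each input is an iterable of (identifier, sequence) pairs.
    Returns a dict: normalized sequence -> list of source identifiers,
    so duplicates across lists collapse into a single entry.
    """
    merged = {}
    for entries in lists:
        for identifier, seq in entries:
            key = normalize(seq)
            if not key:  # skip empty sequences
                continue
            merged.setdefault(key, []).append(identifier)
    return merged


# Toy input standing in for the 2 published lists (hypothetical sequences).
list_a = [("A1", "MKTLLV"), ("A2", "mgsp*")]
list_b = [("B1", "MGSP"), ("B2", "MAAR")]

unique = merge_unique(list_a, list_b)
print(len(unique))     # number of unique sequences
print(unique["MGSP"])  # identifiers that share this sequence
```

Keeping the source identifiers for each unique sequence, rather than only the sequence itself, makes it possible to trace any peptide later detected in mass spectrometry data back to the list it came from.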

Keywords: machine learning; micropeptides; noncoding RNAs; subtypes; tumorigenesis.

MeSH terms

  • Biomarkers, Tumor / genetics
  • Biomarkers, Tumor / metabolism
  • Breast Neoplasms* / genetics
  • Breast Neoplasms* / metabolism
  • Female
  • Humans
  • Machine Learning
  • Micropeptides
  • Peptides* / metabolism
  • Proteomics / methods
  • RNA, Untranslated* / genetics
  • RNA, Untranslated* / metabolism

Substances

  • RNA, Untranslated
  • Peptides
  • Biomarkers, Tumor
  • Micropeptides