Visual Clustering of Transcriptomic Data from Primary and Metastatic Tumors-Dependencies and Novel Pitfalls

Genes (Basel). 2022 Jul 26;13(8):1335. doi: 10.3390/genes13081335.

Abstract

Personalized oncology is a rapidly evolving area and offers cancer patients therapy options that are more specific than ever. However, there is still a lack of understanding regarding transcriptomic similarities or differences of metastases and corresponding primary sites. Applying two unsupervised dimension reduction methods (t-Distributed Stochastic Neighbor Embedding (t-SNE) and Uniform Manifold Approximation and Projection (UMAP)) on three datasets of metastases (n =682 samples) with three different data transformations (unprocessed, log10 as well as log10 + 1 transformed values), we visualized potential underlying clusters. Additionally, we analyzed two datasets (n =616 samples) containing metastases and primary tumors of one entity, to point out potential familiarities. Using these methods, no tight link between the site of resection and cluster formation outcome could be demonstrated, or for datasets consisting of solely metastasis or mixed datasets. Instead, dimension reduction methods and data transformation significantly impacted visual clustering results. Our findings strongly suggest data transformation to be considered as another key element in the interpretation of visual clustering approaches along with initialization and different parameters. Furthermore, the results highlight the need for a more thorough examination of parameters used in the analysis of clusters.

Keywords: UMAP; cancer; metastasis; t-SNE; transcriptomic analysis; visual clustering.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cluster Analysis
  • Humans
  • Neoplasms* / genetics
  • Transcriptome* / genetics

Grants and funding

This project was supported in part by the Apulian Regional Project Medicina di Precisione to A.G.S. Moreover, M.K. (Markus Krebs) was funded by a personal grant from the Else Kröner Foundation (Else Kröner Integrative Clinician Scientist College for Translational Immunology, University Hospital Würzburg, Germany). This publication was supported by the Open Access Publication Fund of the University of Würzburg. The funding sources were not involved in the study design, collection, analysis and interpretation of data, writing of the report or in the decision to submit the article for publication.