Assessing reliability of intra-tumor heterogeneity estimates from single sample whole exome sequencing data

PLoS One. 2019 Nov 7;14(11):e0224143. doi: 10.1371/journal.pone.0224143. eCollection 2019.

Abstract

Tumors are made of evolving and heterogeneous populations of cells which arise from successive appearance and expansion of subclonal populations, following acquisition of mutations conferring them a selective advantage. Those subclonal populations can be sensitive or resistant to different treatments, and provide information about tumor aetiology and future evolution. Hence, it is important to be able to assess the level of heterogeneity of tumors with high reliability for clinical applications. In the past few years, a large number of methods have been proposed to estimate intra-tumor heterogeneity from whole exome sequencing (WES) data, but the accuracy and robustness of these methods on real data remains elusive. Here we systematically apply and compare 6 computational methods to estimate tumor heterogeneity on 1,697 WES samples from the cancer genome atlas (TCGA) covering 3 cancer types (breast invasive carcinoma, bladder urothelial carcinoma, and head and neck squamous cell carcinoma), and two distinct input mutation sets. We observe significant differences between the estimates produced by different methods, and identify several likely confounding factors in heterogeneity assessment for the different methods. We further show that the prognostic value of tumor heterogeneity for survival prediction is limited in those datasets, and find no evidence that it improves over prognosis based on other clinical variables. In conclusion, heterogeneity inference from WES data on a single sample, and its use in cancer prognosis, should be considered with caution. Other approaches to assess intra-tumoral heterogeneity such as those based on multiple samples may be preferable for clinical applications.

MeSH terms

  • Algorithms
  • Breast Neoplasms / genetics
  • Breast Neoplasms / pathology
  • Computational Biology
  • DNA Copy Number Variations / genetics*
  • Exome / genetics
  • Exome Sequencing*
  • Female
  • Genetic Heterogeneity*
  • Genome, Human / genetics*
  • Humans
  • Mutation
  • Squamous Cell Carcinoma of Head and Neck / genetics
  • Squamous Cell Carcinoma of Head and Neck / pathology
  • Urinary Bladder Neoplasms / genetics
  • Urinary Bladder Neoplasms / pathology

Grants and funding

The author(s) received no specific funding for this work. JPV is employed by Google France. The funder provided support in the form of salaries for author JPV but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific role of this author is articulated in the ‘author contributions’ section.