Polymorphic NumtS trace human population relationships

Hum Genet. 2012 May;131(5):757-71. doi: 10.1007/s00439-011-1125-3. Epub 2011 Dec 8.

Abstract

The human genome is constantly subjected to evolutionary forces which shape its architecture. Insertions of mitochondrial DNA sequences into the nuclear genome (NumtS) have been described in several eukaryotic species, including Homo sapiens and other primates. The ongoing generation of NumtS has made them valuable markers in primate phylogenetic studies, as well as potentially informative loci for reconstructing the genetic history of modern humans. Here, we report the identification of 53 human-specific NumtS by inspection of the UCSC genome browser, showing that they may be direct insertions of mitochondrial DNA into the human nuclear DNA after the human-chimpanzee split. In silico analyses allowed us to identify 14 NumtS which are polymorphic in terms of their presence/absence within the human genome in individuals of different ancestry. The allele frequencies of these polymorphic NumtS were calculated from 1000 Genomes Project sequence data for 13 populations worldwide, and principal components analysis and hierarchical clustering methods allowed the detection of strong signals of geographical structure related to the genetic diversity of these loci. All identified polymorphic human-specific NumtS, together with a tandemly duplicated NumtS, have also been validated by PCR amplification on a panel of 60 samples belonging to five native populations worldwide, confirming the expected NumtS variability. On the basis of these findings, we have succeeded in depicting the landscape of variation of a series of NumtS in several ethnic groups, advancing their identification as useful markers in the study of human population genetics.
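As an illustration of the kind of calculation the abstract describes, the following is a minimal sketch of how presence/absence allele frequencies might be derived from genotype counts and compared between populations. It is not the authors' pipeline: the population labels, genotype counts, the helper names `insertion_frequency` and `euclidean`, and the choice of Euclidean distance are all assumptions made for the example. The closest pair found at the end corresponds only to the first merge step of an agglomerative hierarchical clustering.

```python
from itertools import combinations
from math import sqrt

# Hypothetical genotype counts per population, one tuple per NumtS locus:
# (homozygous insertion, heterozygous, homozygous absence). Not real data.
counts = {
    "POP_A": [(8, 2, 0), (1, 3, 6)],
    "POP_B": [(7, 3, 0), (0, 4, 6)],
    "POP_C": [(1, 2, 7), (6, 3, 1)],
}

def insertion_frequency(hom_ins, het, hom_abs):
    """Frequency of the insertion (presence) allele at a biallelic locus."""
    n_individuals = hom_ins + het + hom_abs
    return (2 * hom_ins + het) / (2 * n_individuals)

# One frequency vector per population, one entry per locus.
freqs = {
    pop: [insertion_frequency(*genotypes) for genotypes in loci]
    for pop, loci in counts.items()
}

def euclidean(a, b):
    """Euclidean distance between two allele-frequency vectors."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

# Pairwise distances between populations (assumed metric for illustration).
distances = {
    (p, q): euclidean(freqs[p], freqs[q])
    for p, q in combinations(freqs, 2)
}

# The pair with the smallest distance would be merged first
# by an agglomerative clustering procedure.
closest_pair = min(distances, key=distances.get)
print(freqs)
print(closest_pair)
```

With real data, the same frequency matrix (populations by loci) could be passed to a principal components analysis or a full hierarchical clustering routine.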

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cell Nucleus / genetics*
  • Databases as Topic
  • Ethnicity / genetics
  • Genetic Markers*
  • Genetics, Population*
  • Genome, Human*
  • Humans
  • Polymorphism, Genetic
  • Sequence Analysis, DNA

Substances

  • Genetic Markers