Rapid and ongoing evolution of repetitive sequence structures in human centromeres

Sci Adv. 2020 Dec 11;6(50):eabd9230. doi: 10.1126/sciadv.abd9230. Print 2020 Dec.

Abstract

Our understanding of centromere sequence variation across human populations is limited by its extremely long nested repeat structures called higher-order repeats that are challenging to sequence. Here, we analyzed chromosomes 11, 17, and X using long-read sequencing data for 36 individuals from diverse populations including a Han Chinese trio and 21 Japanese. We revealed substantial structural diversity with many previously unidentified variant higher-order repeats specific to individuals characterizing rapid, haplotype-specific evolution of human centromeric arrays, while frequent single-nucleotide variants are largely conserved. We found a characteristic pattern shared among prevalent variants in human and chimpanzee. Our findings pave the way for studying sequence evolution in human and primate centromeres.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Centromere* / genetics
  • DNA, Satellite*
  • Evolution, Molecular
  • Humans
  • Longitudinal Studies
  • Pan troglodytes / genetics
  • Repetitive Sequences, Nucleic Acid

Substances

  • DNA, Satellite