A genome-wide analysis of common fragile sites: what features determine chromosomal instability in the human genome?

Genome Res. 2012 Jun;22(6):993-1005. doi: 10.1101/gr.134395.111. Epub 2012 Mar 28.

Abstract

Chromosomal common fragile sites (CFSs) are unstable genomic regions that break under replication stress and are involved in structural variation. They frequently are sites of chromosomal rearrangements in cancer and of viral integration. However, CFSs are undercharacterized at the molecular level and thus difficult to predict computationally. Newly available genome-wide profiling studies provide us with an unprecedented opportunity to associate CFSs with features of their local genomic contexts. Here, we contrasted the genomic landscape of cytogenetically defined aphidicolin-induced CFSs (aCFSs) to that of nonfragile sites, using multiple logistic regression. We also analyzed aCFS breakage frequencies as a function of their genomic landscape, using standard multiple regression. We show that local genomic features are effective predictors both of regions harboring aCFSs (explaining ∼77% of the deviance in logistic regression models) and of aCFS breakage frequencies (explaining ∼45% of the variance in standard regression models). In our optimal models (having highest explanatory power), aCFSs are predominantly located in G-negative chromosomal bands and away from centromeres, are enriched in Alu repeats, and have high DNA flexibility. In alternative models, CpG island density, transcription start site density, H3K4me1 coverage, and mononucleotide microsatellite coverage are significant predictors. Also, aCFSs have high fragility when colocated with evolutionarily conserved chromosomal breakpoints. Our models are predictive of the fragility of aCFSs mapped at a higher resolution. Importantly, the genomic features we identified here as significant predictors of fragility allow us to draw valuable inferences on the molecular mechanisms underlying aCFSs.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alu Elements
  • Animals
  • Aphidicolin / pharmacology
  • Centromere
  • Chromosomal Instability*
  • Chromosome Breakage
  • Chromosome Fragile Sites*
  • Chromosomes, Human / drug effects
  • CpG Islands
  • Cytogenetic Analysis
  • Genome, Human*
  • Humans
  • Logistic Models
  • Mice
  • Microsatellite Repeats
  • Models, Genetic*
  • Reproducibility of Results
  • Transcription Initiation Site

Substances

  • Aphidicolin