Submegabase copy number variations arise during cerebral cortical neurogenesis as revealed by single-cell whole-genome sequencing

Proc Natl Acad Sci U S A. 2018 Oct 16;115(42):10804-10809. doi: 10.1073/pnas.1812702115. Epub 2018 Sep 27.

Abstract

Somatic copy number variations (CNVs) exist in the brain, but their genesis, prevalence, forms, and biological impact remain unclear, even within experimentally tractable animal models. We combined a transposase-based amplification (TbA) methodology for single-cell whole-genome sequencing with a bioinformatic approach for filtering unreliable CNVs (FUnC), developed from machine learning trained on lymphocyte V(D)J recombination. TbA-FUnC offered superior genomic coverage and removed >90% of false-positive CNV calls, allowing extensive examination of submegabase CNVs from over 500 cells throughout the neurogenic period of cerebral cortical development in Mus musculus Thousands of previously undocumented CNVs were identified. Half were less than 1 Mb in size, with deletions 4× more common than amplification events, and were randomly distributed throughout the genome. However, CNV prevalence during embryonic cortical development was nonrandom, peaking at midneurogenesis with levels triple those found at younger ages before falling to intermediate quantities. These data identify pervasive small and large CNVs as early contributors to neural genomic mosaicism, producing genomically diverse cellular building blocks that form the highly organized, mature brain.

Keywords: CNV; brain development; genomic mosaicism; machine learning; single-cell sequencing.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Cerebral Cortex / cytology*
  • Cerebral Cortex / metabolism*
  • DNA Copy Number Variations*
  • Embryo, Mammalian / cytology
  • Embryo, Mammalian / metabolism
  • Gene Expression Regulation, Developmental*
  • Genome
  • Genomics
  • Mice
  • Mice, Inbred C57BL
  • Neurogenesis / genetics*
  • Single-Cell Analysis / methods*
  • Whole Genome Sequencing / methods*