Submegabase copy number variations arise during cerebral cortical neurogenesis as revealed by single-cell whole-genome sequencing

Suzanne Rohrback; Craig April; Fiona Kaper; Richard R Rivera; Christine S Liu; Benjamin Siddoway; Jerold Chun

doi:10.1073/pnas.1812702115

Submegabase copy number variations arise during cerebral cortical neurogenesis as revealed by single-cell whole-genome sequencing

Proc Natl Acad Sci U S A. 2018 Oct 16;115(42):10804-10809. doi: 10.1073/pnas.1812702115. Epub 2018 Sep 27.

Authors

Suzanne Rohrback^{1

2}, Craig April³, Fiona Kaper³, Richard R Rivera¹, Christine S Liu^{1

2}, Benjamin Siddoway¹, Jerold Chun⁴

Affiliations

¹ Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA 92037.
² Biomedical Sciences Graduate Program, University of California, San Diego, La Jolla, CA 92093.
³ Applied Genomics Tools, Research and Technology Development, Illumina Inc., San Diego, CA 92122.
⁴ Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA 92037; [email protected].

Abstract

Somatic copy number variations (CNVs) exist in the brain, but their genesis, prevalence, forms, and biological impact remain unclear, even within experimentally tractable animal models. We combined a transposase-based amplification (TbA) methodology for single-cell whole-genome sequencing with a bioinformatic approach for filtering unreliable CNVs (FUnC), developed from machine learning trained on lymphocyte V(D)J recombination. TbA-FUnC offered superior genomic coverage and removed >90% of false-positive CNV calls, allowing extensive examination of submegabase CNVs from over 500 cells throughout the neurogenic period of cerebral cortical development in Mus musculus Thousands of previously undocumented CNVs were identified. Half were less than 1 Mb in size, with deletions 4× more common than amplification events, and were randomly distributed throughout the genome. However, CNV prevalence during embryonic cortical development was nonrandom, peaking at midneurogenesis with levels triple those found at younger ages before falling to intermediate quantities. These data identify pervasive small and large CNVs as early contributors to neural genomic mosaicism, producing genomically diverse cellular building blocks that form the highly organized, mature brain.

Keywords: CNV; brain development; genomic mosaicism; machine learning; single-cell sequencing.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Animals
Cerebral Cortex / cytology*
Cerebral Cortex / metabolism*
DNA Copy Number Variations*
Embryo, Mammalian / cytology
Embryo, Mammalian / metabolism
Gene Expression Regulation, Developmental*
Genome
Genomics
Mice
Mice, Inbred C57BL
Neurogenesis / genetics*
Single-Cell Analysis / methods*
Whole Genome Sequencing / methods*

Abstract

Publication types

MeSH terms

Grants and funding