Resolving complex duplication variants in autism spectrum disorder using long-read genome sequencing

Genome Res. 2024 Nov 20;34(11):1763-1773. doi: 10.1101/gr.279263.124.

Abstract

Rare or de novo structural variation, primarily in the form of copy number variants, is detected in 5%-10% of autism spectrum disorder (ASD) families. While complex structural variants involving duplications can generally be detected using microarray or short-read genome sequencing (GS), these methods frequently fail to characterize breakpoints at nucleotide resolution, requiring additional molecular methods for validation and fine-mapping. Here, we use Oxford Nanopore Technologies PromethION long-read GS to characterize complex genomic rearrangements (CGRs) involving large duplications that segregate with ASD in five families. In total, we investigated 13 CGR carriers and were able to resolve all breakpoint junctions at nucleotide resolution. While all breakpoints were identified, the precise genomic architecture of one rearrangement remained unresolved with three different potential structures. The findings in two families include potential fusion genes formed through duplication rearrangements, involving IL1RAPL1-DMD and SUPT16H-CHD8 In two of the families originating from the same geographical region, an identical rearrangement involving ANK2 was identified, which likely represents a founder variant. In addition, we analyze methylation status directly from the long-read data, allowing us to assess the activity of rearranged genes and regulatory regions. Investigation of methylation across the CGRs reveals aberrant methylation status in carriers across a rearrangement affecting the CREBBP locus. In aggregate, our results demonstrate the utility of nanopore sequencing to pinpoint CGRs associated with ASD in five unrelated families, and highlight the importance of a gene-centric description of disease-associated complex chromosomal rearrangements.

MeSH terms

  • Autism Spectrum Disorder* / genetics
  • Child
  • Chromosome Breakpoints
  • DNA Copy Number Variations*
  • DNA Methylation
  • Female
  • Gene Duplication*
  • Gene Rearrangement
  • Genome, Human
  • Humans
  • Male
  • Pedigree
  • Whole Genome Sequencing / methods