The GENESIS database and tools: A decade of discovery in Mendelian genomics

Exp Neurol. 2024 Dec:382:114978. doi: 10.1016/j.expneurol.2024.114978. Epub 2024 Sep 30.

Abstract

In the past decade, human genetics research saw an acceleration of disease gene discovery and further dissection of the genetic architectures of many disorders. Much of this progress was enabled via data aggregation projects, collaborative data sharing among researchers, and the adoption of sophisticated and standardized bioinformatics analyses pipelines. In 2012, we launched the GENESIS platform, formerly known as GEM.app, with the aims to 1) empower clinical and basic researchers without bioinformatics expertise to analyze and explore genome level data and 2) facilitate the detection of novel pathogenic variation and novel disease genes by leveraging data aggregation and genetic matchmaking. The GENESIS database has grown to over 20,000 datasets from rare disease patients, which were provided by multiple academic research consortia and many individual investigators. Some of the largest global collections of genome-level data are available for Charcot-Marie-Tooth disease, hereditary spastic paraplegia, and cerebellar ataxia. A number of rare disease consortia and networks are archiving their data in this database. Over the past decade, more than 1500 scientists have registered and used this resource and published over 200 papers on gene and variant identifications, which garnered >6000 citations. GENESIS has supported >100 gene discoveries and contributed to approximately half of all gene identifications in the fields of inherited peripheral neuropathies and spastic paraplegia in this time frame. Many diagnostic odysseys of rare disease patients have been resolved. The concept of genomes-to-therapy has borne out for a number of such discoveries that let to rapid clinical trials and expedited natural history studies. This marks GENESIS as one of the most impactful data aggregation initiatives in rare monogenic diseases.

Keywords: Data aggregation; Data sharing; Genome sequencing; Monogenic diseases; Neuromuscular diseases.

Publication types

  • Review

MeSH terms

  • Computational Biology / methods
  • Databases, Genetic* / trends
  • Genomics* / methods
  • Humans