Construction of a searchable database for gene expression changes in spinal cord injury experiments

Eric C Rouchka; Carlos de Almeida; Randi B House; Jonah C Daneshmand; Julia H Chariker; Sujata Saraswat-Ohri; Cynthia Gomes; Morgan Sharp; Alice Shum-Siu; Greta M Cesarz; Jeffrey C Petruska; David S K Magnuson

doi:10.1101/2023.02.01.526630

Construction of a searchable database for gene expression changes in spinal cord injury experiments

bioRxiv [Preprint]. 2023 Feb 3:2023.02.01.526630. doi: 10.1101/2023.02.01.526630.

Authors

Eric C Rouchka^{1

2

3}, Carlos de Almeida^{4

5}, Randi B House^{5

6}, Jonah C Daneshmand³, Julia H Chariker^{2

7}, Sujata Saraswat-Ohri^{5

8}, Cynthia Gomes^{5

9}, Morgan Sharp^{5

8}, Alice Shum-Siu^{5

8}, Greta M Cesarz⁵, Jeffrey C Petruska^{5

8

9}, David S K Magnuson^{4

5

8

9}

Affiliations

¹ Department of Biochemistry and Molecular Genetics, University of Louisville School of Medicine, University of Louisville, Louisville, KY USA.
² Kentucky IDeA Networks of Biomedical Research Excellence (KY INBRE) Bioinformatics Core, University of Louisville School of Medicine, 522 East Gray Street, Louisville, KY USA 40202.
³ Bioinformatics Program, School of Interdisciplinary and Graduate Studies, University of Louisville, Louisville, KY.
⁴ Translational Neuroscience Program, School of Interdisciplinary and Graduate Studies, University of Louisville, Louisville, KY.
⁵ Kentucky Spinal Cord Injury Research Center, School of Medicine, University of Louisville, Louisville, KY.
⁶ Department of Bioengineering, Speed School of Engineering, University of Louisville, Louisville, KY.
⁷ Department of Neuroscience Training, School of Medicine, University of Louisville, Louisville, KY.
⁸ Department of Neurological Surgery, School of Medicine, University of Louisville, Louisville, KY USA.
⁹ Department of Anatomical Sciences and Neurobiology, School of Medicine, University of Louisville, Louisville, KY.

Abstract

Spinal cord injury (SCI) is a debilitating disease resulting in an estimated 18,000 new cases in the United States on an annual basis. Significant behavioral research on animal models has led to a large amount of data, some of which has been catalogued in the Open Data Commons for Spinal Cord Injury (ODC-SCI). More recently, high throughput sequencing experiments have been utilized to understand molecular mechanisms associated with SCI, with nearly 6,000 samples from over 90 studies available in the Sequence Read Archive. However, to date, no resource is available for efficiently mining high throughput sequencing data from SCI experiments. Therefore, we have developed a protocol for processing RNA-Seq samples from high-throughput sequencing experiments related to SCI resulting in both raw and normalized data that can be efficiently mined for comparisons across studies as well as homologous discovery across species. We have processed 1,196 publicly available RNA-seq samples from 50 bulk RNA-Seq studies across nine different species, resulting in an SQLite database that can be used by the SCI research community for further discovery. We provide both the database as well as a web-based front-end that can be used to query the database for genes of interest, differential gene expression, genes with high variance, and gene set enrichments.

Keywords: ODC-SCI; RNA-Seq; SCI; SQLite; bulk RNA-Seq; differential gene expression; spinal cord injury; transcriptomics.

Publication types

Preprint

Grants and funding

P20 GM103436/GM/NIGMS NIH HHS/United States