DRAMS: A tool to detect and re-align mixed-up samples for integrative studies of multi-omics data

Yi Jiang; Gina Giase; Kay Grennan; Annie W Shieh; Yan Xia; Lide Han; Quan Wang; Qiang Wei; Rui Chen; Sihan Liu; Kevin P White; Chao Chen; Bingshan Li; Chunyu Liu

doi:10.1371/journal.pcbi.1007522

PLoS Comput Biol. 2020 Apr 13;16(4):e1007522. doi: 10.1371/journal.pcbi.1007522. eCollection 2020 Apr.

Authors

Yi Jiang¹,²,³, Gina Giase⁴, Kay Grennan⁵, Annie W Shieh⁵, Yan Xia¹,⁵, Lide Han³, Quan Wang²,³, Qiang Wei²,³, Rui Chen²,³, Sihan Liu¹, Kevin P White⁶,⁷, Chao Chen¹,⁸, Bingshan Li²,³, Chunyu Liu¹,⁵,⁹

Affiliations

¹ Center for Medical Genetics & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan, China.
² Department of Molecular Physiology & Biophysics, Vanderbilt University, Nashville, Tennessee, United States of America.
³ Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America.
⁴ School of Public Health, University of Illinois at Chicago, Chicago, Illinois, United States of America.
⁵ Department of Psychiatry, SUNY Upstate Medical University, Syracuse, New York, United States of America.
⁶ Institute for Genomics and Systems Biology, Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America.
⁷ Tempus Labs Inc, Chicago, Illinois, United States of America.
⁸ National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, Hunan, China.
⁹ School of Psychology, Shaanxi Normal University, Xi'an, Shaanxi, China.

Abstract

Studies of complex disorders benefit from integrative analyses of multiple omics data. Yet sample mix-ups frequently occur in multi-omics studies, weakening statistical power and risking false findings. Accurately aligning sample information, genotype, and corresponding omics data is critical for integrative analyses. We developed DRAMS (https://github.com/Yi-Jiang/DRAMS) to Detect and Re-Align Mixed-up Samples and thereby address the sample mix-up problem. It uses a logistic regression model followed by a modified topological sorting algorithm to identify the potential true IDs based on the relationships among multi-omics data. In tests using simulated data, DRAMS performed better the more types of omics data were used and the smaller the proportion of mix-ups. Applying DRAMS to real data from the PsychENCODE BrainGVEX project, we detected and corrected 201 mix-ups (12.5% of the total data generated). Of the 21 mix-ups involving errors of racial identity, DRAMS re-assigned all data to the correct racial group in the 1000 Genomes project. After correction, the quantitative trait loci (QTL) detected at FDR < 0.01 increased by an average of 1.62-fold. Using DRAMS in multi-omics studies will strengthen statistical power and improve the quality of the results. Although few studies currently have multi-omics data in place, we expect such data to grow quickly, and the need for DRAMS to grow with them.
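
The idea of using relationships between data types to spot mislabeled samples can be illustrated with a toy example. The sketch below is not the DRAMS implementation: it does not reproduce the logistic regression model or the modified topological sorting step, and it only compares genotype calls from two data types. All names (`concordance`, `best_matches`, `flag_mixups`), the 0/1/2 genotype coding, the 0.9 threshold, and the toy data are assumptions made for illustration.

```python
def concordance(a, b):
    """Fraction of sites where two genotype vectors agree.

    Genotypes are coded 0/1/2 (assumed coding); None marks a missing
    call, and sites missing in either vector are skipped.
    """
    pairs = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    if not pairs:
        return 0.0
    return sum(x == y for x, y in pairs) / len(pairs)


def best_matches(omics_a, omics_b):
    """For each ID in omics_a, return the best-matching ID in omics_b and its score."""
    result = {}
    for id_a, geno_a in omics_a.items():
        scores = {id_b: concordance(geno_a, geno_b) for id_b, geno_b in omics_b.items()}
        best = max(scores, key=scores.get)
        result[id_a] = (best, scores[best])
    return result


def flag_mixups(omics_a, omics_b, threshold=0.9):
    """Flag IDs whose best match in the other data type carries a different label.

    The threshold is a hypothetical cutoff, not a value from the paper.
    """
    flagged = {}
    for id_a, (best, score) in best_matches(omics_a, omics_b).items():
        if best != id_a and score >= threshold:
            flagged[id_a] = best
    return flagged


# Toy data: the RNA-seq labels for S2 and S3 have been swapped.
wgs = {
    "S1": [0, 1, 2, 0, 1, 2, 0, 0],
    "S2": [2, 2, 0, 1, 0, 0, 1, 2],
    "S3": [1, 0, 1, 2, 2, 1, 0, 1],
}
rnaseq = {
    "S1": [0, 1, 2, 0, 1, 2, 0, 0],
    "S2": [1, 0, 1, 2, 2, 1, 0, 1],
    "S3": [2, 2, 0, 1, 0, 0, 1, 2],
}

suspects = flag_mixups(wgs, rnaseq)
print(suspects)
```

With only two data types, a mismatch shows that the labels disagree but not which data type carries the wrong label. Resolving that ambiguity is one reason additional omics types help, consistent with the simulation result reported in the abstract.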

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Chromatin / chemistry
Computational Biology / methods*
Computer Simulation
Ethnicity
Female
Frontal Lobe / metabolism*
Genome
Genomics / methods*
Genotype
Humans
Logistic Models
Male
Models, Genetic
Oligonucleotide Array Sequence Analysis
Polymorphism, Single Nucleotide*
RNA-Seq
Reproducibility of Results
Sex Factors
Software
User-Computer Interface
Whole Genome Sequencing

Substances

Chromatin
