Comparison of missing data approaches in linkage analysis

Chao Xing; Fredrick R Schumacher; David V Conti; John S Witte

doi:10.1186/1471-2156-4-S1-S44

Comparison of missing data approaches in linkage analysis

BMC Genet. 2003 Dec 31;4 Suppl 1(Suppl 1):S44. doi: 10.1186/1471-2156-4-S1-S44.

Authors

Chao Xing¹, Fredrick R Schumacher, David V Conti, John S Witte

Affiliation

¹ Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, Ohio, USA. [email protected]

Abstract

Background: Observational cohort studies have been little used in linkage analyses due to their general lack of large, disease-specific pedigrees. Nevertheless, the longitudinal nature of such studies makes them potentially valuable for assessing the linkage between genotypes and temporal trends in phenotypes. The repeated phenotype measures in cohort studies (i.e., across time), however, can have extensive missing information. Existing methods for handling missing data in observational studies may decrease efficiency, introduce biases, and give spurious results. The impact of such methods when undertaking linkage analysis of cohort studies is unclear. Therefore, we compare here six methods of imputing missing repeated phenotypes on results from genome-wide linkage analyses of four quantitative traits from the Framingham Heart Study cohort.

Results: We found that simply deleting observations with missing values gave many more nominally statistically significant linkages than the other five approaches. Among the latter, those with similar underlying methodology (i.e., imputation- versus model-based) gave the most consistent results, although some discrepancies remained.

Conclusion: Different methods for addressing missing values in linkage analyses of cohort studies can give substantially diverse results, and must be carefully considered to protect against biases and spurious findings.

Publication types

Comparative Study

MeSH terms

Algorithms
Blood Pressure / genetics
Body Mass Index
Chromosomes, Human, Pair 10 / genetics
Chromosomes, Human, Pair 2 / genetics
Epidemiologic Methods*
Female
Genetic Linkage / genetics*
Genetic Markers / genetics
Humans
Likelihood Functions
Male
Markov Chains
Models, Statistical
Monte Carlo Method
Pedigree

Substances

Genetic Markers

Abstract

Publication types

MeSH terms

Substances

Grants and funding