Background: Studies of model-based linkage analysis show that misspecification of the trait or marker model leads to decreased power or an increased Type I error rate. Type I error rates increase when marker-related parameters (e.g., allele frequencies) are misspecified and ascertainment is through the trait, but LOD-score methods are expected to be robust when ascertainment is random (as is often the case in linkage studies of quantitative traits). In previous studies, the power of LOD-score linkage analysis using the "correct" generating model for the trait was found to increase when marker allele frequencies were misspecified and parental data were missing. An investigation of Type I error rates, conducted in the absence of parental genotype data and with misspecification of marker allele frequencies, showed that inflation of the Type I error rate accounted for at least part of this apparent increase in power. To investigate whether the observed inflation of the Type I error rate in model-based LOD-score linkage analysis was due to sampling variation, the trait model was estimated from each sample using REGCHUNT, an automated segregation analysis program that fits models by maximum likelihood from many different sets of initial parameter estimates.
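The methodological step highlighted above is refitting the trait model in each sample by maximum likelihood from many different sets of initial parameter estimates, which REGCHUNT automates to avoid convergence to a local optimum. As an illustration of that multi-start strategy only (not of REGCHUNT's actual model or algorithm), the following minimal Python sketch fits a hypothetical two-component normal-mixture trait model from several random starts and keeps the best fit; the likelihood, parameterization, and function names are illustrative assumptions, not anything specified in this study:

```python
import numpy as np
from scipy.optimize import minimize
from scipy.stats import norm

def neg_log_lik(params, x):
    """Negative log-likelihood of a two-component normal mixture,
    a stand-in for a simple major-gene trait model (not REGCHUNT's)."""
    p, mu1, mu2, log_sd = params
    p = min(max(p, 1e-6), 1 - 1e-6)   # keep mixing proportion inside (0, 1)
    sd = np.exp(log_sd)               # sd parameterized on the log scale
    lik = p * norm.pdf(x, mu1, sd) + (1 - p) * norm.pdf(x, mu2, sd)
    return -np.sum(np.log(lik + 1e-300))

def multistart_mle(x, n_starts=20, seed=0):
    """Maximize the likelihood from many random initial estimates,
    retaining the best fit -- the multi-start strategy described above."""
    rng = np.random.default_rng(seed)
    best = None
    for _ in range(n_starts):
        init = [rng.uniform(0.1, 0.9),              # mixing proportion
                rng.normal(np.mean(x), np.std(x)),  # component mean 1
                rng.normal(np.mean(x), np.std(x)),  # component mean 2
                np.log(np.std(x))]                  # log common sd
        fit = minimize(neg_log_lik, init, args=(x,), method="Nelder-Mead")
        if fit.success and (best is None or fit.fun < best.fun):
            best = fit
    return best

# Example: trait values mixing two hypothetical genotype means
rng = np.random.default_rng(1)
x = np.concatenate([rng.normal(0, 1, 300), rng.normal(3, 1, 100)])
print(multistart_mle(x).x)
```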
Results: The Type I error rates observed using the trait models estimated by REGCHUNT were usually closer to the nominal levels than those obtained when the generating trait model was assumed.
Conclusion: This suggests that the observed inflation of the Type I error rate upon misspecification of marker allele frequencies is at least partly due to sampling variation. Thus, with missing parental genotype data, LOD-score linkage analysis is not as robust to misspecification of marker allele frequencies as has been commonly thought.