Pooling biospecimens and limits of detection: effects on ROC curve analysis

Biostatistics. 2006 Oct;7(4):585-98. doi: 10.1093/biostatistics/kxj027. Epub 2006 Mar 10.

Abstract

Epidemiological studies that evaluate biomarkers frequently face two restrictions: cost and instrument sensitivity. Costs can hamper the evaluation of the effectiveness of new biomarkers. In addition, many assays are affected by a limit of detection (LOD), which depends on the instrument sensitivity. Two common cost-cutting strategies are taking a random sample of the available specimens and pooling biospecimens. We compare these two sampling strategies when an LOD effect exists. The comparison is based on the efficiency of receiver operating characteristic (ROC) curve analysis, specifically the estimation of the area under the ROC curve (AUC) for normally distributed markers. We propose and examine a method to estimate the AUC from pooled and unpooled samples when an LOD is in effect. We conclude that pooling is the most efficient cost-cutting strategy when the LOD affects less than 50% of the data. However, when much more than 50% of the data are affected, the pooling design is not recommended.
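
As a rough illustration of the quantities the abstract refers to, the sketch below computes the AUC for two normally distributed markers with the standard binormal formula, AUC = Phi((mu_case - mu_ctrl) / sqrt(sd_case^2 + sd_ctrl^2)), and shows how pooling changes the share of measurements falling below an LOD. It is not the estimation method proposed in the paper. The function names are hypothetical, and the code assumes that a pooled measurement equals the arithmetic mean of the p specimens in the pool (equal volumes, no measurement error), so that pooled values are normal with the same mean and standard deviation sd / sqrt(p).

```python
from math import sqrt
from statistics import NormalDist


def binormal_auc(mu_case, sd_case, mu_ctrl, sd_ctrl):
    """AUC = P(case marker > control marker) for independent normal markers."""
    z = (mu_case - mu_ctrl) / sqrt(sd_case ** 2 + sd_ctrl ** 2)
    return NormalDist().cdf(z)


def fraction_below_lod(mu, sd, lod, pool_size=1):
    """Expected share of measurements below the LOD.

    Assumption (not from the abstract): a pool of `pool_size` specimens is
    measured as their arithmetic mean, so its standard deviation is
    sd / sqrt(pool_size). pool_size=1 corresponds to unpooled specimens.
    """
    pooled_sd = sd / sqrt(pool_size)
    return NormalDist(mu, pooled_sd).cdf(lod)


def individual_sd_from_pooled(pooled_sd, pool_size):
    """Rescale a pooled-sample standard deviation to the individual level."""
    return pooled_sd * sqrt(pool_size)


if __name__ == "__main__":
    # Hypothetical marker: controls ~ N(0, 1), cases ~ N(1, 1).
    print("AUC:", round(binormal_auc(1.0, 1.0, 0.0, 1.0), 4))

    # LOD below the control mean: fewer than half of the values are affected,
    # and pooling (p = 4) lowers the affected share further.
    print("below LOD, unpooled:", round(fraction_below_lod(0.0, 1.0, -0.5), 4))
    print("below LOD, pooled:  ", round(fraction_below_lod(0.0, 1.0, -0.5, 4), 4))

    # LOD above the control mean: more than half of the values are affected,
    # and pooling raises the affected share.
    print("below LOD, unpooled:", round(fraction_below_lod(0.0, 1.0, 0.5), 4))
    print("below LOD, pooled:  ", round(fraction_below_lod(0.0, 1.0, 0.5, 4), 4))
```

Under these assumptions, averaging concentrates pooled values around the mean. An LOD below the mean therefore censors fewer pooled than unpooled measurements, and an LOD above the mean censors more. This is one intuition consistent with the 50% threshold stated in the abstract, not a reproduction of the paper's efficiency comparison.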

Publication types

  • Research Support, N.I.H., Intramural

MeSH terms

  • Biomarkers / analysis*
  • Biometry / methods*
  • Data Interpretation, Statistical
  • Epidemiologic Measurements
  • Humans
  • Likelihood Functions
  • Models, Biological
  • Models, Statistical
  • ROC Curve*

Substances

  • Biomarkers