Objective: To develop a lung cancer risk prediction model for female non-smokers. Methods: Based on the Kailuan prospective dynamic cohort (2006.05-2015.12), a nested case-control study was conducted. Participants diagnosed with primary pathologically confirmed lung cancer during follow-up were identified as the case group, and others were identified as the control group. A total of 24 701 subjects were included in the study, including 86 lung cancer cases and 24 615 control population, respectively. Questionnaires, physical examinations, and laboratory tests were conducted to collect relevant information. Multivariable-adjusted logistic regressions were conducted to develop a lung cancer risk prediction model. Area Under the Curve (AUC) and Hosmer-Lemeshow tests were used to evaluate discrimination and calibration, respectively. Ten-fold cross-validation was used for internal validation. Results: Two sets of models were developed: the simple model (including age and monthly income) and the metabolic index model [including age, monthly income, fasting blood glucose (FBG), total cholesterol (TC) and high-density lipoprotein cholesterol (HDL-C)].The AUC (95%CI) [0.745 (0.719-0.771)] of the metabolic index model was higher than that of the simple prediction model [0.688 (0.660-0.716)] (P=0.004). Both the simple model (PHL=0.287) and the metabolic index model (PHL=0.134) were well-calibrated. The results of ten-fold cross-validation indicated sufficient stability, with an average AUC of 0.699 and a standard error (SD) of 0.010. Conclusion: By incorporating metabolic markers, accurate and reliable lung cancer risk prediction model for female non smokers could be developed.
目的: 构建我国非吸烟女性的肺癌风险预测模型。 方法: 于2006年5月至2015年12月,采用巢式病例对照研究设计,基于开滦前瞻性动态队列研究,以随访到的原发性病理学确诊肺癌患者为病例组,以随访期间未发病者为对照组。最终共纳入24 701名研究对象,病例组和对照组分别为86、24 615名。采用问卷调查、体格检查和实验室检测收集相关信息,采用多因素logistic回归模型建立肺癌风险预测模型,使用曲线下面积(AUC)与Hosmer-Lemeshow检验评价模型的预测效能与拟合度,同时利用十倍交叉验证方法进行预测模型的内部验证。 结果: 共构建了两套风险预测模型:基本模型(纳入年龄与月收入情况2个预测指标)和代谢指标模型(纳入年龄、月收入情况、空腹血糖、总胆固醇和高密度脂蛋白胆固醇5个预测指标)。代谢指标模型的AUC(95%CI)[0.745(0.719~0.771)]大于基本模型的AUC(95%CI)[0.688(0.660~0.716)](P=0.004);基本模型(PHL=0.287)与代谢指标模型(PHL=0.134)的拟合度均良好;十倍交叉验证结果提示代谢指标模型预测效果稳定,平均AUC值为0.699,标准误为0.010。 结论: 通过纳入代谢指标,可构建精准可靠的非吸烟女性肺癌风险预测模型。.
Keywords: Female; Forecasting; Lung; Neoplasms; Non-smokers.