Improving glomerular filtration rate estimation by semi-supervised learning: a development and external validation study

Ningshan Li; Hui Huang; Lv Linsheng; Hui Lu; Xun Liu

doi:10.1007/s11255-020-02771-w

Improving glomerular filtration rate estimation by semi-supervised learning: a development and external validation study

Int Urol Nephrol. 2021 Aug;53(8):1649-1658. doi: 10.1007/s11255-020-02771-w. Epub 2021 Mar 12.

Authors

Ningshan Li^#¹, Hui Huang^#², Lv Linsheng³, Hui Lu^{4

5

6}, Xun Liu^{7

8}

Affiliations

¹ SJTU-Yale Joint Center for Biostatistics and Data Science, Department of Bioinformatics and Biostatistics, School of Life Science and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.
² Cardiovascular Department, The Eighth Affiliated Hospital, Sun Yat-Sen University, Shenzhen, China.
³ Operation Room, The Third Affiliated Hospital of Sun Yat-Sen University, Guangdong, China.
⁴ SJTU-Yale Joint Center for Biostatistics and Data Science, Department of Bioinformatics and Biostatistics, School of Life Science and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China. [email protected].
⁵ MoE Key Lab of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University, Shanghai, China. [email protected].
⁶ Shanghai Engineering Research Center for Big Data in Pediatric Precision Medicine, Shanghai, China. [email protected].
⁷ Clinical Data Center of the Third Affiliated Hospital of Sun Yat-Sen University, Guangdong, China. [email protected].
⁸ Division of Nephrology, Department of Internal Medicine, The Third Affiliated Hospital of Sun Yat-Sen University, Guangzhou, 510630, Guangdong, China. [email protected].

^# Contributed equally.

PMID: 33710531
DOI: 10.1007/s11255-020-02771-w

Abstract

Background: Accurate estimating glomerular filtration rate (GFR) is crucial both in clinical practice and epidemiological survey. We incorporated semi-supervised learning technology to improve GFR estimation performance.

Methods: AASK [African American Study of Kidney Disease and Hypertension], CRIC [Chronic Renal Insufficiency Cohort] and DCCT [Diabetes Control and Complications Trial] studies were pooled together for model development, whereas MDRD [Modification of Diet in Renal Disease] and CRISP [Consortium for Radiological Imaging Studies of Polycystic Kidney Disease] studies for model external validation. A total of seven variables (Serum creatinine, Age, Sex, Black race, Diabetes status, Hypertension and Body Mass Index) were included as independent variables, while the outcome variable GFR was measured as the urinary clearance of ¹²⁵I-iothalamate. The revised CKD-EPI [Chronic Kidney Disease Epidemiology Collaboration] creatinine equations was selected as benchmark for performance comparisons. Head-to-head performance comparisons from four-variable to seven-variable combination were conducted between revised CKD-EPI equations and semi-supervised models.

Results: In each independent variables combination, the semi-supervised models consistently achieved superior results in all three performance indicators compared with corresponding revised CKD-EPI equations in the external validation data set. Furthermore, compared with revised four-variable CKD-EPI equation, the seven-variable semi-supervised model performed less biased (mean of difference: 0.03 [- 0.28, 0.34] vs 1.53 [1.28, 1.85], P < 0.001), more precise (interquartile range of difference: 7.94 [7.37, 8.50] vs 8.28 [7.76, 8.83], P = 0.1) and accurate (P30: 88.9% [87.4%, 90.2%] vs 86.0% [84.4%, 87.4%], P < 0.001.

Conclusions: The superior performance of the semi-supervised models during head-to-head comparisons supported the hypothesis that semi-supervised learning technology could improve GFR estimation performance.

Keywords: Chronic kidney disease (CKD); Estimating equation; Glomerular filtration rate (GFR); Semi-supervised learning; Serum creatinine.

Publication types

Validation Study

MeSH terms

Adult
Female
Glomerular Filtration Rate*
Humans
Kidney Function Tests / standards*
Male
Middle Aged
Supervised Machine Learning*

Abstract

Publication types

MeSH terms

Grants and funding