Artificial neural network identified the significant genes to distinguish Idiopathic pulmonary fibrosis

Sci Rep. 2023 Jan 21;13(1):1225. doi: 10.1038/s41598-023-28536-w.

Abstract

Idiopathic pulmonary fibrosis (IPF) is a progressive interstitial lung disease that causes irreversible damage to lung tissue characterized by excessive deposition of extracellular matrix (ECM) and remodeling of lung parenchyma. The current diagnosis of IPF is complex and usually completed by a multidisciplinary team including clinicians, radiologists and pathologists they work together and make decision for an effective treatment, it is imperative to introduce novel practical methods for IPF diagnosis. This study provided a new diagnostic model of idiopathic pulmonary fibrosis based on machine learning. Six genes including CDH3, DIO2, ADAMTS14, HS6ST2, IL13RA2, and IGFL2 were identified based on the differentially expressed genes in IPF patients compare to healthy subjects through a random forest classifier with the existing gene expression databases. An artificial neural network model was constructed for IPF diagnosis based these genes, and this model was validated by the distinctive public datasets with a satisfactory diagnostic accuracy. These six genes identified were significant correlated with lung function, and among them, CDH3 and DIO2 were further determined to be significantly associated with the survival. Putting together, artificial neural network model identified the significant genes to distinguish idiopathic pulmonary fibrosis from healthy people and it is potential for molecular diagnosis of IPF.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Humans
  • Idiopathic Pulmonary Fibrosis* / diagnosis
  • Idiopathic Pulmonary Fibrosis* / genetics
  • Lung
  • Lung Diseases, Interstitial*
  • Sulfotransferases

Substances

  • HS6ST2 protein, human
  • Sulfotransferases