Background: Hepatocellular carcinoma (HCC) is the third prime cause of malignancy-related mortality worldwide. Early and accurate identification of HCC is crucial for good prognosis, efficacy of therapy, and survival rates of the patients. We aimed to develop a machine-learning model incorporating differentially expressed RNA signatures with laboratory parameters to construct an RNA signature-based diagnostic model for HCC.
Methods: We have used five classifiers (KNN, RF, SVM, LGBM, and DNNs) to predict the liver disease (HCC). The classifiers were trained on 187 samples and then tested on 80 samples. The model included 22 features (age, sex, smoking, cirrhosis, non-cirrhosis, albumin, ALT, AST bilirubin (total and direct), INR, AFP, HBV Ag, HCV Abs, RQmiR-1298, RQmiR-1262, RQmiR-106b-3p, RQmRNARAB11A, and RQSTAT1, RQmRNAATG12, RQLnc-WRAP53, RQLncRNA- RP11-513I15.6).
Results: LGBM achieved the highest accuracy of 98.75% in predicting HCC among all models surpassing Random Forest (96.25%), DNN (91.25%), SVC (88.75%), and KNN (87.50%).
Conclusion: Our machine-learning model incorporating the expression data of RAB11A/STAT1/ATG12/miR-1262/miR-1298/miR-106b-3p/lncRNA-RP11-513I15.6/lncRNA-WRAP53 signature and clinical data represents a potential novel diagnostic model for HCC.
Keywords: HCC; LGBM; autophagy; hepatocellular carcinoma; machine-learning.
© 2024 Indian National Association for Study of the Liver. Published by Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.