Classification and sequential pattern analysis for improving managerial efficiency and providing better medical service in public healthcare centers

Healthc Inform Res. 2010 Jun;16(2):67-76. doi: 10.4258/hir.2010.16.2.67. Epub 2010 Jun 30.

Abstract

Objectives: THIS STUDY SOUGHT TO FIND ANSWERS TO THE FOLLOWING QUESTIONS: 1) Can we predict whether a patient will revisit a healthcare center? 2) Can we anticipate diseases of patients who revisit the center?

Methods: For the first question, we applied 5 classification algorithms (decision tree, artificial neural network, logistic regression, Bayesian networks, and Naïve Bayes) and the stacking-bagging method for building classification models. To solve the second question, we performed sequential pattern analysis.

Results: WE DETERMINED: 1) In general, the most influential variables which impact whether a patient of a public healthcare center will revisit it or not are personal burden, insurance bill, period of prescription, age, systolic pressure, name of disease, and postal code. 2) The best plain classification model is dependent on the dataset. 3) Based on average of classification accuracy, the proposed stacking-bagging method outperformed all traditional classification models and our sequential pattern analysis revealed 16 sequential patterns.

Conclusions: Classification models and sequential patterns can help public healthcare centers plan and implement healthcare service programs and businesses that are more appropriate to local residents, encouraging them to revisit public health centers.

Keywords: Classification Analysis; Data Mining; Ensemble Method; Public Healthcare Center; Sequential Pattern Analysis.