Objective: The objective of this study was to determine whether an artificial neural network, a new data analysis method, offers increased performance over conventional logistic regression in predicting the presence of a pulmonary embolism for patients in a well-known data set.
Materials and methods: Data from the 1064 patients who received an angiographically based diagnosis of pulmonary embolism in the Prospective Investigation of Pulmonary Embolism Diagnosis study were encoded using a previously described method. The 21 input variables represented abnormalities identified on each patient's ventilation-perfusion scan and chest radiograph. Two methods-an artificial neural network with one hidden layer and a multivariate logistic regression-were compared for accuracy in predicting the presence or absence of pulmonary embolism on subsequent pulmonary arteriography.
Results: No significant difference was observed between the two methods. Areas under the receiver operating characteristic curves +/- standard deviation were 0.78 +/- 0.02 for the artificial neural network model and 0.79 +/- 0.02 for the logistic regression model. Furthermore, use of these two methods resulted in no more diagnostic accuracy than did the use of a simple threshold model based only on the number of subsegmental perfusion defects, which was the dominant input variable.
Conclusion: In the study population, the usefulness of data from ventilation-perfusion scans as predictors of the presence of a pulmonary embolism was similar for the three analytic methods, a finding that reinforces the importance of making comparisons to simpler or more established methods when performing studies involving complex analytic models, such as artificial neural networks.