Seizure Detection: Interreader Agreement and Detection Algorithm Assessments Using a Large Dataset

Mark L Scheuer; Scott B Wilson; Arun Antony; Gena Ghearing; Alexandra Urban; Anto I Bagić

doi:10.1097/WNP.0000000000000709

Seizure Detection: Interreader Agreement and Detection Algorithm Assessments Using a Large Dataset

J Clin Neurophysiol. 2021 Sep 1;38(5):439-447. doi: 10.1097/WNP.0000000000000709.

Authors

Mark L Scheuer¹, Scott B Wilson¹, Arun Antony², Gena Ghearing³, Alexandra Urban², Anto I Bagić²

Affiliations

¹ Persyst Development Corporation, Solana Beach, California, U.S.A.
² University of Pittsburgh Comprehensive Epilepsy Center (UPCEC), University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, U.S.A.; and.
³ Department of Neurology, University of Iowa, Iowa City, Iowa, U.S.A.

Abstract

Purpose: To compare the seizure detection performance of three expert humans and two computer algorithms in a large set of epilepsy monitoring unit EEG recordings.

Methods: One hundred twenty prolonged EEGs, 100 containing clinically reported EEG-evident seizures, were evaluated. Seizures were marked by the experts and algorithms. Pairwise sensitivity and false-positive rates were calculated for each human-human and algorithm-human pair. Differences in human pairwise performance were calculated and compared with the range of algorithm versus human performance differences as a type of statistical modified Turing test.

Results: A total of 411 individual seizure events were marked by the experts in 2,805 hours of EEG. Mean, pairwise human sensitivities and false-positive rates were 84.9%, 73.7%, and 72.5%, and 1.0, 0.4, and 1.0/day, respectively. Only the Persyst 14 algorithm was comparable with humans-78.2% and 1.0/day. Evaluation of pairwise differences in sensitivity and false-positive rate demonstrated that Persyst 14 met statistical noninferiority criteria compared with the expert humans.

Conclusions: Evaluating typical prolonged EEG recordings, human experts had a modest level of agreement in seizure marking and low false-positive rates. The Persyst 14 algorithm was statistically noninferior to the humans. For the first time, a seizure detection algorithm and human experts performed similarly.

MeSH terms

Algorithms*
Correlation of Data
Electroencephalography
Humans
Seizures* / diagnosis
Sensitivity and Specificity