Spike sorting is a commonly used analysis method for identifying single-units and multi-units from extracellular recordings. The extracellular recordings contain a mixture of signal components, such as neural and non-neural events, possibly due to motion and breathing artifacts or electrical interference. Identifying single and multi-unit spikes using a simple threshold-crossing method may lead to uncertainty in differentiating the actual neural spikes from non-neural spikes. The traditional method for classifying neural and non-neural units from spike sorting results is manual curation by a trained person. This subjective method suffers from human error and variability and is further complicated by the absence of ground truth in experimental extracellular recordings. Moreover, the manual curation process is time consuming and is becoming intractable due to the growing size and complexity of extracellular datasets. To address these challenges, we, for the first time, present a novel automatic curation method based on an autoencoder model, which is trained on features of simulated extracellular spike waveforms. The model is then applied to experimental electrophysiology datasets, where the reconstruction error is used as the metric for classifying neural and non-neural spikes. As an alternative to the traditional frequency domain and statistical techniques, our proposed method offers a time-domain evaluation model to automate the analysis of extracellular recordings based on learned time-domain features. The model exhibits excellent performance and throughput when applied to real-world extracellular datasets without any retraining, highlighting its generalizability. This method can be integrated into spike sorting pipelines as a pre-processing filtering step or a post-processing curation method.
Keywords: anomaly detection; automatic curation; signal processing; spike sorting; unsupervised learning.
Creative Commons Attribution license.