Decoding visual and auditory stimuli from brain activity, such as electroencephalography (EEG), offers promising advancements for enhancing machine-to-human interaction. However, effectively representing EEG signals remains a significant challenge. In this paper, we introduce a novel Delayed Knowledge Transfer (DKT) framework that employs spiking neurons for attention detection, evaluated on our experimental EEG dataset. This framework extracts patterns from audiovisual stimuli to model brain responses in EEG signals while accounting for inherent response delays. By aligning audiovisual features with EEG signals through a shared embedding space, our approach improves the performance of brain-computer interface (BCI) systems. We also present WithMeAttention, a multimodal dataset designed to facilitate research in continuously distinguishing between target and distractor responses. Our method achieves a 3% improvement in accuracy on the WithMeAttention dataset over a baseline model that decodes EEG signals from scratch, highlighting the effectiveness of our approach. Comprehensive analysis across four distinct conditions shows that rhythmic enhancement of visual information can optimize multisensory information processing. Notably, the two conditions featuring rhythmic target presentation (with and without accompanying beeps) achieved significantly better performance than the other scenarios. Furthermore, the delay distributions observed under the different conditions indicate that our delay layer effectively emulates neural processing delays in response to stimuli.
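The two mechanisms named in the abstract, a learnable delay layer and a shared embedding space aligning audiovisual and EEG representations, can be illustrated with a short sketch. The PyTorch code below is a minimal illustration under our own assumptions, not the authors' implementation; the names DelayLayer, align_loss, max_delay, and all dimensions are hypothetical, and the delay is realized by differentiable fractional shifting rather than whatever mechanism the paper uses.

import torch
import torch.nn as nn
import torch.nn.functional as F

class DelayLayer(nn.Module):
    # Applies a learnable per-channel time delay via differentiable
    # fractional shifting (linear interpolation between integer shifts),
    # so the delays themselves receive gradients. Illustrative only.
    def __init__(self, n_channels, max_delay=50):
        super().__init__()
        self.max_delay = max_delay
        self.delays = nn.Parameter(torch.rand(n_channels) * max_delay)

    def forward(self, x):
        # x: (batch, channels, time) spike trains or continuous features
        T = x.shape[-1]
        d = self.delays.clamp(0.0, self.max_delay - 1)
        out = torch.zeros_like(x)
        for c in range(x.shape[1]):
            s = int(d[c].floor())
            frac = d[c] - d[c].floor()          # gradient path to self.delays
            lo = F.pad(x[:, c], (s, 0))[:, :T]  # channel shifted by s samples
            hi = F.pad(x[:, c], (s + 1, 0))[:, :T]
            out[:, c] = (1 - frac) * lo + frac * hi
        return out

def align_loss(eeg_emb, av_emb):
    # Distillation-style alignment: pull EEG (student) embeddings toward
    # audiovisual (teacher) embeddings in the shared space.
    return 1.0 - F.cosine_similarity(eeg_emb, av_emb, dim=-1).mean()

# Toy usage with stand-in data and a linear projection in place of the
# full spiking network described in the paper.
torch.manual_seed(0)
x = (torch.rand(8, 64, 256) > 0.95).float()   # batch of sparse spike trains
delay = DelayLayer(n_channels=64)
eeg_proj = nn.Linear(256, 128)                # maps trials into shared space
eeg_emb = eeg_proj(delay(x).mean(dim=1))      # (8, 128) EEG-side embeddings
av_emb = torch.randn(8, 128)                  # stand-in audiovisual features
loss = align_loss(eeg_emb, av_emb)
loss.backward()  # gradients reach both the projection and the delays

In the paper's setting, the teacher embedding would come from an encoder of the audiovisual stimuli rather than random noise, and the learned per-channel delays are what the abstract's closing claim about delay distributions refers to.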
Keywords: Brain–computer interface; Delay learning; Electroencephalography (EEG); Knowledge distillation; Spiking neural network.
Copyright © 2024 Elsevier Ltd. All rights reserved.