This study introduces MinSnore, a novel deep learning model tailored for real-time snoring detection and reduction, specifically designed for deployment on low-configuration edge devices. By integrating MobileViTV3 blocks into the Dynamic MobileNetV3 backbone model architecture, MinSnore leverages both Convolutional Neural Networks (CNNs) and transformers to deliver enhanced feature representations with minimal computational overhead. The model was pre-trained on a diverse dataset of 46,349 audio files using the Self-Supervised Learning with Barlow Twins (SSL-BT) method, followed by fine-tuning on 17,355 segmented clips extracted from this dataset. MinSnore represents a significant breakthrough in snoring detection, achieving an accuracy of 96.37 %, precision of 96.31 %, recall of 94.12 %, and an F1-score of 95.02 %. When deployed on a single-board computer like a Raspberry Pi, the system demonstrated a reduction in snoring duration during real-world experiments. These results underscore the importance of this work in addressing sleep-related health issues through an efficient, low-cost, and highly accurate snoring mitigation solution.
Keywords: Dynamic convolutions; Edge device; MinSnore; Self-supervise learning; Snoring prevention.
Copyright © 2024 Elsevier Ltd. All rights reserved.