FacialNet: facial emotion recognition for mental health analysis using UNet segmentation with transfer learning model

Front Comput Neurosci. 2024 Dec 11;18:1485121. doi: 10.3389/fncom.2024.1485121. eCollection 2024.

Abstract

Facial emotion recognition (FER) can serve as a valuable tool for assessing emotional states, which are often linked to mental health. However, mental health encompasses a broad range of factors that extend beyond facial expressions. While FER provides insight into certain aspects of emotional well-being, it is best used in conjunction with other assessments to form a more comprehensive picture of an individual's mental health. This work proposes a framework for human FER, called FacialNet, that combines UNet image segmentation with transfer learning based on the EfficientNetB4 model. The proposed model achieves promising results: 90% accuracy for six emotion classes (happy, sad, fear, pain, anger, and disgust) and 96.39% for binary classification (happy and sad). The significance of FacialNet is established through extensive experiments against various machine learning and deep learning models, as well as state-of-the-art prior work in FER, and is further validated with cross-validation to ensure reliable performance across different data splits. The findings highlight the effectiveness of combining UNet image segmentation with EfficientNetB4 transfer learning for accurate and efficient facial emotion recognition, offering promising avenues for real-world applications in emotion-aware systems and affective computing platforms. Experimental results show that the proposed approach substantially outperforms existing work, improving accuracy to 96.39% from the previously reported 94.26%.
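
As a conceptual illustration of the pipeline the abstract describes, the following is a minimal sketch in TensorFlow/Keras: a small UNet produces a face mask that is applied to the input image, and a frozen EfficientNetB4 backbone with a new classification head predicts the six emotion classes. The layer widths, the 224x224 input resolution, the soft-mask multiplication, and the training setup are illustrative assumptions, not the authors' published configuration.

```python
# Illustrative sketch of a FacialNet-style pipeline: UNet segmentation
# followed by EfficientNetB4 transfer learning. Architecture details are
# assumptions for demonstration, not the paper's exact configuration.
import tensorflow as tf
from tensorflow.keras import layers, Model

def build_unet(input_shape=(224, 224, 3)):
    """Minimal UNet that predicts a per-pixel face mask (assumed design)."""
    inputs = layers.Input(input_shape)
    # Encoder: two downsampling stages
    c1 = layers.Conv2D(32, 3, activation="relu", padding="same")(inputs)
    p1 = layers.MaxPooling2D()(c1)
    c2 = layers.Conv2D(64, 3, activation="relu", padding="same")(p1)
    p2 = layers.MaxPooling2D()(c2)
    # Bottleneck
    b = layers.Conv2D(128, 3, activation="relu", padding="same")(p2)
    # Decoder: upsample and concatenate the encoder skip connections
    u2 = layers.UpSampling2D()(b)
    u2 = layers.Concatenate()([u2, c2])
    c3 = layers.Conv2D(64, 3, activation="relu", padding="same")(u2)
    u1 = layers.UpSampling2D()(c3)
    u1 = layers.Concatenate()([u1, c1])
    c4 = layers.Conv2D(32, 3, activation="relu", padding="same")(u1)
    mask = layers.Conv2D(1, 1, activation="sigmoid")(c4)  # soft face mask
    return Model(inputs, mask, name="unet_segmenter")

def build_classifier(num_classes=6, input_shape=(224, 224, 3)):
    """EfficientNetB4 backbone (frozen) with a new classification head."""
    base = tf.keras.applications.EfficientNetB4(
        include_top=False, weights="imagenet", input_shape=input_shape)
    base.trainable = False  # transfer learning: train only the head first
    inputs = layers.Input(input_shape)
    x = base(inputs, training=False)
    x = layers.GlobalAveragePooling2D()(x)
    x = layers.Dropout(0.3)(x)
    outputs = layers.Dense(num_classes, activation="softmax")(x)
    return Model(inputs, outputs, name="emotion_classifier")

# Stage 1: segment the face; Stage 2: classify the masked region.
unet = build_unet()
clf = build_classifier(num_classes=6)
image = tf.random.uniform((1, 224, 224, 3))  # placeholder input image
mask = unet(image)                           # shape (1, 224, 224, 1)
masked = image * mask                        # suppress non-face pixels
probs = clf(masked)                          # scores for six emotions
```

In a transfer-learning setup of this kind, the backbone is typically fine-tuned after the new head converges, by unfreezing the top layers of EfficientNetB4 and continuing training at a low learning rate.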

Keywords: EfficientNet; UNet; facial emotion recognition; image processing; transfer learning.

Grants and funding

The authors extend their appreciation to the Deanship of Research and Graduate Studies at King Khalid University for funding this work through the Small Group Research Project under grant number RGP1/296/45. This work was supported through Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2024R508), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia. Nisreen Innab would like to express sincere gratitude to AlMaarefa University, Riyadh, Saudi Arabia, for supporting this research.