Zum Hauptinhalt springen

Showing 1–24 of 24 results for author: Valstar, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.05166  [pdf, other

    cs.CV

    REACT 2024: the Second Multiple Appropriate Facial Reaction Generation Challenge

    Authors: Siyang Song, Micol Spitale, Cheng Luo, Cristina Palmero, German Barquero, Hengde Zhu, Sergio Escalera, Michel Valstar, Tobias Baur, Fabien Ringeval, Elisabeth Andre, Hatice Gunes

    Abstract: In dyadic interactions, humans communicate their intentions and state of mind using verbal and non-verbal cues, where multiple different facial reactions might be appropriate in response to a specific speaker behaviour. Then, how to develop a machine learning (ML) model that can automatically generate multiple appropriate, diverse, realistic and synchronised human facial reactions from an previous… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    MSC Class: 68T40

  2. arXiv:2306.06583  [pdf, other

    cs.CV

    REACT2023: the first Multi-modal Multiple Appropriate Facial Reaction Generation Challenge

    Authors: Siyang Song, Micol Spitale, Cheng Luo, German Barquero, Cristina Palmero, Sergio Escalera, Michel Valstar, Tobias Baur, Fabien Ringeval, Elisabeth Andre, Hatice Gunes

    Abstract: The Multi-modal Multiple Appropriate Facial Reaction Generation Challenge (REACT2023) is the first competition event focused on evaluating multimedia processing and machine learning techniques for generating human-appropriate facial reactions in various dyadic interaction scenarios, with all participants competing strictly under the same conditions. The goal of the challenge is to provide the firs… ▽ More

    Submitted 11 June, 2023; originally announced June 2023.

    MSC Class: 68T40

  3. arXiv:2207.01113  [pdf, other

    cs.CV cs.HC cs.MM

    Are 3D Face Shapes Expressive Enough for Recognising Continuous Emotions and Action Unit Intensities?

    Authors: Mani Kumar Tellamekala, Ömer Sümer, Björn W. Schuller, Elisabeth André, Timo Giesbrecht, Michel Valstar

    Abstract: Recognising continuous emotions and action unit (AU) intensities from face videos requires a spatial and temporal understanding of expression dynamics. Existing works primarily rely on 2D face appearances to extract such dynamics. This work focuses on a promising alternative based on parametric 3D face shape alignment models, which disentangle different factors of variation, including expression-i… ▽ More

    Submitted 27 May, 2023; v1 submitted 3 July, 2022; originally announced July 2022.

    Comments: Accepted to IEEE Transactions on Affective Computing

  4. arXiv:2206.05833  [pdf, other

    cs.CV cs.HC cs.MM

    COLD Fusion: Calibrated and Ordinal Latent Distribution Fusion for Uncertainty-Aware Multimodal Emotion Recognition

    Authors: Mani Kumar Tellamekala, Shahin Amiriparian, Björn W. Schuller, Elisabeth André, Timo Giesbrecht, Michel Valstar

    Abstract: Automatically recognising apparent emotions from face and voice is hard, in part because of various sources of uncertainty, including in the input data and the labels used in a machine learning framework. This paper introduces an uncertainty-aware audiovisual fusion approach that quantifies modality-wise uncertainty towards emotion prediction. To this end, we propose a novel fusion framework in wh… ▽ More

    Submitted 16 October, 2023; v1 submitted 12 June, 2022; originally announced June 2022.

    Comments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence

  5. arXiv:2203.13285  [pdf, other

    cs.SD cs.CV cs.LG eess.AS

    Continuous-Time Audiovisual Fusion with Recurrence vs. Attention for In-The-Wild Affect Recognition

    Authors: Vincent Karas, Mani Kumar Tellamekala, Adria Mallol-Ragolta, Michel Valstar, Björn W. Schuller

    Abstract: In this paper, we present our submission to 3rd Affective Behavior Analysis in-the-wild (ABAW) challenge. Learningcomplex interactions among multimodal sequences is critical to recognise dimensional affect from in-the-wild audiovisual data. Recurrence and attention are the two widely used sequence modelling mechanisms in the literature. To clearly understand the performance differences between rec… ▽ More

    Submitted 29 March, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: 10 pages, 1 figures, added references and an overview figure

  6. arXiv:2111.15266  [pdf, other

    cs.CV

    Two-stage Temporal Modelling Framework for Video-based Depression Recognition using Graph Representation

    Authors: Jiaqi Xu, Siyang Song, Keerthy Kusumam, Hatice Gunes, Michel Valstar

    Abstract: Video-based automatic depression analysis provides a fast, objective and repeatable self-assessment solution, which has been widely developed in recent years. While depression clues may be reflected by human facial behaviours of various temporal scales, most existing approaches either focused on modelling depression from short-term or video-level facial behaviours. In this sense, we propose a two-… ▽ More

    Submitted 30 November, 2021; originally announced November 2021.

    MSC Class: 68T40 ACM Class: I.2.1

  7. arXiv:2110.13570  [pdf, other

    cs.CV

    Learning Graph Representation of Person-specific Cognitive Processes from Audio-visual Behaviours for Automatic Personality Recognition

    Authors: Siyang Song, Zilong Shao, Shashank Jaiswal, Linlin Shen, Michel Valstar, Hatice Gunes

    Abstract: This approach builds on two following findings in cognitive science: (i) human cognition partially determines expressed behaviour and is directly linked to true personality traits; and (ii) in dyadic interactions individuals' nonverbal behaviours are influenced by their conversational partner behaviours. In this context, we hypothesise that during a dyadic interaction, a target subject's facial re… ▽ More

    Submitted 27 October, 2021; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: Submitted to IJCV

    MSC Class: 68T40 ACM Class: I.2.1

  8. arXiv:2103.13372  [pdf, other

    cs.CV cs.LG

    Affective Processes: stochastic modelling of temporal context for emotion and facial expression recognition

    Authors: Enrique Sanchez, Mani Kumar Tellamekala, Michel Valstar, Georgios Tzimiropoulos

    Abstract: Temporal context is key to the recognition of expressions of emotion. Existing methods, that rely on recurrent or self-attention models to enforce temporal consistency, work on the feature level, ignoring the task-specific temporal dependencies, and fail to model context uncertainty. To alleviate these issues, we build upon the framework of Neural Processes to propose a method for apparent emotion… ▽ More

    Submitted 24 March, 2021; originally announced March 2021.

    Comments: Accepted at CVPR 2021

  9. arXiv:2004.07165  [pdf, other

    cs.CV

    A recurrent cycle consistency loss for progressive face-to-face synthesis

    Authors: Enrique Sanchez, Michel Valstar

    Abstract: This paper addresses a major flaw of the cycle consistency loss when used to preserve the input appearance in the face-to-face synthesis domain. In particular, we show that the images generated by a network trained using this loss conceal a noise that hinders their use for further tasks. To overcome this limitation, we propose a ''recurrent cycle consistency loss" which for different sequences of… ▽ More

    Submitted 14 April, 2020; originally announced April 2020.

    Comments: Accepted to FG 2020 (Oral). arXiv admin note: substantial text overlap with arXiv:1811.03492

  10. arXiv:2004.06657  [pdf, other

    cs.CV

    A Transfer Learning approach to Heatmap Regression for Action Unit intensity estimation

    Authors: Ioanna Ntinou, Enrique Sanchez, Adrian Bulat, Michel Valstar, Georgios Tzimiropoulos

    Abstract: Action Units (AUs) are geometrically-based atomic facial muscle movements known to produce appearance changes at specific facial locations. Motivated by this observation we propose a novel AU modelling problem that consists of jointly estimating their localisation and intensity. To this end, we propose a simple yet efficient approach based on Heatmap Regression that merges both problems into a sin… ▽ More

    Submitted 14 April, 2020; originally announced April 2020.

    Comments: Submitted for review to IEEE Trans. on Affective Computing

  11. arXiv:2001.07739  [pdf, ps, other

    cs.CV cs.LG eess.IV

    EMOPAIN Challenge 2020: Multimodal Pain Evaluation from Facial and Bodily Expressions

    Authors: Joy O. Egede, Siyang Song, Temitayo A. Olugbade, Chongyang Wang, Amanda Williams, Hongying Meng, Min Aung, Nicholas D. Lane, Michel Valstar, Nadia Bianchi-Berthouze

    Abstract: The EmoPain 2020 Challenge is the first international competition aimed at creating a uniform platform for the comparison of machine learning and multimedia processing methods of automatic chronic pain assessment from human expressive behaviour, and also the identification of pain-related behaviours. The objective of the challenge is to promote research in the development of assistive technologies… ▽ More

    Submitted 9 March, 2020; v1 submitted 21 January, 2020; originally announced January 2020.

    Comments: 8 pages

  12. arXiv:1907.11510  [pdf, ps, other

    cs.HC cs.CV cs.IR cs.LG stat.ML

    AVEC 2019 Workshop and Challenge: State-of-Mind, Detecting Depression with AI, and Cross-Cultural Affect Recognition

    Authors: Fabien Ringeval, Björn Schuller, Michel Valstar, NIcholas Cummins, Roddy Cowie, Leili Tavabi, Maximilian Schmitt, Sina Alisamir, Shahin Amiriparian, Eva-Maria Messner, Siyang Song, Shuo Liu, Ziping Zhao, Adria Mallol-Ragolta, Zhao Ren, Mohammad Soleymani, Maja Pantic

    Abstract: The Audio/Visual Emotion Challenge and Workshop (AVEC 2019) "State-of-Mind, Detecting Depression with AI, and Cross-cultural Affect Recognition" is the ninth competition event aimed at the comparison of multimedia processing and machine learning methods for automatic audiovisual health and emotion analysis, with all participants competing strictly under the same conditions. The goal of the Challen… ▽ More

    Submitted 10 July, 2019; originally announced July 2019.

  13. arXiv:1904.02382  [pdf, other

    cs.CV

    Inferring Dynamic Representations of Facial Actions from a Still Image

    Authors: Siyang Song, Enrique Sánchez-Lozano, Linlin Shen, Alan Johnston, Michel Valstar

    Abstract: Facial actions are spatio-temporal signals by nature, and therefore their modeling is crucially dependent on the availability of temporal information. In this paper, we focus on inferring such temporal dynamics of facial actions when no explicit temporal information is available, i.e. from still images. We present a novel approach to capture multiple scales of such temporal dynamics, with an appli… ▽ More

    Submitted 4 April, 2019; originally announced April 2019.

    Comments: 10 pages, 5 figures

    MSC Class: 65D19

  14. arXiv:1811.03492  [pdf, other

    cs.CV

    Triple consistency loss for pairing distributions in GAN-based face synthesis

    Authors: Enrique Sanchez, Michel Valstar

    Abstract: Generative Adversarial Networks have shown impressive results for the task of object translation, including face-to-face translation. A key component behind the success of recent approaches is the self-consistency loss, which encourages a network to recover the original input image when the output generated for a desired attribute is itself passed through the same network, but with the target attr… ▽ More

    Submitted 8 November, 2018; originally announced November 2018.

    Comments: Project site https://github.com/ESanchezLozano/GANnotation , https://youtu.be/-8r7zexg4yg

  15. arXiv:1805.03487  [pdf, other

    cs.CV

    Joint Action Unit localisation and intensity estimation through heatmap regression

    Authors: Enrique Sanchez-Lozano, Georgios Tzimiropoulos, Michel Valstar

    Abstract: This paper proposes a supervised learning approach to jointly perform facial Action Unit (AU) localisation and intensity estimation. Contrary to previous works that try to learn an unsupervised representation of the Action Unit regions, we propose to directly and jointly estimate all AU intensities through heatmap regression, along with the location in the face where they cause visible changes. Ou… ▽ More

    Submitted 20 July, 2018; v1 submitted 9 May, 2018; originally announced May 2018.

    Comments: BMVC 2018. Code and model will be available to download from https://github.com/ESanchezLozano/Action-Units-Heatmaps

  16. arXiv:1805.01259  [pdf, ps, other

    cs.SD cs.CV eess.AS

    Noise Invariant Frame Selection: A Simple Method to Address the Background Noise Problem for Text-independent Speaker Verification

    Authors: Siyang Song, Shuimei Zhang, Björn Schuller, Linlin Shen, Michel Valstar

    Abstract: The performance of speaker-related systems usually degrades heavily in practical applications largely due to the presence of background noise. To improve the robustness of such systems in unknown noisy environments, this paper proposes a simple pre-processing method called Noise Invariant Frame Selection (NIFS). Based on several noisy constraints, it selects noise invariant frames from utterances… ▽ More

    Submitted 3 May, 2018; originally announced May 2018.

    Comments: Paper accepted in IJCNN 2018

    MSC Class: 68T10

  17. arXiv:1802.02565  [pdf, other

    cs.HC cs.AI cs.LG stat.ML

    Applying Cooperative Machine Learning to Speed Up the Annotation of Social Signals in Large Multi-modal Corpora

    Authors: Johannes Wagner, Tobias Baur, Yue Zhang, Michel F. Valstar, Björn Schuller, Elisabeth André

    Abstract: Scientific disciplines, such as Behavioural Psychology, Anthropology and recently Social Signal Processing are concerned with the systematic exploration of human behaviour. A typical work-flow includes the manual annotation (also called coding) of social signals in multi-modal corpora of considerable size. For the involved annotators this defines an exhausting and time-consuming task. In the artic… ▽ More

    Submitted 7 February, 2018; originally announced February 2018.

  18. arXiv:1702.04174  [pdf, other

    cs.CV

    FERA 2017 - Addressing Head Pose in the Third Facial Expression Recognition and Analysis Challenge

    Authors: Michel F. Valstar, Enrique Sánchez-Lozano, Jeffrey F. Cohn, László A. Jeni, Jeffrey M. Girard, Zheng Zhang, Lijun Yin, Maja Pantic

    Abstract: The field of Automatic Facial Expression Analysis has grown rapidly in recent years. However, despite progress in new approaches as well as benchmarking efforts, most evaluations still focus on either posed expressions, near-frontal recordings, or both. This makes it hard to tell how existing expression recognition approaches perform under conditions where faces appear in a wide range of poses (or… ▽ More

    Submitted 14 February, 2017; originally announced February 2017.

    Comments: FERA 2017 Baseline Paper

  19. arXiv:1701.04540  [pdf, other

    cs.CV

    Fusing Deep Learned and Hand-Crafted Features of Appearance, Shape, and Dynamics for Automatic Pain Estimation

    Authors: Joy Egede, Michel Valstar, Brais Martinez

    Abstract: Automatic continuous time, continuous value assessment of a patient's pain from face video is highly sought after by the medical profession. Despite the recent advances in deep learning that attain impressive results in many domains, pain estimation risks not being able to benefit from this due to the difficulty in obtaining data sets of considerable size. In this work we propose a combination of… ▽ More

    Submitted 17 January, 2017; originally announced January 2017.

    Comments: 8 pages, 5 figures

  20. arXiv:1612.02374  [pdf, other

    cs.CV

    Automatic Detection of ADHD and ASD from Expressive Behaviour in RGBD Data

    Authors: Shashank Jaiswal, Michel Valstar, Alinda Gillott, David Daley

    Abstract: Attention Deficit Hyperactivity Disorder (ADHD) and Autism Spectrum Disorder (ASD) are neurodevelopmental conditions which impact on a significant number of children and adults. Currently, the diagnosis of such disorders is done by experts who employ standard questionnaires and look for certain behavioural markers through manual observation. Such methods for their diagnosis are not only subjective… ▽ More

    Submitted 7 December, 2016; originally announced December 2016.

  21. A Functional Regression approach to Facial Landmark Tracking

    Authors: Enrique Sánchez-Lozano, Georgios Tzimiropoulos, Brais Martinez, Fernando De la Torre, Michel Valstar

    Abstract: Linear regression is a fundamental building block in many face detection and tracking algorithms, typically used to predict shape displacements from image features through a linear mapping. This paper presents a Functional Regression solution to the least squares problem, which we coin Continuous Regression, resulting in the first real-time incremental face tracker. Contrary to prior work in Funct… ▽ More

    Submitted 20 September, 2017; v1 submitted 7 December, 2016; originally announced December 2016.

    Comments: Accepted at IEEE TPAMI. This is authors' version. 0162-8828 ©2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017

  22. arXiv:1609.09642  [pdf, other

    cs.CV

    A CNN Cascade for Landmark Guided Semantic Part Segmentation

    Authors: Aaron Jackson, Michel Valstar, Georgios Tzimiropoulos

    Abstract: This paper proposes a CNN cascade for semantic part segmentation guided by pose-specific information encoded in terms of a set of landmarks (or keypoints). There is large amount of prior work on each of these tasks separately, yet, to the best of our knowledge, this is the first time in literature that the interplay between pose estimation and semantic part segmentation is investigated. To address… ▽ More

    Submitted 30 September, 2016; originally announced September 2016.

    Comments: accepted to Geometry Meets Deep Learning ECCV 2016 Workshop

  23. arXiv:1608.01137  [pdf, other

    cs.CV

    Cascaded Continuous Regression for Real-time Incremental Face Tracking

    Authors: Enrique Sánchez-Lozano, Brais Martinez, Georgios Tzimiropoulos, Michel Valstar

    Abstract: This paper introduces a novel real-time algorithm for facial landmark tracking. Compared to detection, tracking has both additional challenges and opportunities. Arguably the most important aspect in this domain is updating a tracker's models as tracking progresses, also known as incremental (face) tracking. While this should result in more accurate localisation, how to do this online and in real… ▽ More

    Submitted 6 August, 2016; v1 submitted 3 August, 2016; originally announced August 2016.

    Comments: ECCV 2016 accepted paper, with supplementary material included as appendices. References to Equations fixed

  24. arXiv:1605.01600  [pdf, other

    cs.CV cs.HC cs.MM

    AVEC 2016 - Depression, Mood, and Emotion Recognition Workshop and Challenge

    Authors: Michel Valstar, Jonathan Gratch, Bjorn Schuller, Fabien Ringeval, Denis Lalanne, Mercedes Torres Torres, Stefan Scherer, Guiota Stratou, Roddy Cowie, Maja Pantic

    Abstract: The Audio/Visual Emotion Challenge and Workshop (AVEC 2016) "Depression, Mood and Emotion" will be the sixth competition event aimed at comparison of multimedia processing and machine learning methods for automatic audio, visual and physiological depression and emotion analysis, with all participants competing under strictly the same conditions. The goal of the Challenge is to provide a common ben… ▽ More

    Submitted 22 November, 2016; v1 submitted 5 May, 2016; originally announced May 2016.

    Comments: Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge, AVEC'16, co-located with the 24th ACM International Conference on Multimedia, MM 2016, pages 3-10, Amsterdam, The Netherlands, October 2016. ACM