Showing 1–5 of 5 results for author: Papayiannis, C

  1. arXiv:2212.03398   

    eess.AS cs.CL cs.SD

    Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue

    Authors: Daxin Tan, Nikos Kargas, David McHardy, Constantinos Papayiannis, Antonio Bonafonte, Marek Strelec, Jonas Rohnke, Agis Oikonomou Filandras, Trevor Wood

    Abstract: Entrainment is the phenomenon by which an interlocutor adapts their speaking style to align with their partner in conversations. It has been found in different dimensions, such as acoustic, prosodic, lexical, or syntactic. In this work, we explore and utilize the entrainment phenomenon to improve spoken dialogue systems for voice assistants. We first examine the existence of the entrainment phenomenon in…

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: This version has been removed by arXiv administrators because the submitter did not have the right to assign a license at the time of submission

  2. arXiv:2102.06357

    cs.SD cs.LG eess.AS

    Contrastive Unsupervised Learning for Speech Emotion Recognition

    Authors: Mao Li, Bo Yang, Joshua Levy, Andreas Stolcke, Viktor Rozgic, Spyros Matsoukas, Constantinos Papayiannis, Daniel Bone, Chao Wang

    Abstract: Speech emotion recognition (SER) is a key technology to enable more natural human-machine communication. However, SER has long suffered from a lack of public large-scale labeled datasets. To circumvent this problem, we investigate how unsupervised representation learning on unlabeled datasets can benefit SER. We show that the contrastive predictive coding (CPC) method can learn salient representat…

    Submitted 12 February, 2021; originally announced February 2021.

  3. arXiv:1901.05852

    eess.AS cs.SD

    Detecting Sound-Absorbing Materials in a Room from a Single Impulse Response using a CRNN

    Authors: Constantinos Papayiannis, Christine Evers, Patrick A. Naylor

    Abstract: The materials of surfaces in a room play an important role in shaping the auditory experience within it. Different materials absorb energy at different levels. The level of absorption also varies across frequencies. This paper investigates how cues from a measured impulse response in the room can be exploited by machines to detect the materials present. With this motivation, this paper proposes…

    Submitted 27 October, 2019; v1 submitted 17 January, 2019; originally announced January 2019.

    Comments: Submitted for review for IEEE ICASSP 2020

  4. arXiv:1901.03257

    eess.AS cs.SD

    Data Augmentation of Room Classifiers using Generative Adversarial Networks

    Authors: Constantinos Papayiannis, Christine Evers, Patrick A. Naylor

    Abstract: The classification of acoustic environments allows machines to better understand the auditory world around them. The use of deep learning to teach machines to discriminate between different rooms is a new area of research. Like other learning tasks, this task suffers from high dimensionality and the limited availability of training data. Data augmentation methods have prov…

    Submitted 4 December, 2020; v1 submitted 10 January, 2019; originally announced January 2019.

    Comments: Submitted to IEEE/ACM Transactions on Audio, Speech, and Language Processing

  5. arXiv:1812.09324

    eess.AS cs.SD

    End-to-End Classification of Reverberant Rooms using DNNs

    Authors: Constantinos Papayiannis, Christine Evers, Patrick A. Naylor

    Abstract: Reverberation is present in our workplaces, our homes, concert halls and theatres. This paper investigates how deep learning can use the effect of reverberation on speech to classify a recording in terms of the room in which it was recorded. Existing approaches in the literature rely on domain expertise to manually select acoustic parameters as inputs to classifiers. Estimation of these parameters…

    Submitted 1 November, 2020; v1 submitted 21 December, 2018; originally announced December 2018.

    Comments: Accepted for publication in IEEE/ACM Transactions on Audio, Speech, and Language Processing