Showing 1–1 of 1 results for author: Khaitan, Y

  1. arXiv:2308.08577  [pdf, other]

    cs.SD cs.CL cs.HC eess.AS

    AffectEcho: Speaker Independent and Language-Agnostic Emotion and Affect Transfer for Speech Synthesis

    Authors: Hrishikesh Viswanath, Aneesh Bhattacharya, Pascal Jutras-Dubé, Prerit Gupta, Mridu Prashanth, Yashvardhan Khaitan, Aniket Bera

    Abstract: Affect is an emotional characteristic encompassing valence, arousal, and intensity, and is a crucial attribute for enabling authentic conversations. While existing text-to-speech (TTS) and speech-to-speech systems rely on strength embedding vectors and global style tokens to capture emotions, these models represent emotions as a component of style or represent them in discrete categories. We propo…

    Submitted 16 August, 2023; originally announced August 2023.