Zum Hauptinhalt springen

Showing 1–3 of 3 results for author: Zaidi, J

Searching in archive cs. Search in all archives.
.
  1. A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

    Authors: Benjamin van Niekerk, Marc-André Carbonneau, Julian Zaïdi, Mathew Baas, Hugo Seuté, Herman Kamper

    Abstract: The goal of voice conversion is to transform source speech into a target voice, keeping the content unchanged. In this paper, we focus on self-supervised representation learning for voice conversion. Specifically, we compare discrete and soft speech units as input features. We find that discrete representations effectively remove speaker information but discard some linguistic content - leading to… ▽ More

    Submitted 8 June, 2022; v1 submitted 3 November, 2021; originally announced November 2021.

    Comments: 5 pages, 2 figures, 2 tables. Accepted at ICASSP 2022

  2. Daft-Exprt: Cross-Speaker Prosody Transfer on Any Text for Expressive Speech Synthesis

    Authors: Julian Zaïdi, Hugo Seuté, Benjamin van Niekerk, Marc-André Carbonneau

    Abstract: This paper presents Daft-Exprt, a multi-speaker acoustic model advancing the state-of-the-art for cross-speaker prosody transfer on any text. This is one of the most challenging, and rarely directly addressed, task in speech synthesis, especially for highly expressive data. Daft-Exprt uses FiLM conditioning layers to strategically inject different prosodic information in all parts of the architect… ▽ More

    Submitted 5 April, 2022; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: Submitted to Interspeech 2022, 5 pages, 5 figures, 2 tables

    Journal ref: Proc. Interspeech (2022) 4591-4595

  3. arXiv:2012.09276  [pdf, ps, other

    cs.LG cs.AI

    Measuring Disentanglement: A Review of Metrics

    Authors: Marc-André Carbonneau, Julian Zaidi, Jonathan Boilard, Ghyslain Gagnon

    Abstract: Learning to disentangle and represent factors of variation in data is an important problem in AI. While many advances have been made to learn these representations, it is still unclear how to quantify disentanglement. While several metrics exist, little is known on their implicit assumptions, what they truly measure, and their limits. In consequence, it is difficult to interpret results when compa… ▽ More

    Submitted 9 May, 2022; v1 submitted 16 December, 2020; originally announced December 2020.