Showing 1–8 of 8 results for author: Jouvet, D

  1. arXiv:2307.02244  [pdf, other]

    cs.SD eess.AS

    Self-supervised learning with diffusion-based multichannel speech enhancement for speaker verification under noisy conditions

    Authors: Sandipana Dowerah, Ajinkya Kulkarni, Romain Serizel, Denis Jouvet

    Abstract: The paper introduces Diff-Filter, a multichannel speech enhancement approach based on the diffusion probabilistic model, for improving speaker verification performance under noisy and reverberant conditions. It also presents a new two-step training procedure that takes advantage of self-supervised learning. In the first stage, the Diff-Filter is trained by conducting time-domain speech filtering…

    Submitted 5 July, 2023; originally announced July 2023.

  2. arXiv:2305.01759  [pdf, other]

    eess.AS cs.AI cs.CL

    Evaluation of Speaker Anonymization on Emotional Speech

    Authors: Hubert Nourtel, Pierre Champion, Denis Jouvet, Anthony Larcher, Marie Tahon

    Abstract: Speech data carries a range of personal information, such as the speaker's identity and emotional state. These attributes can be used for malicious purposes. With the development of virtual assistants, a new generation of privacy threats has emerged. Current studies have addressed the topic of preserving speech privacy. One of them, the VoicePrivacy initiative, aims to promote the development of pr…

    Submitted 15 April, 2023; originally announced May 2023.

    Journal ref: Proc. 2021 ISCA Symposium on Security and Privacy in Speech Communication (62-66)

  3. arXiv:2210.08834  [pdf]

    cs.SD cs.HC eess.AS

    How to Leverage DNN-based speech enhancement for multi-channel speaker verification?

    Authors: Sandipana Dowerah, Romain Serizel, Denis Jouvet, Mohammad Mohammadamini, Driss Matrouf

    Abstract: Speaker verification (SV) suffers from unsatisfactory performance in far-field scenarios due to environmental noise and the adverse impact of room reverberation. This work presents a benchmark of multichannel speech enhancement for far-field speaker verification. One approach is deep neural network-based, and the other is a combination of deep neural network and signal processing. We integrated a D…

    Submitted 17 October, 2022; originally announced October 2022.

    Journal ref: 4th International Conference on Advances in Signal Processing and Artificial Intelligence (ASPAI' 2022), Oct 2022, Corfu, Greece

  4. arXiv:2208.10497  [pdf, other]

    cs.SD cs.AI cs.CR cs.LG eess.AS

    Are disentangled representations all you need to build speaker anonymization systems?

    Authors: Pierre Champion, Denis Jouvet, Anthony Larcher

    Abstract: Speech signals contain a lot of sensitive information, such as the speaker's identity, which raises privacy concerns when speech data get collected. Speaker anonymization aims to transform a speech signal to remove the source speaker's identity while leaving the spoken content unchanged. Current methods perform the transformation by relying on content/speaker disentanglement and voice conversion.…

    Submitted 13 January, 2023; v1 submitted 22 August, 2022; originally announced August 2022.

    Journal ref: INTERSPEECH 2022 - Human and Humanizing Speech Technology, Sep 2022, Incheon, South Korea

  5. arXiv:2203.09518  [pdf, other]

    eess.AS cs.AI cs.CL cs.CR cs.SD

    Privacy-Preserving Speech Representation Learning using Vector Quantization

    Authors: Pierre Champion, Denis Jouvet, Anthony Larcher

    Abstract: With the popularity of virtual assistants (e.g., Siri, Alexa), the use of speech recognition is now becoming more and more widespread. However, speech signals contain a lot of sensitive information, such as the speaker's identity, which raises privacy concerns. The presented experiments show that the representations extracted by the deep layers of speech recognition networks contain speaker informat…

    Submitted 15 March, 2022; originally announced March 2022.

    Comments: Journées d'Études sur la Parole - JEP2022, Jun 2022, Île de Noirmoutier, France

  6. arXiv:2110.05431  [pdf, other]

    eess.AS cs.CR cs.LG cs.SD

    On the invertibility of a voice privacy system using embedding alignement

    Authors: Pierre Champion, Thomas Thebaud, Gaël Le Lan, Anthony Larcher, Denis Jouvet

    Abstract: This paper explores various attack scenarios on a voice anonymization system using embeddings alignment techniques. We use Wasserstein-Procrustes (an algorithm initially designed for unsupervised translation) or Procrustes analysis to match two sets of x-vectors, before and after voice anonymization, to mimic this transformation as a rotation function. We compute the optimal rotation and compare t…

    Submitted 8 October, 2021; originally announced October 2021.

    Journal ref: ASRU 2021 - IEEE Automatic Speech Recognition and Understanding Workshop, Dec 2021, Cartagena, Colombia

  7. arXiv:2109.11946  [pdf, other]

    cs.SD cs.AI cs.CR eess.AS

    Evaluating X-vector-based Speaker Anonymization under White-box Assessment

    Authors: Pierre Champion, Denis Jouvet, Anthony Larcher

    Abstract: In the scenario of the Voice Privacy challenge, anonymization is achieved by converting all utterances from a source speaker to match the same target identity, this identity being randomly selected. In this context, an attacker with maximum knowledge about the anonymization system cannot infer the target identity. This article proposes to constrain the target selection to a specific identity, i.e…

    Submitted 30 September, 2021; v1 submitted 24 September, 2021; originally announced September 2021.

    Journal ref: 23rd International Conference on Speech and Computer - SPECOM 2021, Sep 2021, Saint Petersburg, Russia

  8. arXiv:2101.08478  [pdf, other]

    eess.AS cs.CR cs.SD

    A Study of F0 Modification for X-Vector Based Speech Pseudonymization Across Gender

    Authors: Pierre Champion, Denis Jouvet, Anthony Larcher

    Abstract: Speech pseudonymization aims at altering a speech signal to map the identifiable personal characteristics of a given speaker to another identity. In other words, it aims to hide the source speaker identity while preserving the intelligibility of the spoken content. This study takes place in the VoicePrivacy 2020 challenge framework, where the baseline system performs pseudonymization by modifying…

    Submitted 21 January, 2021; originally announced January 2021.

    Journal ref: The Second AAAI Workshop on Privacy-Preserving Artificial Intelligence, Feb 2021, Nancy, France