
Showing 1–12 of 12 results for author: Saabas, A

  1. arXiv:2406.09928  [pdf, other]

    cs.SD cs.AI cs.LG eess.AS

    Personalized Speech Enhancement Without a Separate Speaker Embedding Model

    Authors: Tanel Pärnamaa, Ando Saabas

    Abstract: Personalized speech enhancement (PSE) models can improve the audio quality of teleconferencing systems by adapting to the characteristics of a speaker's voice. However, most existing methods require a separate speaker embedding model to extract a vector representation of the speaker from enrollment audio, which adds complexity to the training and deployment process. We propose to use the internal…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted to Interspeech 2024

  2. arXiv:2402.16927  [pdf, ps, other]

    cs.SD eess.AS

    The ICASSP 2024 Audio Deep Packet Loss Concealment Challenge

    Authors: Lorenz Diener, Solomiya Branets, Ando Saabas, Ross Cutler

    Abstract: Audio packet loss concealment is the hiding of gaps in VoIP audio streams caused by network packet loss. With the ICASSP 2024 Audio Deep Packet Loss Concealment Grand Challenge, we build on the success of the previous Audio PLC Challenge held at INTERSPEECH 2022. We evaluate models on an overall harder dataset, and use the new ITU-T P.804 evaluation procedure to more closely evaluate the performan…

    Submitted 26 February, 2024; originally announced February 2024.

  3. arXiv:2401.14444  [pdf, other]

    cs.SD cs.AI cs.CV eess.AS

    ICASSP 2024 Speech Signal Improvement Challenge

    Authors: Nicolae Catalin Ristea, Ando Saabas, Ross Cutler, Babak Naderi, Sebastian Braun, Solomiya Branets

    Abstract: The ICASSP 2024 Speech Signal Improvement Grand Challenge is intended to stimulate research in the area of improving the speech signal quality in communication systems. This marks our second challenge, building upon the success from the previous ICASSP 2023 Grand Challenge. We enhance the competition by introducing a dataset synthesizer, enabling all participating teams to start at a higher baseli…

    Submitted 25 January, 2024; originally announced January 2024.

  4. arXiv:2309.12553  [pdf, other]

    eess.AS cs.SD

    ICASSP 2023 Acoustic Echo Cancellation Challenge

    Authors: Ross Cutler, Ando Saabas, Tanel Parnamaa, Marju Purin, Evgenii Indenbom, Nicolae-Catalin Ristea, Jegor Gužvin, Hannes Gamper, Sebastian Braun, Robert Aichner

    Abstract: The ICASSP 2023 Acoustic Echo Cancellation Challenge is intended to stimulate research in acoustic echo cancellation (AEC), which is an important area of speech enhancement and is still a top issue in audio communication. This is the fourth AEC challenge and it is enhanced by adding a second track for personalized acoustic echo cancellation, reducing the algorithmic + buffering latency to 20ms, as…

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2202.13290, arXiv:2009.04972

  5. arXiv:2306.03177  [pdf, other]

    cs.SD cs.CV eess.AS

    DeepVQE: Real Time Deep Voice Quality Enhancement for Joint Acoustic Echo Cancellation, Noise Suppression and Dereverberation

    Authors: Evgenii Indenbom, Nicolae-Catalin Ristea, Ando Saabas, Tanel Parnamaa, Jegor Guzvin, Ross Cutler

    Abstract: Acoustic echo cancellation (AEC), noise suppression (NS) and dereverberation (DR) are an integral part of modern full-duplex communication systems. As the demand for teleconferencing systems increases, addressing these tasks is required for an effective and efficient online meeting experience. Most prior research proposes solutions for these tasks separately, combining them with digital signal pro…

    Submitted 5 June, 2023; originally announced June 2023.

  6. arXiv:2305.15127  [pdf, other]

    cs.SD eess.AS

    PLCMOS -- a data-driven non-intrusive metric for the evaluation of packet loss concealment algorithms

    Authors: Lorenz Diener, Marju Purin, Sten Sootla, Ando Saabas, Robert Aichner, Ross Cutler

    Abstract: Speech quality assessment is a problem for every researcher working on models that produce or process speech. Human subjective ratings, the gold standard in speech quality assessment, are expensive and time-consuming to acquire in a quantity that is sufficient to get reliable data, while automated objective metrics show a low correlation with gold standard ratings. This paper presents PLCMOS, a no…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: to appear: INTERSPEECH 2023, associated model release: https://aka.ms/PLCMOS

  7. arXiv:2208.11308  [pdf, other]

    cs.SD cs.CV eess.AS

    Deep model with built-in cross-attention alignment for acoustic echo cancellation

    Authors: Evgenii Indenbom, Nicolae-Cătălin Ristea, Ando Saabas, Tanel Pärnamaa, Jegor Gužvin

    Abstract: With recent research advances, deep learning models have become an attractive choice for acoustic echo cancellation (AEC) in real-time teleconferencing applications. Since acoustic echo is one of the major sources of poor audio quality, a wide variety of deep models have been proposed. However, an important but often omitted requirement for good echo cancellation quality is the synchronization of…

    Submitted 14 March, 2023; v1 submitted 24 August, 2022; originally announced August 2022.

  8. arXiv:2204.05222  [pdf, other]

    cs.SD eess.AS

    INTERSPEECH 2022 Audio Deep Packet Loss Concealment Challenge

    Authors: Lorenz Diener, Sten Sootla, Solomiya Branets, Ando Saabas, Robert Aichner, Ross Cutler

    Abstract: Audio Packet Loss Concealment (PLC) is the hiding of gaps in audio streams caused by data transmission failures in packet switched networks. This is a common problem, and of increasing importance as end-to-end VoIP telephony and teleconference systems become the default and ever more widely used form of communication in business as well as in personal usage. This paper presents the INTERSPEECH 202…

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: 4 pages + 1 page references, 1 figure, 2 tables. Submitted to INTERSPEECH 2022

  9. arXiv:2202.13290  [pdf, other]

    eess.AS cs.SD

    ICASSP 2022 Acoustic Echo Cancellation Challenge

    Authors: Ross Cutler, Ando Saabas, Tanel Parnamaa, Marju Purin, Hannes Gamper, Sebastian Braun, Karsten Sørensen, Robert Aichner

    Abstract: The ICASSP 2022 Acoustic Echo Cancellation Challenge is intended to stimulate research in acoustic echo cancellation (AEC), which is an important area of speech enhancement and still a top issue in audio communication. This is the third AEC challenge and it is enhanced by including mobile scenarios, adding speech recognition rate in the challenge goal metrics, and making the default sample rate 48…

    Submitted 26 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2009.04972

  10. arXiv:2110.03010  [pdf, other]

    eess.AS cs.SD

    AECMOS: A speech quality assessment metric for echo impairment

    Authors: Marju Purin, Sten Sootla, Mateja Sponza, Ando Saabas, Ross Cutler

    Abstract: Traditionally, the quality of acoustic echo cancellers is evaluated using intrusive speech quality assessment measures such as ERLE and PESQ, or by carrying out subjective laboratory tests. Unfortunately, the former are not well correlated with human subjective measures, while the latter are time and resource consuming to carry out. We provide a new tool for speech quality…

    Submitted 27 January, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

  11. arXiv:2010.13063  [pdf, other]

    eess.AS cs.SD

    Crowdsourcing approach for subjective evaluation of echo impairment

    Authors: Ross Cutler, Babak Naderi, Markus Loide, Sten Sootla, Ando Saabas

    Abstract: The quality of acoustic echo cancellers (AECs) in real-time communication systems is typically evaluated using objective metrics like ERLE and PESQ, and less commonly with lab-based subjective tests like ITU-T Rec. P.831. We will show that these objective measures are not well correlated to subjective measures. We then introduce an open-source crowdsourcing approach for subjective evaluation of ec…

    Submitted 27 February, 2022; v1 submitted 25 October, 2020; originally announced October 2020.

  12. arXiv:2009.04972  [pdf, other]

    eess.AS cs.SD

    ICASSP 2021 Acoustic Echo Cancellation Challenge: Datasets, Testing Framework, and Results

    Authors: Kusha Sridhar, Ross Cutler, Ando Saabas, Tanel Parnamaa, Markus Loide, Hannes Gamper, Sebastian Braun, Robert Aichner, Sriram Srinivasan

    Abstract: The ICASSP 2021 Acoustic Echo Cancellation Challenge is intended to stimulate research in the area of acoustic echo cancellation (AEC), which is an important part of speech enhancement and still a top issue in audio communication and conferencing systems. Many recent AEC studies report good performance on synthetic datasets where the train and test samples come from the same underlying distributio…

    Submitted 30 October, 2020; v1 submitted 10 September, 2020; originally announced September 2020.