Zum Hauptinhalt springen

Showing 1–2 of 2 results for author: Dantrey, A

Searching in archive cs. Search in all archives.
.
  1. CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram

    Authors: Zhifeng Kong, Wei Ping, Ambrish Dantrey, Bryan Catanzaro

    Abstract: In this work, we present CleanUNet 2, a speech denoising model that combines the advantages of waveform denoiser and spectrogram denoiser and achieves the best of both worlds. CleanUNet 2 uses a two-stage framework inspired by popular speech synthesis methods that consist of a waveform model and a spectrogram model. Specifically, CleanUNet 2 builds upon CleanUNet, the state-of-the-art waveform den… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: INTERSPEECH 2023

    Journal ref: Proc. INTERSPEECH 2023, pages 790--794

  2. arXiv:2202.07790  [pdf, other

    cs.SD cs.LG eess.AS

    Speech Denoising in the Waveform Domain with Self-Attention

    Authors: Zhifeng Kong, Wei Ping, Ambrish Dantrey, Bryan Catanzaro

    Abstract: In this work, we present CleanUNet, a causal speech denoising model on the raw waveform. The proposed model is based on an encoder-decoder architecture combined with several self-attention blocks to refine its bottleneck representations, which is crucial to obtain good results. The model is optimized through a set of losses defined over both waveform and multi-resolution spectrograms. The proposed… ▽ More

    Submitted 6 July, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: Published in ICASSP 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Listen to audio samples from CleanUNet at: https://cleanunet.github.io/