Zum Hauptinhalt springen

Showing 1–3 of 3 results for author: Turetzky, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.12206  [pdf, other

    cs.CL cs.SD eess.AS

    A Language Modeling Approach to Diacritic-Free Hebrew TTS

    Authors: Amit Roth, Arnon Turetzky, Yossi Adi

    Abstract: We tackle the task of text-to-speech (TTS) in Hebrew. Traditional Hebrew contains Diacritics, which dictate the way individuals should pronounce given words, however, modern Hebrew rarely uses them. The lack of diacritics in modern Hebrew results in readers expected to conclude the correct pronunciation and understand which phonemes to use based on the context. This imposes a fundamental challenge… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Accepted at Interspeech24

  2. arXiv:2407.07566  [pdf, other

    cs.CL cs.SD eess.AS

    HebDB: a Weakly Supervised Dataset for Hebrew Speech Processing

    Authors: Arnon Turetzky, Or Tal, Yael Segal-Feldman, Yehoshua Dissen, Ella Zeldes, Amit Roth, Eyal Cohen, Yosi Shrem, Bronya R. Chernyak, Olga Seleznova, Joseph Keshet, Yossi Adi

    Abstract: We present HebDB, a weakly supervised dataset for spoken language processing in the Hebrew language. HebDB offers roughly 2500 hours of natural and spontaneous speech recordings in the Hebrew language, consisting of a large variety of speakers and topics. We provide raw recordings together with a pre-processed, weakly supervised, and filtered version. The goal of HebDB is to further enhance resear… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted at Interspeech2024

  3. Deep Audio Waveform Prior

    Authors: Arnon Turetzky, Tzvi Michelson, Yossi Adi, Shmuel Peleg

    Abstract: Convolutional neural networks contain strong priors for generating natural looking images [1]. These priors enable image denoising, super resolution, and inpainting in an unsupervised manner. Previous attempts to demonstrate similar ideas in audio, namely deep audio priors, (i) use hand picked architectures such as harmonic convolutions, (ii) only work with spectrogram input, and (iii) have been u… ▽ More

    Submitted 21 July, 2022; originally announced July 2022.

    Comments: Interspeech 2022