Zum Hauptinhalt springen

Showing 1–4 of 4 results for author: Imamura, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17722  [pdf, other

    cs.SD eess.AS

    Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals

    Authors: Kentaro Seki, Shinnosuke Takamichi, Norihiro Takamune, Yuki Saito, Kanami Imamura, Hiroshi Saruwatari

    Abstract: This paper proposes a new task called spatial voice conversion, which aims to convert a target voice while preserving spatial information and non-target signals. Traditional voice conversion methods focus on single-channel waveforms, ignoring the stereo listening experience inherent in human hearing. Our baseline approach addresses this gap by integrating blind source separation (BSS), voice conve… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Accepted to Interspeech 2024

  2. Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides

    Authors: Kanami Imamura, Tomohiko Nakamura, Norihiro Takamune, Kohei Yatabe, Hiroshi Saruwatari

    Abstract: In this paper, we propose algorithms for handling non-integer strides in sampling-frequency-independent (SFI) convolutional and transposed convolutional layers. The SFI layers have been developed for handling various sampling frequencies (SFs) by a single neural network. They are replaceable with their non-SFI counterparts and can be introduced into various network architectures. However, they cou… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: 5 pages, 3 figures, accepted for European Signal Processing Conference 2023 (EUSIPCO 2023)

    Journal ref: European Signal Processing Conference, Sep. 2023, pp. 326--330

  3. arXiv:2211.15965  [pdf, ps, other

    cs.CL

    Extending the Subwording Model of Multilingual Pretrained Models for New Languages

    Authors: Kenji Imamura, Eiichiro Sumita

    Abstract: Multilingual pretrained models are effective for machine translation and cross-lingual processing because they contain multiple languages in one model. However, they are pretrained after their tokenizers are fixed; therefore it is difficult to change the vocabulary after pretraining. When we extend the pretrained models to new languages, we must modify the tokenizers simultaneously. In this paper,… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: Code: https://github.com/kenji-imamura/sentpiece_mimic

  4. arXiv:1907.03060  [pdf, ps, other

    cs.CL

    Exploiting Out-of-Domain Parallel Data through Multilingual Transfer Learning for Low-Resource Neural Machine Translation

    Authors: Aizhan Imankulova, Raj Dabre, Atsushi Fujita, Kenji Imamura

    Abstract: This paper proposes a novel multilingual multistage fine-tuning approach for low-resource neural machine translation (NMT), taking a challenging Japanese--Russian pair for benchmarking. Although there are many solutions for low-resource scenarios, such as multilingual NMT and back-translation, we have empirically confirmed their limited success when restricted to in-domain data. We therefore propo… ▽ More

    Submitted 5 July, 2019; originally announced July 2019.

    Comments: Accepted at the 17th Machine Translation Summit