Zum Hauptinhalt springen

Showing 1–6 of 6 results for author: Uchida, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.16261  [pdf, other

    cs.LG cs.AI

    Evaluating Time-Series Training Dataset through Lens of Spectrum in Deep State Space Models

    Authors: Sekitoshi Kanai, Yasutoshi Ida, Kazuki Adachi, Mihiro Uchida, Tsukasa Yoshida, Shin'ya Yamaguchi

    Abstract: This study investigates a method to evaluate time-series datasets in terms of the performance of deep neural networks (DNNs) with state space models (deep SSMs) trained on the dataset. SSMs have attracted attention as components inside DNNs to address time-series data. Since deep SSMs have powerful representation capacities, training datasets play a crucial role in solving a new task. However, the… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: 11 pages, 5 figures

  2. arXiv:2408.14788  [pdf, other

    cs.LG

    Learning from Complementary Features

    Authors: Kosuke Sugiyama, Masato Uchida

    Abstract: While precise data observation is essential for the learning processes of predictive models, it can be challenging owing to factors such as insufficient observation accuracy, high collection costs, and privacy constraints. In this paper, we examines cases where some qualitative features are unavailable as precise information indicating "what it is," but rather as complementary information indicati… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

    Comments: 16 pages, 7 figures

    MSC Class: 68T01 ACM Class: I.5

  3. arXiv:2306.02273  [pdf, ps, other

    cs.CL cs.SD eess.AS

    End-to-End Joint Target and Non-Target Speakers ASR

    Authors: Ryo Masumura, Naoki Makishima, Taiga Yamane, Yoshihiko Yamazaki, Saki Mizuno, Mana Ihori, Mihiro Uchida, Keita Suzuki, Hiroshi Sato, Tomohiro Tanaka, Akihiko Takashima, Satoshi Suzuki, Takafumi Moriya, Nobukatsu Hojo, Atsushi Ando

    Abstract: This paper proposes a novel automatic speech recognition (ASR) system that can transcribe individual speaker's speech while identifying whether they are target or non-target speakers from multi-talker overlapped speech. Target-speaker ASR systems are a promising way to only transcribe a target speaker's speech by enrolling the target speaker's information. However, in conversational ASR applicatio… ▽ More

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: Accepted at Interspeech 2023

  4. arXiv:2305.04539  [pdf, ps, other

    cs.LG stat.ML

    Q&A Label Learning

    Authors: Kota Kawamoto, Masato Uchida

    Abstract: Assigning labels to instances is crucial for supervised machine learning. In this paper, we proposed a novel annotation method called Q&A labeling, which involves a question generator that asks questions about the labels of the instances to be assigned, and an annotator who answers the questions and assigns the corresponding labels to the instances. We derived a generative model of labels assigned… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 46 pages, 5 figures

  5. arXiv:2202.09979  [pdf, other

    cs.CL cs.CV

    Audio Visual Scene-Aware Dialog Generation with Transformer-based Video Representations

    Authors: Yoshihiro Yamazaki, Shota Orihashi, Ryo Masumura, Mihiro Uchida, Akihiko Takashima

    Abstract: There have been many attempts to build multimodal dialog systems that can respond to a question about given audio-visual information, and the representative task for such systems is the Audio Visual Scene-Aware Dialog (AVSD). Most conventional AVSD models adopt the Convolutional Neural Network (CNN)-based video feature extractor to understand visual information. While a CNN tends to obtain both te… ▽ More

    Submitted 20 February, 2022; originally announced February 2022.

    Comments: Accepted at DSTC10 Workshop at AAAI 2022

  6. arXiv:2002.02158  [pdf, ps, other

    cs.LG stat.ML

    Bridging Ordinary-Label Learning and Complementary-Label Learning

    Authors: Yasuhiro Katsura, Masato Uchida

    Abstract: A supervised learning framework has been proposed for the situation where each training data is provided with a complementary label that represents a class to which the pattern does not belong. In the existing literature, complementary-label learning has been studied independently from ordinary-label learning, which assumes that each training data is provided with a label representing the class to… ▽ More

    Submitted 25 June, 2020; v1 submitted 6 February, 2020; originally announced February 2020.

    MSC Class: 68T10