Skip to main content

Showing 1–3 of 3 results for author: Imai, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.04047  [pdf, other

    cs.CL cs.SD eess.AS

    Improving Accented Speech Recognition using Data Augmentation based on Unsupervised Text-to-Speech Synthesis

    Authors: Cong-Thanh Do, Shuhei Imai, Rama Doddipatla, Thomas Hain

    Abstract: This paper investigates the use of unsupervised text-to-speech synthesis (TTS) as a data augmentation method to improve accented speech recognition. TTS systems are trained with a small amount of accented speech training data and their pseudo-labels rather than manual transcriptions, and hence unsupervised. This approach enables the use of accented speech data without manual transcriptions to perf… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Accepted to EUSIPCO 2024

  2. arXiv:2109.14549  [pdf, other

    cs.RO cs.CV cs.LG

    Vision-Guided Quadrupedal Locomotion in the Wild with Multi-Modal Delay Randomization

    Authors: Chieko Sarah Imai, Minghao Zhang, Yuchen Zhang, Marcin Kierebinski, Ruihan Yang, Yuzhe Qin, Xiaolong Wang

    Abstract: Developing robust vision-guided controllers for quadrupedal robots in complex environments, with various obstacles, dynamical surroundings and uneven terrains, is very challenging. While Reinforcement Learning (RL) provides a promising paradigm for agile locomotion skills with vision inputs in simulation, it is still very challenging to deploy the RL policy in the real world. Our key insight is th… ▽ More

    Submitted 23 July, 2022; v1 submitted 29 September, 2021; originally announced September 2021.

    Comments: IROS 2022, Project page: https://mehooz.github.io/mmdr-wild/

  3. arXiv:2003.01310  [pdf, other

    cs.DC cs.NI

    Performance Optimization for Edge-Cloud Serverless Platforms via Dynamic Task Placement

    Authors: Anirban Das, Shigeru Imai, Mike P. Wittie, Stacy Patterson

    Abstract: We present a framework for performance optimization in serverless edge-cloud platforms using dynamic task placement. We focus on applications for smart edge devices, for example, smart cameras or speakers, that need to perform processing tasks on input data in real to near-real time. Our framework allows the user to specify cost and latency requirements for each application task, and for each inpu… ▽ More

    Submitted 19 May, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: 10 pages, 6 figures, 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing