Zum Hauptinhalt springen

Showing 1–1 of 1 results for author: Yarmohammadi, M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2407.10303  [pdf, other

    eess.AS cs.CL

    Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation

    Authors: Ruizhe Huang, Mahsa Yarmohammadi, Sanjeev Khudanpur, Daniel Povey

    Abstract: Existing research suggests that automatic speech recognition (ASR) models can benefit from additional contexts (e.g., contact lists, user specified vocabulary). Rare words and named entities can be better recognized with contexts. In this work, we propose two simple yet effective techniques to improve context-aware ASR models. First, we inject contexts into the encoders at an early stage instead o… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: Accepted to INTERSPEECH 2024