Zum Hauptinhalt springen

Showing 1–2 of 2 results for author: Kruchinin, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.05199  [pdf, other

    eess.AS cs.SD

    XANE: eXplainable Acoustic Neural Embeddings

    Authors: Sri Harsha Dumpala, Dushyant Sharma, Chandramouli Shama Sastri, Stanislav Kruchinin, James Fosburgh, Patrick A. Naylor

    Abstract: We present a novel method for extracting neural embeddings that model the background acoustics of a speech signal. The extracted embeddings are used to estimate specific parameters related to the background acoustic properties of the signal in a non-intrusive manner, which allows the embeddings to be explainable in terms of those parameters. We illustrate the value of these embeddings by performin… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  2. arXiv:2203.13919  [pdf

    eess.AS cs.AI

    Spatial Processing Front-End For Distant ASR Exploiting Self-Attention Channel Combinator

    Authors: Dushyant Sharma, Rong Gong, James Fosburgh, Stanislav Yu. Kruchinin, Patrick A. Naylor, Ljubomir Milanovic

    Abstract: We present a novel multi-channel front-end based on channel shortening with theWeighted Prediction Error (WPE) method followed by a fixed MVDR beamformer used in combination with a recently proposed self-attention-based channel combination (SACC) scheme, for tackling the distant ASR problem. We show that the proposed system used as part of a ContextNet based end-to-end (E2E) ASR system outperforms… ▽ More

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: to be presented at ICASSP 2022