Showing 1–2 of 2 results for author: Bremerman, J

Searching in archive cs.
  1. arXiv:2205.12514  [pdf, other]

    cs.CL

    Machine Translation Robustness to Natural Asemantic Variation

    Authors: Jacob Bremerman, Xiang Ren, Jonathan May

    Abstract: Current Machine Translation (MT) models still struggle with more challenging input, such as noisy data and tail-end words and phrases. Several works have addressed this robustness issue by identifying specific categories of noise and variation then tuning models to perform better on them. An important yet under-studied category involves minor variations in nuance (non-typos) that preserve meaning…

    Submitted 9 November, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted to EMNLP 2022

  2. arXiv:2102.01757  [pdf, other]

    cs.CL

    The Multilingual TEDx Corpus for Speech Recognition and Translation

    Authors: Elizabeth Salesky, Matthew Wiesner, Jacob Bremerman, Roldano Cattoni, Matteo Negri, Marco Turchi, Douglas W. Oard, Matt Post

    Abstract: We present the Multilingual TEDx corpus, built to support speech recognition (ASR) and speech translation (ST) research across many non-English source languages. The corpus is a collection of audio recordings from TEDx talks in 8 source languages. We segment transcripts into sentences and align them to the source-language audio and target-language translations. The corpus is released along with op…

    Submitted 14 June, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

    Comments: Accepted to Interspeech 2021