Zum Hauptinhalt springen

Showing 1–3 of 3 results for author: Thulke, D

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.13337  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Medical Spoken Named Entity Recognition

    Authors: Khai Le-Duc, David Thulke, Hung-Phong Tran, Long Vo-Dang, Khai-Nguyen Nguyen, Truong-Son Hy, Ralf Schlüter

    Abstract: Spoken Named Entity Recognition (NER) aims to extracting named entities from speech and categorizing them into types like person, location, organization, etc. In this work, we present VietMed-NER - the first spoken NER dataset in the medical domain. To our best knowledge, our real-world dataset is the largest spoken NER dataset in the world in terms of the number of entity types, featuring 18 dist… ▽ More

    Submitted 20 July, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: Preprint, 41 pages

  2. arXiv:2210.13700  [pdf, other

    eess.AS cs.CL cs.LG

    Does Joint Training Really Help Cascaded Speech Translation?

    Authors: Viet Anh Khoa Tran, David Thulke, Yingbo Gao, Christian Herold, Hermann Ney

    Abstract: Currently, in speech translation, the straightforward approach - cascading a recognition system with a translation system - delivers state-of-the-art results. However, fundamental challenges such as error propagation from the automatic speech recognition system still remain. To mitigate these problems, recently, people turn their attention to direct data and propose various joint training methods.… ▽ More

    Submitted 24 November, 2022; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022

  3. arXiv:2104.10507  [pdf, ps, other

    cs.CL cs.SD eess.AS stat.ML

    On Sampling-Based Training Criteria for Neural Language Modeling

    Authors: Yingbo Gao, David Thulke, Alexander Gerstenberger, Khoa Viet Tran, Ralf Schlüter, Hermann Ney

    Abstract: As the vocabulary size of modern word-based language models becomes ever larger, many sampling-based training criteria are proposed and investigated. The essence of these sampling methods is that the softmax-related traversal over the entire vocabulary can be simplified, giving speedups compared to the baseline. A problem we notice about the current landscape of such sampling methods is the lack o… ▽ More

    Submitted 17 June, 2021; v1 submitted 21 April, 2021; originally announced April 2021.

    Comments: Accepted at INTERSPEECH 2021