Zum Hauptinhalt springen

Showing 1–2 of 2 results for author: Deoghare, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.11312  [pdf, other

    cs.CL

    APE-then-QE: Correcting then Filtering Pseudo Parallel Corpora for MT Training Data Creation

    Authors: Akshay Batheja, Sourabh Deoghare, Diptesh Kanojia, Pushpak Bhattacharyya

    Abstract: Automatic Post-Editing (APE) is the task of automatically identifying and correcting errors in the Machine Translation (MT) outputs. We propose a repair-filter-use methodology that uses an APE system to correct errors on the target side of the MT training data. We select the sentence pairs from the original and corrected sentence pairs based on the quality scores computed using a Quality Estimatio… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: arXiv admin note: text overlap with arXiv:2306.03507

  2. arXiv:2305.12518  [pdf, other

    cs.CL

    VAKTA-SETU: A Speech-to-Speech Machine Translation Service in Select Indic Languages

    Authors: Shivam Mhaskar, Vineet Bhat, Akshay Batheja, Sourabh Deoghare, Paramveer Choudhary, Pushpak Bhattacharyya

    Abstract: In this work, we present our deployment-ready Speech-to-Speech Machine Translation (SSMT) system for English-Hindi, English-Marathi, and Hindi-Marathi language pairs. We develop the SSMT system by cascading Automatic Speech Recognition (ASR), Disfluency Correction (DC), Machine Translation (MT), and Text-to-Speech Synthesis (TTS) models. We discuss the challenges faced during the research and deve… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.