Zum Hauptinhalt springen

Showing 1–33 of 33 results for author: Post, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.11853  [pdf, other

    cs.CL

    PyMarian: Fast Neural Machine Translation and Evaluation in Python

    Authors: Thamme Gowda, Roman Grundkiewicz, Elijah Rippeth, Matt Post, Marcin Junczys-Dowmunt

    Abstract: The deep learning language of choice these days is Python; measured by factors such as available libraries and technical support, it is hard to beat. At the same time, software written in lower-level programming languages like C++ retain advantages in speed. We describe a Python interface to Marian NMT, a C++-based training and inference toolkit for sequence-to-sequence models, focusing on machine… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

  2. arXiv:2406.03869  [pdf, other

    cs.CL

    Recovering document annotations for sentence-level bitext

    Authors: Rachel Wicks, Matt Post, Philipp Koehn

    Abstract: Data availability limits the scope of any given task. In machine translation, historical models were incapable of handling longer contexts, so the lack of document-level datasets was less noticeable. Now, despite the emergence of long-sequence methods, we remain within a sentence-level paradigm and without data to adequately approach context-aware machine translation. Most large-scale datasets hav… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: ACL 2024 Findings

  3. arXiv:2401.06760  [pdf, other

    cs.CL

    Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies

    Authors: Tom Kocmi, Vilém Zouhar, Christian Federmann, Matt Post

    Abstract: Ten years ago a single metric, BLEU, governed progress in machine translation research. For better or worse, there is no such consensus today, and consequently it is difficult for researchers to develop and retain the kinds of heuristic intuitions about metric deltas that drove earlier research and deployment decisions. This paper investigates the "dynamic range" of a number of modern metrics in a… ▽ More

    Submitted 10 June, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

  4. arXiv:2311.15507  [pdf, other

    cs.CL cs.AI

    Improving Word Sense Disambiguation in Neural Machine Translation with Salient Document Context

    Authors: Elijah Rippeth, Marine Carpuat, Kevin Duh, Matt Post

    Abstract: Lexical ambiguity is a challenging and pervasive problem in machine translation (\mt). We introduce a simple and scalable approach to resolve translation ambiguity by incorporating a small amount of extra-sentential context in neural \mt. Our approach requires no sense annotation and no change to standard model architectures. Since actual document context is not available for the vast majority of… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

  5. arXiv:2311.02321  [pdf, other

    cs.CL

    Identifying Context-Dependent Translations for Evaluation Set Production

    Authors: Rachel Wicks, Matt Post

    Abstract: A major impediment to the transition to context-aware machine translation is the absence of good evaluation metrics and test sets. Sentences that require context to be translated correctly are rare in test sets, reducing the utility of standard corpus-level metrics such as COMET or BLEU. On the other hand, datasets that annotate such sentences are also rare, small in scale, and available for only… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: WMT 2023 Proceedings

  6. arXiv:2309.08832  [pdf, other

    cs.CL cs.AI

    SLIDE: Reference-free Evaluation for Machine Translation using a Sliding Document Window

    Authors: Vikas Raunak, Tom Kocmi, Matt Post

    Abstract: Reference-based metrics that operate at the sentence-level typically outperform quality estimation metrics, which have access only to the source and system output. This is unsurprising, since references resolve ambiguities that may be present in the source. In this paper, we investigate whether additional source context can effectively substitute for a reference. We present a metric named SLIDE (S… ▽ More

    Submitted 2 April, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: NAACL 2024

  7. arXiv:2308.07489  [pdf, other

    cs.CL

    SOTASTREAM: A Streaming Approach to Machine Translation Training

    Authors: Matt Post, Thamme Gowda, Roman Grundkiewicz, Huda Khayrallah, Rohit Jain, Marcin Junczys-Dowmunt

    Abstract: Many machine translation toolkits make use of a data preparation step wherein raw data is transformed into a tensor format that can be used directly by the trainer. This preparation step is increasingly at odds with modern research and development practices because this process produces a static, unchangeable version of the training data, making common training-time needs difficult (e.g., subword… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  8. arXiv:2305.16806  [pdf, other

    cs.CL cs.AI

    Do GPTs Produce Less Literal Translations?

    Authors: Vikas Raunak, Arul Menezes, Matt Post, Hany Hassan Awadalla

    Abstract: Large Language Models (LLMs) such as GPT-3 have emerged as general-purpose language models capable of addressing many natural language generation or understanding tasks. On the task of Machine Translation (MT), multiple works have investigated few-shot prompting mechanisms to elicit better translations from LLMs. However, there has been relatively little investigation on how such translations diff… ▽ More

    Submitted 5 June, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  9. arXiv:2305.14280  [pdf, other

    cs.CL

    Multilingual Pixel Representations for Translation and Effective Cross-lingual Transfer

    Authors: Elizabeth Salesky, Neha Verma, Philipp Koehn, Matt Post

    Abstract: We introduce and demonstrate how to effectively train multilingual machine translation models with pixel representations. We experiment with two different data settings with a variety of language and script coverage, demonstrating improved performance compared to subword embeddings. We explore various properties of pixel representations such as parameter sharing within and across scripts to better… ▽ More

    Submitted 24 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  10. arXiv:2304.12959  [pdf, other

    cs.CL

    Escaping the sentence-level paradigm in machine translation

    Authors: Matt Post, Marcin Junczys-Dowmunt

    Abstract: It is well-known that document context is vital for resolving a range of translation ambiguities, and in fact the document setting is the most natural setting for nearly all translation. It is therefore unfortunate that machine translation -- both research and production -- largely remains stuck in a decades-old sentence-level translation paradigm. It is also an increasingly glaring problem in lig… ▽ More

    Submitted 16 May, 2024; v1 submitted 25 April, 2023; originally announced April 2023.

  11. arXiv:2212.00006  [pdf, other

    cs.HC cs.CL cs.CV cs.CY

    Operationalizing Specifications, In Addition to Test Sets for Evaluating Constrained Generative Models

    Authors: Vikas Raunak, Matt Post, Arul Menezes

    Abstract: In this work, we present some recommendations on the evaluation of state-of-the-art generative models for constrained generation tasks. The progress on generative models has been rapid in recent years. These large-scale models have had three impacts: firstly, the fluency of generation in both language and vision modalities has rendered common average-case evaluation metrics much less useful in dia… ▽ More

    Submitted 19 November, 2022; originally announced December 2022.

    Comments: NeurIPS 2022 Workshop on Human Evaluation of Generative Models

  12. arXiv:2210.12727  [pdf, other

    cs.CL

    Additive Interventions Yield Robust Multi-Domain Machine Translation Models

    Authors: Elijah Rippeth, Matt Post

    Abstract: Additive interventions are a recently-proposed mechanism for controlling target-side attributes in neural machine translation. In contrast to tag-based approaches which manipulate the raw source sequence, interventions work by directly modulating the encoder representation of all tokens in the sequence. We examine the role of additive interventions in a large-scale multi-domain machine translation… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: 7 pages, 7 figures, WMT22 (Research Track)

  13. arXiv:2205.09988  [pdf, other

    cs.CL cs.AI

    SALTED: A Framework for SAlient Long-Tail Translation Error Detection

    Authors: Vikas Raunak, Matt Post, Arul Menezes

    Abstract: Traditional machine translation (MT) metrics provide an average measure of translation quality that is insensitive to the long tail of behavioral problems in MT. Examples include translation of numbers, physical units, dropped content and hallucinations. These errors, which occur rarely and unpredictably in Neural Machine Translation (NMT), greatly undermine the reliability of state-of-the-art MT… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

  14. arXiv:2204.05352  [pdf, other

    cs.CL eess.AS

    Large-Scale Streaming End-to-End Speech Translation with Neural Transducers

    Authors: Jian Xue, Peidong Wang, Jinyu Li, Matt Post, Yashesh Gaur

    Abstract: Neural transducers have been widely used in automatic speech recognition (ASR). In this paper, we introduce it to streaming end-to-end speech translation (ST), which aims to convert audio signals to texts in other languages directly. Compared with cascaded ST that performs ASR followed by text-based machine translation (MT), the proposed Transformer transducer (TT)-based ST model drastically reduc… ▽ More

    Submitted 1 July, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: The paper was submitted to Interspeech 2022

  15. arXiv:2109.08724  [pdf, other

    cs.CL

    The JHU-Microsoft Submission for WMT21 Quality Estimation Shared Task

    Authors: Shuoyang Ding, Marcin Junczys-Dowmunt, Matt Post, Christian Federmann, Philipp Koehn

    Abstract: This paper presents the JHU-Microsoft joint submission for WMT 2021 quality estimation shared task. We only participate in Task 2 (post-editing effort estimation) of the shared task, focusing on the target-side word-level quality estimation. The techniques we experimented with include Levenshtein Transformer training and data augmentation with a combination of forward, backward, round-trip transla… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: 7 Pages, Accepted to WMT21 (System Description)

  16. arXiv:2109.05611  [pdf, other

    cs.CL

    Levenshtein Training for Word-level Quality Estimation

    Authors: Shuoyang Ding, Marcin Junczys-Dowmunt, Matt Post, Philipp Koehn

    Abstract: We propose a novel scheme to use the Levenshtein Transformer to perform the task of word-level quality estimation. A Levenshtein Transformer is a natural fit for this task: trained to perform decoding in an iterative manner, a Levenshtein Transformer can learn to post-edit without explicit supervision. To further minimize the mismatch between the translation task and the word-level QE task, we pro… ▽ More

    Submitted 15 September, 2021; v1 submitted 12 September, 2021; originally announced September 2021.

    Comments: 10 pages, 1 figure, Accepted to EMNLP 2021. Fixed a minor typo in Table 2 (en-zh WMT20 best result)

  17. arXiv:2104.08211  [pdf

    cs.CL

    Robust Open-Vocabulary Translation from Visual Text Representations

    Authors: Elizabeth Salesky, David Etter, Matt Post

    Abstract: Machine translation models have discrete vocabularies and commonly use subword segmentation techniques to achieve an 'open vocabulary.' This approach relies on consistent and correct underlying unicode sequences, and makes models susceptible to degradation from common types of noise and variation. Motivated by the robustness of human language processing, we propose the use of visual text represent… ▽ More

    Submitted 9 December, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: EMNLP 2021 + additional appendix

  18. arXiv:2102.01757  [pdf, other

    cs.CL

    The Multilingual TEDx Corpus for Speech Recognition and Translation

    Authors: Elizabeth Salesky, Matthew Wiesner, Jacob Bremerman, Roldano Cattoni, Matteo Negri, Marco Turchi, Douglas W. Oard, Matt Post

    Abstract: We present the Multilingual TEDx corpus, built to support speech recognition (ASR) and speech translation (ST) research across many non-English source languages. The corpus is a collection of audio recordings from TEDx talks in 8 source languages. We segment transcripts into sentences and align them to the source-language audio and target-language translations. The corpus is released along with op… ▽ More

    Submitted 14 June, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

    Comments: Accepted to Interspeech 2021

  19. arXiv:2008.04935  [pdf, other

    cs.CL

    Paraphrase Generation as Zero-Shot Multilingual Translation: Disentangling Semantic Similarity from Lexical and Syntactic Diversity

    Authors: Brian Thompson, Matt Post

    Abstract: Recent work has shown that a multilingual neural machine translation (NMT) model can be used to judge how well a sentence paraphrases another sentence in the same language (Thompson and Post, 2020); however, attempting to generate paraphrases from such a model using standard beam search produces trivial copies or near copies. We introduce a simple paraphrase generation algorithm which discourages… ▽ More

    Submitted 27 October, 2020; v1 submitted 11 August, 2020; originally announced August 2020.

    Comments: WMT2020

  20. arXiv:2004.14989  [pdf, ps, other

    cs.CL

    A Study in Improving BLEU Reference Coverage with Diverse Automatic Paraphrasing

    Authors: Rachel Bawden, Biao Zhang, Lisa Yankovskaya, Andre Tättar, Matt Post

    Abstract: We investigate a long-perceived shortcoming in the typical use of BLEU: its reliance on a single reference. Using modern neural paraphrasing techniques, we study whether automatically generating additional diverse references can provide better coverage of the space of valid translations and thereby improve its correlation with human judgments. Our experiments on the into-English language direction… ▽ More

    Submitted 8 October, 2020; v1 submitted 30 April, 2020; originally announced April 2020.

    Comments: Accepted in the Findings of EMNLP 2020

  21. arXiv:2004.14564  [pdf, other

    cs.CL

    Automatic Machine Translation Evaluation in Many Languages via Zero-Shot Paraphrasing

    Authors: Brian Thompson, Matt Post

    Abstract: We frame the task of machine translation evaluation as one of scoring machine translation output with a sequence-to-sequence paraphraser, conditioned on a human reference. We propose training the paraphraser as a multilingual NMT system, treating paraphrasing as a zero-shot translation task (e.g., Czech to Czech). This results in the paraphraser's output mode being centered around a copy of the in… ▽ More

    Submitted 27 October, 2020; v1 submitted 29 April, 2020; originally announced April 2020.

    Comments: EMNLP2020

  22. Simulated Multiple Reference Training Improves Low-Resource Machine Translation

    Authors: Huda Khayrallah, Brian Thompson, Matt Post, Philipp Koehn

    Abstract: Many valid translations exist for a given sentence, yet machine translation (MT) is trained with a single reference translation, exacerbating data sparsity in low-resource settings. We introduce Simulated Multiple Reference Training (SMRT), a novel MT training method that approximates the full space of possible translations by sampling a paraphrase of the reference sentence from a paraphraser and… ▽ More

    Submitted 13 October, 2020; v1 submitted 29 April, 2020; originally announced April 2020.

    Comments: EMNLP 2020 camera ready

  23. arXiv:1911.11690  [pdf, other

    cs.SE

    Generating Commit Messages from Git Diffs

    Authors: S. R. P. van Hal, M. Post, K. Wendel

    Abstract: Commit messages aid developers in their understanding of a continuously evolving codebase. However, developers not always document code changes properly. Automatically generating commit messages would relieve this burden on developers. Recently, a number of different works have demonstrated the feasibility of using methods from neural machine translation to generate commit messages. This work aims… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

  24. arXiv:1909.00444  [pdf, other

    cs.CL

    A Discriminative Neural Model for Cross-Lingual Word Alignment

    Authors: Elias Stengel-Eskin, Tzu-Ray Su, Matt Post, Benjamin Van Durme

    Abstract: We introduce a novel discriminative word alignment model, which we integrate into a Transformer-based machine translation model. In experiments based on a small number of labeled examples (~1.7K-5K sentences) we evaluate its performance intrinsically on both English-Chinese and English-Arabic alignment, where we achieve major improvements over unsupervised baselines (11-27 F1). We evaluate the mod… ▽ More

    Submitted 1 September, 2019; originally announced September 2019.

    Comments: EMNLP 2019

  25. arXiv:1904.05506  [pdf, other

    cs.LG cs.CL stat.ML

    Membership Inference Attacks on Sequence-to-Sequence Models: Is My Data In Your Machine Translation System?

    Authors: Sorami Hisamoto, Matt Post, Kevin Duh

    Abstract: Data privacy is an important issue for "machine learning as a service" providers. We focus on the problem of membership inference attacks: given a data sample and black-box access to a model's API, determine whether the sample existed in the model's training data. Our contribution is an investigation of this problem in the context of sequence-to-sequence models, which are important in applications… ▽ More

    Submitted 16 March, 2020; v1 submitted 10 April, 2019; originally announced April 2019.

    Journal ref: Tansactions of the Association for Computational Linguistics (TACL) Volume 8, 2020 p.49-63

  26. arXiv:1901.03644  [pdf, other

    cs.CL

    ParaBank: Monolingual Bitext Generation and Sentential Paraphrasing via Lexically-constrained Neural Machine Translation

    Authors: J. Edward Hu, Rachel Rudinger, Matt Post, Benjamin Van Durme

    Abstract: We present ParaBank, a large-scale English paraphrase dataset that surpasses prior work in both quantity and quality. Following the approach of ParaNMT, we train a Czech-English neural machine translation (NMT) system to generate novel paraphrases of English reference sentences. By adding lexical constraints to the NMT decoding procedure, however, we are able to produce multiple high-quality sente… ▽ More

    Submitted 11 January, 2019; originally announced January 2019.

    Comments: To be presented at AAAI 2019. 8 pages

  27. arXiv:1804.08771  [pdf, other

    cs.CL

    A Call for Clarity in Reporting BLEU Scores

    Authors: Matt Post

    Abstract: The field of machine translation faces an under-recognized problem because of inconsistency in the reporting of scores from its dominant metric. Although people refer to "the" BLEU score, BLEU is in fact a parameterized metric whose values can vary wildly with changes to these parameters. These parameters are often not reported or are hard to find, and consequently, BLEU scores between papers cann… ▽ More

    Submitted 12 September, 2018; v1 submitted 23 April, 2018; originally announced April 2018.

    Comments: 6 pages, 1 figure

    Journal ref: Proceedings of the Third Conference on Machine Translation (WMT18). 2018

  28. arXiv:1804.06609  [pdf, other

    cs.CL

    Fast Lexically Constrained Decoding with Dynamic Beam Allocation for Neural Machine Translation

    Authors: Matt Post, David Vilar

    Abstract: The end-to-end nature of neural machine translation (NMT) removes many ways of manually guiding the translation process that were available in older paradigms. Recent work, however, has introduced a new capability: lexically constrained or guided decoding, a modification to beam search that forces the inclusion of pre-specified words and phrases in the output. However, while theoretically sound, e… ▽ More

    Submitted 9 November, 2018; v1 submitted 18 April, 2018; originally announced April 2018.

    Comments: 11 pages, 9 figures, Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)

  29. arXiv:1712.05690  [pdf, other

    cs.CL cs.LG stat.ML

    Sockeye: A Toolkit for Neural Machine Translation

    Authors: Felix Hieber, Tobias Domhan, Michael Denkowski, David Vilar, Artem Sokolov, Ann Clifton, Matt Post

    Abstract: We describe Sockeye (version 1.12), an open-source sequence-to-sequence toolkit for Neural Machine Translation (NMT). Sockeye is a production-ready framework for training and applying models as well as an experimental platform for researchers. Written in Python and built on MXNet, the toolkit offers scalable training and inference for the three most prominent encoder-decoder architectures: attenti… ▽ More

    Submitted 1 June, 2018; v1 submitted 15 December, 2017; originally announced December 2017.

  30. arXiv:1707.00299  [pdf, ps, other

    cs.CL

    Grammatical Error Correction with Neural Reinforcement Learning

    Authors: Keisuke Sakaguchi, Matt Post, Benjamin Van Durme

    Abstract: We propose a neural encoder-decoder model with reinforcement learning (NRL) for grammatical error correction (GEC). Unlike conventional maximum likelihood estimation (MLE), the model directly optimizes towards an objective that considers a sentence-level, task-specific evaluation metric, avoiding the exposure bias issue in MLE. We demonstrate that NRL outperforms MLE both in human and automated ev… ▽ More

    Submitted 2 July, 2017; originally announced July 2017.

  31. arXiv:1706.00321  [pdf, other

    cs.CL

    Using of heterogeneous corpora for training of an ASR system

    Authors: Jan Trmal, Gaurav Kumar, Vimal Manohar, Sanjeev Khudanpur, Matt Post, Paul McNamee

    Abstract: The paper summarizes the development of the LVCSR system built as a part of the Pashto speech-translation system at the SCALE (Summer Camp for Applied Language Exploration) 2015 workshop on "Speech-to-text-translation for low-resource languages". The Pashto language was chosen as a good "proxy" low-resource language, exhibiting multiple phenomena which make the speech-recognition and and speech-to… ▽ More

    Submitted 1 June, 2017; originally announced June 2017.

  32. arXiv:1608.02214  [pdf, other

    cs.CL

    Robsut Wrod Reocginiton via semi-Character Recurrent Neural Network

    Authors: Keisuke Sakaguchi, Kevin Duh, Matt Post, Benjamin Van Durme

    Abstract: Language processing mechanism by humans is generally more robust than computers. The Cmabrigde Uinervtisy (Cambridge University) effect from the psycholinguistics literature has demonstrated such a robust word processing mechanism, where jumbled words (e.g. Cmabrigde / Cambridge) are recognized with little cost. On the other hand, computational models for word recognition (e.g. spelling checkers)… ▽ More

    Submitted 7 February, 2017; v1 submitted 7 August, 2016; originally announced August 2016.

  33. arXiv:1605.02592  [pdf, other

    cs.CL

    GLEU Without Tuning

    Authors: Courtney Napoles, Keisuke Sakaguchi, Matt Post, Joel Tetreault

    Abstract: The GLEU metric was proposed for evaluating grammatical error corrections using n-gram overlap with a set of reference sentences, as opposed to precision/recall of specific annotated errors (Napoles et al., 2015). This paper describes improvements made to the GLEU metric that address problems that arise when using an increasing number of reference sets. Unlike the originally presented metric, the… ▽ More

    Submitted 9 May, 2016; originally announced May 2016.