Zum Hauptinhalt springen

Showing 1–2 of 2 results for author: Latouche, G L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.12881  [pdf, other

    cs.CL cs.AI

    BinaryAlign: Word Alignment as Binary Sequence Labeling

    Authors: Gaetan Lopez Latouche, Marc-André Carbonneau, Ben Swanson

    Abstract: Real world deployments of word alignment are almost certain to cover both high and low resource languages. However, the state-of-the-art for this task recommends a different model class depending on the availability of gold alignment training data for a particular language pair. We propose BinaryAlign, a novel word alignment technique based on binary sequence labeling that outperforms existing app… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Accepted to ACL 2024

  2. arXiv:2407.11854  [pdf, other

    cs.CL cs.AI

    Zero-shot Cross-Lingual Transfer for Synthetic Data Generation in Grammatical Error Detection

    Authors: Gaetan Lopez Latouche, Marc-André Carbonneau, Ben Swanson

    Abstract: Grammatical Error Detection (GED) methods rely heavily on human annotated error corpora. However, these annotations are unavailable in many low-resource languages. In this paper, we investigate GED in this context. Leveraging the zero-shot cross-lingual transfer capabilities of multilingual pre-trained language models, we train a model using data from a diverse set of languages to generate synthet… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Submitted to EMNLP 2024