Showing 1–2 of 2 results for author: Fitriany, S

  1. arXiv:2205.04651  [pdf, other]

    cs.CL

    ParaCotta: Synthetic Multilingual Paraphrase Corpora from the Most Diverse Translation Sample Pair

    Authors: Alham Fikri Aji, Tirana Noor Fatyanosa, Radityo Eko Prasojo, Philip Arthur, Suci Fitriany, Salma Qonitah, Nadhifa Zulfa, Tomi Santoso, Mahendra Data

    Abstract: We release our synthetic parallel paraphrase corpus across 17 languages: Arabic, Catalan, Czech, German, English, Spanish, Estonian, French, Hindi, Indonesian, Italian, Dutch, Romanian, Russian, Swedish, Vietnamese, and Chinese. Our method relies only on monolingual data and a neural machine translation system to generate paraphrases, hence simple to apply. We generate multiple translation samples…

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: 10 pages, 3 figures, 6 tables. Accepted at PACLIC 2021. (ACL Anthology link: https://aclanthology.org/2021.paclic-1.56/)

    MSC Class: 68T50

    ACM Class: I.2.7; I.2.6

  2. arXiv:2011.03286  [pdf, other]

    cs.CL

    Semi-Supervised Low-Resource Style Transfer of Indonesian Informal to Formal Language with Iterative Forward-Translation

    Authors: Haryo Akbarianto Wibowo, Tatag Aziz Prawiro, Muhammad Ihsan, Alham Fikri Aji, Radityo Eko Prasojo, Rahmad Mahendra, Suci Fitriany

    Abstract: In its daily use, the Indonesian language is riddled with informality, that is, deviations from the standard in terms of vocabulary, spelling, and word order. On the other hand, current available Indonesian NLP models are typically developed with the standard Indonesian in mind. In this work, we address a style-transfer from informal to formal Indonesian as a low-resource machine translation probl…

    Submitted 22 December, 2020; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: 6 pages, Camera ready to be presented at IALP 2020

    MSC Class: 68T50