Showing 1–1 of 1 results for author: Sukhoo, A

  1. arXiv:2206.02421  [pdf, other]

    cs.CL

    MorisienMT: A Dataset for Mauritian Creole Machine Translation

    Authors: Raj Dabre, Aneerav Sukhoo

    Abstract: In this paper, we describe MorisienMT, a dataset for benchmarking machine translation quality of Mauritian Creole. Mauritian Creole (Morisien) is the lingua franca of the Republic of Mauritius and is a French-based creole language. MorisienMT consists of a parallel corpus between English and Morisien, French and Morisien and a monolingual corpus for Morisien. We first give an overview of Morisien…

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: Work in progress! (obviously) Dataset is here: https://huggingface.co/datasets/prajdabre/MorisienMT