Skip to main content

Showing 1–6 of 6 results for author: Imankulova, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2205.00551  [pdf, other

    cs.CL

    Gender Bias in Masked Language Models for Multiple Languages

    Authors: Masahiro Kaneko, Aizhan Imankulova, Danushka Bollegala, Naoaki Okazaki

    Abstract: Masked Language Models (MLMs) pre-trained by predicting masked tokens on large corpora have been used successfully in natural language processing tasks for a variety of languages. Unfortunately, it was reported that MLMs also learn discriminative biases regarding attributes such as gender and race. Because most studies have focused on MLMs in English, the bias of MLMs in other languages has rarely… ▽ More

    Submitted 4 May, 2022; v1 submitted 1 May, 2022; originally announced May 2022.

    Comments: NAACL 2022

  2. arXiv:2106.06689  [pdf, other

    cs.CL

    Neural Combinatory Constituency Parsing

    Authors: Zhousi Chen, Longtu Zhang, Aizhan Imankulova, Mamoru Komachi

    Abstract: We propose two fast neural combinatory models for constituency parsing: binary and multi-branching. Our models decompose the bottom-up parsing process into 1) classification of tags, labels, and binary orientations or chunks and 2) vector composition based on the computed orientations or chunks. These models have theoretical sub-quadratic complexity and empirical linear complexity. The binary mode… ▽ More

    Submitted 12 June, 2021; originally announced June 2021.

    Comments: Findings of ACL 2021; 15 pages

  3. arXiv:2105.07316  [pdf, other

    cs.CL

    From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding

    Authors: Rob van der Goot, Ibrahim Sharaf, Aizhan Imankulova, Ahmet Üstün, Marija Stepanović, Alan Ramponi, Siti Oryza Khairunnisa, Mamoru Komachi, Barbara Plank

    Abstract: The lack of publicly available evaluation data for low-resource languages limits progress in Spoken Language Understanding (SLU). As key tasks like intent classification and slot filling require abundant training data, it is desirable to reuse existing data in high-resource languages to develop models for low-resource scenarios. We introduce xSID, a new benchmark for cross-lingual Slot and Intent… ▽ More

    Submitted 15 May, 2021; originally announced May 2021.

    Comments: To appear in the proceedings of NAACL 2021

  4. arXiv:2104.07410  [pdf, other

    cs.CL

    Simultaneous Multi-Pivot Neural Machine Translation

    Authors: Raj Dabre, Aizhan Imankulova, Masahiro Kaneko, Abhisek Chakrabarty

    Abstract: Parallel corpora are indispensable for training neural machine translation (NMT) models, and parallel corpora for most language pairs do not exist or are scarce. In such cases, pivot language NMT can be helpful where a pivot language is used such that there exist parallel corpora between the source and pivot and pivot and target languages. Naturally, the quality of pivot language translation is mo… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

    Comments: preliminary work. pardon the messy writing and mistakes. will be submitted to emnlp after major overhaul

  5. arXiv:2004.03180  [pdf, other

    cs.CL

    Towards Multimodal Simultaneous Neural Machine Translation

    Authors: Aizhan Imankulova, Masahiro Kaneko, Tosho Hirasawa, Mamoru Komachi

    Abstract: Simultaneous translation involves translating a sentence before the speaker's utterance is completed in order to realize real-time understanding in multiple languages. This task is significantly more challenging than the general full sentence translation because of the shortage of input information during decoding. To alleviate this shortage, we propose multimodal simultaneous neural machine trans… ▽ More

    Submitted 23 October, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: 10 pages; WMT 2020

  6. arXiv:1907.03060  [pdf, ps, other

    cs.CL

    Exploiting Out-of-Domain Parallel Data through Multilingual Transfer Learning for Low-Resource Neural Machine Translation

    Authors: Aizhan Imankulova, Raj Dabre, Atsushi Fujita, Kenji Imamura

    Abstract: This paper proposes a novel multilingual multistage fine-tuning approach for low-resource neural machine translation (NMT), taking a challenging Japanese--Russian pair for benchmarking. Although there are many solutions for low-resource scenarios, such as multilingual NMT and back-translation, we have empirically confirmed their limited success when restricted to in-domain data. We therefore propo… ▽ More

    Submitted 5 July, 2019; originally announced July 2019.

    Comments: Accepted at the 17th Machine Translation Summit