Zum Hauptinhalt springen

Showing 1–5 of 5 results for author: Malaysha, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.20663  [pdf, other

    cs.CL

    ArabicNLU 2024: The First Arabic Natural Language Understanding Shared Task

    Authors: Mohammed Khalilia, Sanad Malaysha, Reem Suwaileh, Mustafa Jarrar, Alaa Aljabari, Tamer Elsayed, Imed Zitouni

    Abstract: This paper presents an overview of the Arabic Natural Language Understanding (ArabicNLU 2024) shared task, focusing on two subtasks: Word Sense Disambiguation (WSD) and Location Mention Disambiguation (LMD). The task aimed to evaluate the ability of automated systems to resolve word ambiguity and identify locations mentioned in Arabic text. We provided participants with novel datasets, including a… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: In Proceedings of the Second Arabic Natural Language Processing Conference (ArabicNLP 2024), Bangkok, Thailand. Association for Computational Linguistics

  2. arXiv:2407.09818  [pdf, other

    cs.CL

    AraFinNLP 2024: The First Arabic Financial NLP Shared Task

    Authors: Sanad Malaysha, Mo El-Haj, Saad Ezzini, Mohammed Khalilia, Mustafa Jarrar, Sultan Almujaiwel, Ismail Berrada, Houda Bouamor

    Abstract: The expanding financial markets of the Arab world require sophisticated Arabic NLP tools. To address this need within the banking domain, the Arabic Financial NLP (AraFinNLP) shared task proposes two subtasks: (i) Multi-dialect Intent Detection and (ii) Cross-dialect Translation and Intent Preservation. This shared task uses the updated ArBanking77 dataset, which includes about 39k parallel querie… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

  3. arXiv:2405.00659  [pdf, other

    cs.CL

    NLU-STR at SemEval-2024 Task 1: Generative-based Augmentation and Encoder-based Scoring for Semantic Textual Relatedness

    Authors: Sanad Malaysha, Mustafa Jarrar, Mohammed Khalilia

    Abstract: Semantic textual relatedness is a broader concept of semantic similarity. It measures the extent to which two chunks of text convey similar meaning or topics, or share related concepts or contexts. This notion of relatedness can be applied in various applications, such as document clustering and summarizing. SemRel-2024, a shared task in SemEval-2024, aims at reducing the gap in the semantic relat… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  4. arXiv:2310.19029  [pdf, other

    cs.CL

    SALMA: Arabic Sense-Annotated Corpus and WSD Benchmarks

    Authors: Mustafa Jarrar, Sanad Malaysha, Tymaa Hammouda, Mohammed Khalilia

    Abstract: SALMA, the first Arabic sense-annotated corpus, consists of ~34K tokens, which are all sense-annotated. The corpus is annotated using two different sense inventories simultaneously (Modern and Ghani). SALMA novelty lies in how tokens and senses are associated. Instead of linking a token to only one intended sense, SALMA links a token to multiple senses and provides a score to each sense. A smart w… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

  5. arXiv:2302.03126  [pdf, other

    cs.CL

    Context-Gloss Augmentation for Improving Arabic Target Sense Verification

    Authors: Sanad Malaysha, Mustafa Jarrar, Mohammed Khalilia

    Abstract: Arabic language lacks semantic datasets and sense inventories. The most common semantically-labeled dataset for Arabic is the ArabGlossBERT, a relatively small dataset that consists of 167K context-gloss pairs (about 60K positive and 107K negative pairs), collected from Arabic dictionaries. This paper presents an enrichment to the ArabGlossBERT dataset, by augmenting it using (Arabic-English-Arabi… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Journal ref: The 12th International Global Wordnet Conference (GWC2023), Global Wordnet Association. (pp. ). San Sebastian, Spain, 2023