Showing 1–9 of 9 results for author: Awal, R

  1. arXiv:2407.16772

    cs.CV cs.CL cs.LG

    VisMin: Visual Minimal-Change Understanding

    Authors: Rabiul Awal, Saba Ahmadi, Le Zhang, Aishwarya Agrawal

    Abstract: Fine-grained understanding of objects, attributes, and relationships between objects is crucial for visual-language models (VLMs). Existing benchmarks primarily focus on evaluating VLMs' capability to distinguish between two very similar captions given an image. In this paper, we introduce a new, challenging benchmark termed Visual Minimal-Change Understanding (VisMin),…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: Project URL at https://vismin.net/

  2. arXiv:2407.10920

    cs.CV cs.AI cs.CL

    Benchmarking Vision Language Models for Cultural Understanding

    Authors: Shravan Nayak, Kanishk Jain, Rabiul Awal, Siva Reddy, Sjoerd van Steenkiste, Lisa Anne Hendricks, Karolina Stańczak, Aishwarya Agrawal

    Abstract: Foundation models and vision-language pre-training have notably advanced Vision Language Models (VLMs), enabling multimodal processing of visual and linguistic data. However, their performance has been typically assessed on general scene understanding - recognizing objects, attributes, and actions - rather than cultural comprehension. This study introduces CulturalVQA, a visual question-answering…

    Submitted 18 July, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

  3. arXiv:2306.09996

    cs.CV cs.CL

    Investigating Prompting Techniques for Zero- and Few-Shot Visual Question Answering

    Authors: Rabiul Awal, Le Zhang, Aishwarya Agrawal

    Abstract: In this paper, we explore effective prompting techniques to enhance zero- and few-shot Visual Question Answering (VQA) performance in contemporary Vision-Language Models (VLMs). Central to our investigation is the role of question templates in guiding VLMs to generate accurate answers. We identify that specific templates significantly influence VQA outcomes, underscoring the need for strategic tem…

    Submitted 9 January, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: Codes available at https://github.com/rabiulcste/vqazero

  4. arXiv:2306.08832

    cs.CV

    Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding

    Authors: Le Zhang, Rabiul Awal, Aishwarya Agrawal

    Abstract: Vision-Language Models (VLMs), such as CLIP, exhibit strong image-text comprehension abilities, facilitating advances in several downstream tasks such as zero-shot image classification, image-text retrieval, and text-to-image generation. However, the compositional reasoning abilities of existing VLMs remains subpar. The root of this limitation lies in the inadequate alignment between the images an…

    Submitted 25 April, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: CVPR 2024

  5. arXiv:2305.17680

    cs.CL cs.AI

    Evaluating GPT-3 Generated Explanations for Hateful Content Moderation

    Authors: Han Wang, Ming Shan Hee, Md Rabiul Awal, Kenny Tsu Wei Choo, Roy Ka-Wei Lee

    Abstract: Recent research has focused on using large language models (LLMs) to generate explanations for hate speech through fine-tuning or prompting. Despite the growing interest in this area, these generated explanations' effectiveness and potential limitations remain poorly understood. A key concern is that these explanations, generated by LLMs, may lead to erroneous judgments about the nature of flagged…

    Submitted 30 August, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

    Comments: 9 pages, 2 figures, Accepted by International Joint Conference on Artificial Intelligence (IJCAI)

    ACM Class: I.2.7

  6. arXiv:2303.02513

    cs.CL cs.SI

    Model-Agnostic Meta-Learning for Multilingual Hate Speech Detection

    Authors: Md Rabiul Awal, Roy Ka-Wei Lee, Eshaan Tanwar, Tanmay Garg, Tanmoy Chakraborty

    Abstract: Hate speech in social media is a growing phenomenon, and detecting such toxic content has recently gained significant traction in the research community. Existing studies have explored fine-tuning language models (LMs) to perform hate speech detection, and these solutions have yielded significant performance. However, most of these studies are limited to detecting hate speech only in English, negl…

    Submitted 4 March, 2023; originally announced March 2023.

  7. arXiv:2103.11800

    cs.CL cs.SI

    AngryBERT: Joint Learning Target and Emotion for Hate Speech Detection

    Authors: Md Rabiul Awal, Rui Cao, Roy Ka-Wei Lee, Sandra Mitrovic

    Abstract: Automated hate speech detection in social media is a challenging task that has recently gained significant traction in the data mining and Natural Language Processing community. However, most of the existing methods adopt a supervised approach that depended heavily on the annotated hate speech datasets, which are imbalanced and often lack training samples for hateful content. This paper addresses…

    Submitted 14 March, 2021; originally announced March 2021.

    Comments: Paper Accepted for 25th Pacific-Asia Conference on Knowledge Discovery and Data Mining

  8. arXiv:2007.10712

    cs.SI cs.CL

    On Analyzing Antisocial Behaviors Amid COVID-19 Pandemic

    Authors: Md Rabiul Awal, Rui Cao, Sandra Mitrovic, Roy Ka-Wei Lee

    Abstract: The COVID-19 pandemic has developed to be more than a bio-crisis as global news has reported a sharp rise in xenophobia and discrimination in both online and offline communities. Such toxic behaviors take a heavy toll on society, especially during these daunting times. Despite the gravity of the issue, very few studies have studied online antisocial behaviors amid the COVID-19 pandemic. In this pa…

    Submitted 21 July, 2020; originally announced July 2020.

  9. arXiv:2006.13507

    cs.SI cs.CL

    On Analyzing Annotation Consistency in Online Abusive Behavior Datasets

    Authors: Md Rabiul Awal, Rui Cao, Roy Ka-Wei Lee, Sandra Mitrović

    Abstract: Online abusive behavior is an important issue that breaks the cohesiveness of online social communities and even raises public safety concerns in our societies. Motivated by this rising issue, researchers have proposed, collected, and annotated online abusive content datasets. These datasets play a critical role in facilitating the research on online hate speech and abusive behaviors. However, the…

    Submitted 24 June, 2020; originally announced June 2020.