Showing 1–1 of 1 results for author: Harshavardhan, A

  1. arXiv:2310.12860

    cs.CL cs.CY

    Probing LLMs for hate speech detection: strengths and vulnerabilities

    Authors: Sarthak Roy, Ashish Harshavardhan, Animesh Mukherjee, Punyajoy Saha

    Abstract: Recently, efforts have been made by social media platforms as well as researchers to detect hateful or toxic language using large language models. However, none of these works aims to use explanation, additional context and victim community information in the detection process. We utilise different prompt variations and input information, and evaluate large language models in a zero-shot setting (without a…

    Submitted 28 October, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: 13 pages, 9 figures, 7 tables; accepted to Findings of EMNLP 2023