Showing 1–2 of 2 results for author: Ondracek, I

  1. arXiv:2403.18957  [pdf, other]

    cs.CY cs.CL cs.LG cs.SI

    Moderating Illicit Online Image Promotion for Unsafe User-Generated Content Games Using Large Vision-Language Models

    Authors: Keyan Guo, Ayush Utkarsh, Wenbo Ding, Isabelle Ondracek, Ziming Zhao, Guo Freeman, Nishant Vishwamitra, Hongxin Hu

    Abstract: Online user generated content games (UGCGs) are increasingly popular among children and adolescents for social interaction and more creative online entertainment. However, they pose a heightened risk of exposure to explicit content, raising growing concerns for the online safety of children and adolescents. Despite these concerns, few studies have addressed the issue of illicit image-based promoti…

    Submitted 12 August, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: To Appear in the 33rd USENIX Security Symposium, August 14-16, 2024

  2. arXiv:2312.15099  [pdf, other]

    cs.CL cs.CY cs.LG cs.SI

    Moderating New Waves of Online Hate with Chain-of-Thought Reasoning in Large Language Models

    Authors: Nishant Vishwamitra, Keyan Guo, Farhan Tajwar Romit, Isabelle Ondracek, Long Cheng, Ziming Zhao, Hongxin Hu

    Abstract: Online hate is an escalating problem that negatively impacts the lives of Internet users, and is also subject to rapid changes due to evolving events, resulting in new waves of online hate that pose a critical threat. Detecting and mitigating these new waves present two key challenges: it demands reasoning-based complex decision-making to determine the presence of hateful content, and the limited…

    Submitted 10 May, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: To Appear in the 45th IEEE Symposium on Security and Privacy, May 20-23, 2024