Zum Hauptinhalt springen

Showing 1–5 of 5 results for author: Mammen, P M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.12241  [pdf, other

    cs.CL cs.AI

    Introducing v0.5 of the AI Safety Benchmark from MLCommons

    Authors: Bertie Vidgen, Adarsh Agrawal, Ahmed M. Ahmed, Victor Akinwande, Namir Al-Nuaimi, Najla Alfaraj, Elie Alhajjar, Lora Aroyo, Trupti Bavalatti, Max Bartolo, Borhane Blili-Hamelin, Kurt Bollacker, Rishi Bomassani, Marisa Ferrara Boston, Siméon Campos, Kal Chakra, Canyu Chen, Cody Coleman, Zacharie Delpierre Coudert, Leon Derczynski, Debojyoti Dutta, Ian Eisenberg, James Ezick, Heather Frase, Brian Fuller , et al. (75 additional authors not shown)

    Abstract: This paper introduces v0.5 of the AI Safety Benchmark, which has been created by the MLCommons AI Safety Working Group. The AI Safety Benchmark has been designed to assess the safety risks of AI systems that use chat-tuned language models. We introduce a principled approach to specifying and constructing the benchmark, which for v0.5 covers only a single use case (an adult chatting to a general-pu… ▽ More

    Submitted 13 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  2. arXiv:2309.05227  [pdf, other

    cs.CL cs.AI

    Detecting Natural Language Biases with Prompt-based Learning

    Authors: Md Abdul Aowal, Maliha T Islam, Priyanka Mary Mammen, Sandesh Shetty

    Abstract: In this project, we want to explore the newly emerging field of prompt engineering and apply it to the downstream task of detecting LM biases. More concretely, we explore how to design prompts that can indicate 4 different types of biases: (1) gender, (2) race, (3) sexual orientation, and (4) religion-based. Within our project, we experiment with different manually crafted prompts that can draw ou… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  3. arXiv:2102.03690  [pdf, other

    eess.SP cs.LG

    WiSleep: Inferring Sleep Duration at Scale Using Passive WiFi Sensing

    Authors: Priyanka Mary Mammen, Camellia Zakaria, Tergel Molom-Ochir, Amee Trivedi, Prashant Shenoy, Rajesh Balan

    Abstract: Sleep deprivation is a public health concern that significantly impacts one's well-being and performance. Sleep is an intimate experience, and state-of-the-art sleep monitoring solutions are highly-personalized to individual users. With a motivation to expand sleep monitoring capabilities at a large scale and contribute sleep data to public health understanding, we present Wisleep, a system for in… ▽ More

    Submitted 14 March, 2022; v1 submitted 6 February, 2021; originally announced February 2021.

    Comments: 14 pages, 17 figures

  4. arXiv:2101.05428  [pdf, other

    cs.LG cs.DC

    Federated Learning: Opportunities and Challenges

    Authors: Priyanka Mary Mammen

    Abstract: Federated Learning (FL) is a concept first introduced by Google in 2016, in which multiple devices collaboratively learn a machine learning model without sharing their private data under the supervision of a central server. This offers ample opportunities in critical domains such as healthcare, finance etc, where it is risky to share private user information to other organisations or devices. Whil… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

  5. arXiv:1910.08719  [pdf, other

    cs.LG cs.AI cs.GT stat.ML

    Explainable AI: Deep Reinforcement Learning Agents for Residential Demand Side Cost Savings in Smart Grids

    Authors: Hareesh Kumar, Priyanka Mary Mammen, Krithi Ramamritham

    Abstract: Motivated by recent advancements in Deep Reinforcement Learning (RL), we have developed an RL agent to manage the operation of storage devices in a household and is designed to maximize demand-side cost savings. The proposed technique is data-driven, and the RL agent learns from scratch how to efficiently use the energy storage device given variable tariff structures. In most of the studies, the R… ▽ More

    Submitted 30 October, 2019; v1 submitted 19 October, 2019; originally announced October 2019.