Zum Hauptinhalt springen

Showing 1–13 of 13 results for author: Lahoti, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17104  [pdf, other

    cs.CL

    Automated Adversarial Discovery for Safety Classifiers

    Authors: Yash Kumar Lal, Preethi Lahoti, Aradhana Sinha, Yao Qin, Ananth Balashankar

    Abstract: Safety classifiers are critical in mitigating toxicity on online forums such as social media and in chatbots. Still, they continue to be vulnerable to emergent, and often innumerable, adversarial attacks. Traditional automated adversarial data generation methods, however, tend to produce attacks that are not diverse, but variations of previously observed harm types. We formalize the task of automa… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Published at Fourth Workshop on TrustworthyNLP (TrustNLP) at NAACL 2024

  2. arXiv:2406.16738  [pdf, other

    cs.LG cs.AI cs.CY

    Inducing Group Fairness in LLM-Based Decisions

    Authors: James Atwood, Preethi Lahoti, Ananth Balashankar, Flavien Prost, Ahmad Beirami

    Abstract: Prompting Large Language Models (LLMs) has created new and interesting means for classifying textual data. While evaluating and remediating group fairness is a well-studied problem in classifier fairness literature, some classical approaches (e.g., regularization) do not carry over, and some new opportunities arise (e.g., prompt-based remediation). We measure fairness of LLM-based classifiers on a… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  4. arXiv:2312.02592  [pdf, other

    cs.LG cs.CY

    FRAPPE: A Group Fairness Framework for Post-Processing Everything

    Authors: Alexandru Tifrea, Preethi Lahoti, Ben Packer, Yoni Halpern, Ahmad Beirami, Flavien Prost

    Abstract: Despite achieving promising fairness-error trade-offs, in-processing mitigation techniques for group fairness cannot be employed in numerous practical applications with limited computation resources or no access to the training pipeline of the prediction model. In these situations, post-processing is a viable alternative. However, current methods are tailored to specific problem settings and fairn… ▽ More

    Submitted 20 June, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: Conference paper at ICML 2024

  5. arXiv:2311.08592  [pdf, other

    cs.SE cs.AI cs.CL

    AART: AI-Assisted Red-Teaming with Diverse Data Generation for New LLM-powered Applications

    Authors: Bhaktipriya Radharapu, Kevin Robinson, Lora Aroyo, Preethi Lahoti

    Abstract: Adversarial testing of large language models (LLMs) is crucial for their safe and responsible deployment. We introduce a novel approach for automated generation of adversarial evaluation datasets to test the safety of LLM generations on new downstream applications. We call it AI-assisted Red-Teaming (AART) - an automated alternative to current manual red-teaming efforts. AART offers a data generat… ▽ More

    Submitted 29 November, 2023; v1 submitted 14 November, 2023; originally announced November 2023.

  6. arXiv:2310.16523  [pdf, other

    cs.CL cs.AI

    Improving Diversity of Demographic Representation in Large Language Models via Collective-Critiques and Self-Voting

    Authors: Preethi Lahoti, Nicholas Blumm, Xiao Ma, Raghavendra Kotikalapudi, Sahitya Potluri, Qijun Tan, Hansa Srinivasan, Ben Packer, Ahmad Beirami, Alex Beutel, Jilin Chen

    Abstract: A crucial challenge for generative large language models (LLMs) is diversity: when a user's prompt is under-specified, models may follow implicit assumptions while generating a response, which may result in homogenization of the responses, as well as certain demographic groups being under-represented or even erased from the generated responses. In this paper, we formalize diversity of representati… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: To appear at EMNLP 2023 main conference

  7. arXiv:2109.04432  [pdf, other

    cs.LG cs.IR stat.ML

    Detecting and Mitigating Test-time Failure Risks via Model-agnostic Uncertainty Learning

    Authors: Preethi Lahoti, Krishna P. Gummadi, Gerhard Weikum

    Abstract: Reliably predicting potential failure risks of machine learning (ML) systems when deployed with production data is a crucial aspect of trustworthy AI. This paper introduces Risk Advisor, a novel post-hoc meta-learner for estimating failure risks and predictive uncertainties of any already-trained black-box classification model. In addition to providing a risk score, the Risk Advisor decomposes the… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: To appear in the 21st IEEE International Conference on Data Mining (ICDM 2021), Auckland, New Zealand

  8. Accounting for Model Uncertainty in Algorithmic Discrimination

    Authors: Junaid Ali, Preethi Lahoti, Krishna P. Gummadi

    Abstract: Traditional approaches to ensure group fairness in algorithmic decision making aim to equalize ``total'' error rates for different subgroups in the population. In contrast, we argue that the fairness approaches should instead focus only on equalizing errors arising due to model uncertainty (a.k.a epistemic uncertainty), caused due to lack of knowledge about the best model or due to lack of data. I… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

    Comments: 12 pages, Accepted at AIES 2021

  9. arXiv:2006.13114  [pdf, other

    cs.LG stat.ML

    Fairness without Demographics through Adversarially Reweighted Learning

    Authors: Preethi Lahoti, Alex Beutel, Jilin Chen, Kang Lee, Flavien Prost, Nithum Thain, Xuezhi Wang, Ed H. Chi

    Abstract: Much of the previous machine learning (ML) fairness literature assumes that protected features such as race and sex are present in the dataset, and relies upon them to mitigate fairness concerns. However, in practice factors like privacy and regulation often preclude the collection of protected features, or their use for training or inference, severely limiting the applicability of traditional fai… ▽ More

    Submitted 3 November, 2020; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: To appear at 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada

  10. arXiv:1910.10255  [pdf, other

    cs.CY cs.AI cs.LG

    An Empirical Study on Learning Fairness Metrics for COMPAS Data with Human Supervision

    Authors: Hanchen Wang, Nina Grgic-Hlaca, Preethi Lahoti, Krishna P. Gummadi, Adrian Weller

    Abstract: The notion of individual fairness requires that similar people receive similar treatment. However, this is hard to achieve in practice since it is difficult to specify the appropriate similarity metric. In this work, we attempt to learn such similarity metric from human annotated data. We gather a new dataset of human judgments on a criminal recidivism prediction (COMPAS) task. By assuming the hum… ▽ More

    Submitted 31 October, 2019; v1 submitted 22 October, 2019; originally announced October 2019.

    Comments: Accepted at NeurIPS 2019 HCML Workshop

  11. Operationalizing Individual Fairness with Pairwise Fair Representations

    Authors: Preethi Lahoti, Krishna P. Gummadi, Gerhard Weikum

    Abstract: We revisit the notion of individual fairness proposed by Dwork et al. A central challenge in operationalizing their approach is the difficulty in eliciting a human specification of a similarity metric. In this paper, we propose an operationalization of individual fairness that does not rely on a human specification of a distance metric. Instead, we propose novel approaches to elicit and leverage s… ▽ More

    Submitted 1 December, 2019; v1 submitted 2 July, 2019; originally announced July 2019.

    Comments: To be published in the proceedings of the VLDB Endowment, Vol. 13, Issue. 4

  12. arXiv:1806.01059  [pdf, other

    cs.LG cs.IR stat.ML

    iFair: Learning Individually Fair Data Representations for Algorithmic Decision Making

    Authors: Preethi Lahoti, Krishna P. Gummadi, Gerhard Weikum

    Abstract: People are rated and ranked, towards algorithmic decision making in an increasing number of applications, typically based on machine learning. Research on how to incorporate fairness into such tasks has prevalently pursued the paradigm of group fairness: giving adequate success rates to specifically protected groups. In contrast, the alternative paradigm of individual fairness has received relativ… ▽ More

    Submitted 6 February, 2019; v1 submitted 4 June, 2018; originally announced June 2018.

    Comments: Accepted at ICDE 2019. Please cite the ICDE 2019 proceedings version

  13. arXiv:1711.10251  [pdf, other

    cs.SI

    Joint Non-negative Matrix Factorization for Learning Ideological Leaning on Twitter

    Authors: Preethi Lahoti, Kiran Garimella, Aristides Gionis

    Abstract: People are shifting from traditional news sources to online news at an incredibly fast rate. However, the technology behind online news consumption promotes content that confirms the users' existing point of view. This phenomenon has led to polarization of opinions and intolerance towards opposing views. Thus, a key problem is to model information filter bubbles on social media and design methods… ▽ More

    Submitted 28 November, 2017; originally announced November 2017.

    Comments: Accepted at WSDM 2018. Please cite the WSDM version, not the arxiv version