Showing 1–10 of 10 results for author: Kumar, I

Searching in archive cs.
  1. arXiv:2407.11016  [pdf, other]

    cs.CL cs.LG

    LongLaMP: A Benchmark for Personalized Long-form Text Generation

    Authors: Ishita Kumar, Snigdha Viswanathan, Sushrita Yerra, Alireza Salemi, Ryan A. Rossi, Franck Dernoncourt, Hanieh Deilamsalehy, Xiang Chen, Ruiyi Zhang, Shubham Agarwal, Nedim Lipka, Hamed Zamani

    Abstract: Long-text generation is seemingly ubiquitous in real-world applications of large language models such as generating an email or writing a review. Despite the fundamental importance and prevalence of long-text generation in many practical applications, existing work on personalized generation has focused on the generation of very short text. To overcome these limitations, we study the problem of pe…

    Submitted 26 June, 2024; originally announced July 2024.

    Comments: 9 pages, 4 figures, 20 tables (including appendix); submitted to EMNLP

  2. arXiv:2402.18803  [pdf, other]

    cs.LG cs.CY

    To Pool or Not To Pool: Analyzing the Regularizing Effects of Group-Fair Training on Shared Models

    Authors: Cyrus Cousins, I. Elizabeth Kumar, Suresh Venkatasubramanian

    Abstract: In fair machine learning, one source of performance disparities between groups is over-fitting to groups with relatively few training samples. We derive group-specific bounds on the generalization error of welfare-centric fair machine learning that benefit from the larger sample size of the majority group. We do this by considering group-specific Rademacher averages over a restricted hypothesis cl…

    Submitted 28 February, 2024; originally announced February 2024.

  3. arXiv:2311.14744  [pdf]

    physics.chem-ph cs.AI cs.LG

    Coarse-Grained Configurational Polymer Fingerprints for Property Prediction using Machine Learning

    Authors: Ishan Kumar, Prateek K Jha

    Abstract: In this work, we present a method to generate a configurational level fingerprint for polymers using the Bead-Spring-Model. Unlike some of the previous fingerprinting approaches that employ monomer-level information where atomistic descriptors are computed using quantum chemistry calculations, this approach incorporates configurational information from a coarse-grained model of a long polymer chai…

    Submitted 20 November, 2023; originally announced November 2023.

  4. arXiv:2311.02790  [pdf, other]

    cs.CL cs.AI cs.CY cs.IR cs.LG

    CausalCite: A Causal Formulation of Paper Citations

    Authors: Ishan Kumar, Zhijing Jin, Ehsan Mokhtarian, Siyuan Guo, Yuen Chen, Mrinmaya Sachan, Bernhard Schölkopf

    Abstract: Citation count of a paper is a commonly used proxy for evaluating the significance of a paper in the scientific community. Yet citation measures are widely criticized for failing to accurately reflect the true impact of a paper. Thus, we propose CausalCite, a new way to measure the significance of a paper by assessing the causal impact of the paper on its follow-up papers. CausalCite is based on a…

    Submitted 27 May, 2024; v1 submitted 5 November, 2023; originally announced November 2023.

    Comments: ACL 2024 Findings

  5. arXiv:2306.06546  [pdf, other]

    cs.SD cs.LG eess.AS

    High-Fidelity Audio Compression with Improved RVQGAN

    Authors: Rithesh Kumar, Prem Seetharaman, Alejandro Luebs, Ishaan Kumar, Kundan Kumar

    Abstract: Language models have been successfully used to model natural signals, such as images, speech, and music. A key component of these models is a high quality neural compression model that can compress high-dimensional natural signals into lower dimensional discrete tokens. To that end, we introduce a high-fidelity universal neural audio compression algorithm that achieves ~90x compression of 44.1 kHz…

    Submitted 26 October, 2023; v1 submitted 10 June, 2023; originally announced June 2023.

    Comments: Accepted at NeurIPS 2023 (spotlight)

  6. arXiv:2301.03034  [pdf, other]

    cs.DB cs.SE

    Hunter: Using Change Point Detection to Hunt for Performance Regressions

    Authors: Matt Fleming, Piotr Kołaczkowski, Ishita Kumar, Shaunak Das, Sean McCarthy, Pushkala Pattabhiraman, Henrik Ingo

    Abstract: Change point detection has recently gained popularity as a method of detecting performance changes in software due to its ability to cope with noisy data. In this paper we present Hunter, an open source tool that automatically detects performance regressions and improvements in time-series data. Hunter uses a modified E-divisive means algorithm to identify statistically significant changes in norm…

    Submitted 8 January, 2023; originally announced January 2023.

  7. Equalizing Credit Opportunity in Algorithms: Aligning Algorithmic Fairness Research with U.S. Fair Lending Regulation

    Authors: I. Elizabeth Kumar, Keegan E. Hines, John P. Dickerson

    Abstract: Credit is an essential component of financial wellbeing in America, and unequal access to it is a large factor in the economic disparities between demographic groups that exist today. Today, machine learning algorithms, sometimes trained on alternative data, are increasingly being used to determine access to credit, yet research has shown that machine learning can encode many different versions of…

    Submitted 5 October, 2022; originally announced October 2022.

    Journal ref: AIES '22: Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society

  8. The Fallacy of AI Functionality

    Authors: Inioluwa Deborah Raji, I. Elizabeth Kumar, Aaron Horowitz, Andrew D. Selbst

    Abstract: Deployed AI systems often do not work. They can be constructed haphazardly, deployed indiscriminately, and promoted deceptively. However, despite this reality, scholars, the press, and policymakers pay too little attention to functionality. This leads to technical and policy solutions focused on "ethical" or value-aligned deployments, often skipping over the prior question of whether a given syste…

    Submitted 1 July, 2022; v1 submitted 19 June, 2022; originally announced June 2022.

    Journal ref: 2022 ACM Conference on Fairness, Accountability, and Transparency (FAccT '22)

  9. Epistemic values in feature importance methods: Lessons from feminist epistemology

    Authors: Leif Hancox-Li, I. Elizabeth Kumar

    Abstract: As the public seeks greater accountability and transparency from machine learning algorithms, the research literature on methods to explain algorithms and their outputs has rapidly expanded. Feature importance methods form a popular class of explanation methods. In this paper, we apply the lens of feminist epistemology to recent feature importance research. We investigate what epistemic values are…

    Submitted 29 January, 2021; originally announced January 2021.

    Comments: Accepted to ACM FAccT 2021

  10. arXiv:2002.11097  [pdf, other]

    cs.AI cs.LG stat.ML

    Problems with Shapley-value-based explanations as feature importance measures

    Authors: I. Elizabeth Kumar, Suresh Venkatasubramanian, Carlos Scheidegger, Sorelle Friedler

    Abstract: Game-theoretic formulations of feature importance have become popular as a way to "explain" machine learning models. These methods define a cooperative game between the features of a model and distribute influence among these input elements using some form of the game's unique Shapley values. Justification for these methods rests on two pillars: their desirable mathematical properties, and their a…

    Submitted 30 June, 2020; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: Accepted to ICML 2020