Zum Hauptinhalt springen

Showing 1–13 of 13 results for author: Vinayak, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.08469  [pdf, other

    cs.LG

    PAL: Pluralistic Alignment Framework for Learning from Heterogeneous Preferences

    Authors: Daiwei Chen, Yi Chen, Aniket Rege, Ramya Korlakai Vinayak

    Abstract: Large foundation models pretrained on raw web-scale data are not readily deployable without additional step of extensive alignment to human preferences. Such alignment is typically done by collecting large amounts of pairwise comparisons from humans ("Do you prefer output A or B?") and learning a reward model or a policy with the Bradley-Terry-Luce (BTL) model as a proxy for a human's underlying i… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 22 pages, 14 figures, 5 tables

  2. arXiv:2406.01566  [pdf, other

    cs.DC cs.CL cs.LG

    Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs

    Authors: Yixuan Mei, Yonghao Zhuang, Xupeng Miao, Juncheng Yang, Zhihao Jia, Rashmi Vinayak

    Abstract: This paper introduces Helix, a distributed system for high-throughput, low-latency large language model (LLM) serving on heterogeneous GPU clusters. A key idea behind Helix is to formulate inference computation of LLMs over heterogeneous GPUs and network connections as a max-flow problem for a directed, weighted graph, whose nodes represent GPU instances and edges capture both GPU and network hete… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  3. arXiv:2404.16954  [pdf, other

    cs.LG cs.AI stat.ML

    Taming False Positives in Out-of-Distribution Detection with Human Feedback

    Authors: Harit Vishwakarma, Heguang Lin, Ramya Korlakai Vinayak

    Abstract: Robustness to out-of-distribution (OOD) samples is crucial for safely deploying machine learning models in the open world. Recent works have focused on designing scoring functions to quantify OOD uncertainty. Setting appropriate thresholds for these scoring functions for OOD detection is challenging as OOD samples are often unavailable up front. Typically, thresholds are set to achieve a desired t… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: Appeared in the 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024)

    Journal ref: PMLR 238:1486-1494, 2024

  4. arXiv:2404.16188  [pdf, other

    cs.LG cs.AI stat.ML

    Pearls from Pebbles: Improved Confidence Functions for Auto-labeling

    Authors: Harit Vishwakarma, Reid, Chen, Sui Jiet Tay, Satya Sai Srinath Namburi, Frederic Sala, Ramya Korlakai Vinayak

    Abstract: Auto-labeling is an important family of techniques that produce labeled training sets with minimum manual labeling. A prominent variant, threshold-based auto-labeling (TBAL), works by finding a threshold on a model's confidence scores above which it can accurately label unlabeled data points. However, many models are known to produce overconfident scores, leading to poor TBAL performance. While a… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  5. arXiv:2403.19629  [pdf, other

    cs.LG stat.ML

    Metric Learning from Limited Pairwise Preference Comparisons

    Authors: Zhi Wang, Geelon So, Ramya Korlakai Vinayak

    Abstract: We study metric learning from preference comparisons under the ideal point model, in which a user prefers an item over another if it is closer to their latent ideal item. These items are embedded into $\mathbb{R}^d$ equipped with an unknown Mahalanobis distance shared across users. While recent work shows that it is possible to simultaneously recover the metric and ideal items given… ▽ More

    Submitted 12 July, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: The 40th Conference on Uncertainty in Artificial Intelligence (UAI-2024)

  6. arXiv:2309.07277  [pdf, ps, other

    cs.CV cs.LG

    Limitations of Face Image Generation

    Authors: Harrison Rosenberg, Shimaa Ahmed, Guruprasad V Ramesh, Ramya Korlakai Vinayak, Kassem Fawaz

    Abstract: Text-to-image diffusion models have achieved widespread popularity due to their unprecedented image generation capability. In particular, their ability to synthesize and modify human faces has spurred research into using generated face images in both training data augmentation and model performance assessments. In this paper, we study the efficacy and shortcomings of generative models in the conte… ▽ More

    Submitted 21 December, 2023; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: Accepted to The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024)

  7. arXiv:2211.12620  [pdf, other

    cs.LG cs.AI stat.ML

    Promises and Pitfalls of Threshold-based Auto-labeling

    Authors: Harit Vishwakarma, Heguang Lin, Frederic Sala, Ramya Korlakai Vinayak

    Abstract: Creating large-scale high-quality labeled datasets is a major bottleneck in supervised machine learning workflows. Threshold-based auto-labeling (TBAL), where validation data obtained from humans is used to find a confidence threshold above which the data is machine-labeled, reduces reliance on manual annotation. TBAL is emerging as a widely-used solution in practice. Given the long shelf-life and… ▽ More

    Submitted 21 February, 2024; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2023 (Spotlight)

    Journal ref: Thirty Seventh Conference on Neural Information Processing Systems (NeurIPS 2023)

  8. arXiv:2207.03609  [pdf, other

    stat.ML cs.AI cs.LG

    One for All: Simultaneous Metric and Preference Learning over Multiple Users

    Authors: Gregory Canal, Blake Mason, Ramya Korlakai Vinayak, Robert Nowak

    Abstract: This paper investigates simultaneous preference and metric learning from a crowd of respondents. A set of items represented by $d$-dimensional feature vectors and paired comparisons of the form ``item $i$ is preferable to item $j$'' made by each user is given. Our model jointly learns a distance metric that characterizes the crowd's general measure of item similarities along with a latent ideal po… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

  9. arXiv:2106.03022  [pdf, other

    stat.ME cs.LG stat.ML

    Fisher-Pitman permutation tests based on nonparametric Poisson mixtures with application to single cell genomics

    Authors: Zhen Miao, Weihao Kong, Ramya Korlakai Vinayak, Wei Sun, Fang Han

    Abstract: This paper investigates the theoretical and empirical performance of Fisher-Pitman-type permutation tests for assessing the equality of unknown Poisson mixture distributions. Building on nonparametric maximum likelihood estimators (NPMLEs) of the mixing distribution, these tests are theoretically shown to be able to adapt to complicated unspecified structures of count data and also consistent agai… ▽ More

    Submitted 5 June, 2021; originally announced June 2021.

    Comments: 52 pages

  10. arXiv:2002.07297  [pdf, other

    stat.ML cs.LG

    Estimating the number and effect sizes of non-null hypotheses

    Authors: Jennifer Brennan, Ramya Korlakai Vinayak, Kevin Jamieson

    Abstract: We study the problem of estimating the distribution of effect sizes (the mean of the test statistic under the alternate hypothesis) in a multiple testing setting. Knowing this distribution allows us to calculate the power (type II error) of any experimental design. We show that it is possible to estimate this distribution using an inexpensive pilot experiment, which takes significantly fewer sampl… ▽ More

    Submitted 24 July, 2020; v1 submitted 17 February, 2020; originally announced February 2020.

    Comments: ICML 2020

  11. arXiv:1911.12568  [pdf, other

    cs.LG math.ST stat.ML

    Optimal Estimation of Change in a Population of Parameters

    Authors: Ramya Korlakai Vinayak, Weihao Kong, Sham M. Kakade

    Abstract: Paired estimation of change in parameters of interest over a population plays a central role in several application domains including those in the social sciences, epidemiology, medicine and biology. In these domains, the size of the population under study is often very large, however, the number of observations available per individual in the population is very small (\emph{sparse observations})… ▽ More

    Submitted 28 November, 2019; originally announced November 2019.

  12. arXiv:1904.03257  [pdf, ps, other

    cs.LG cs.DB cs.DC cs.SE stat.ML

    MLSys: The New Frontier of Machine Learning Systems

    Authors: Alexander Ratner, Dan Alistarh, Gustavo Alonso, David G. Andersen, Peter Bailis, Sarah Bird, Nicholas Carlini, Bryan Catanzaro, Jennifer Chayes, Eric Chung, Bill Dally, Jeff Dean, Inderjit S. Dhillon, Alexandros Dimakis, Pradeep Dubey, Charles Elkan, Grigori Fursin, Gregory R. Ganger, Lise Getoor, Phillip B. Gibbons, Garth A. Gibson, Joseph E. Gonzalez, Justin Gottschlich, Song Han, Kim Hazelwood , et al. (44 additional authors not shown)

    Abstract: Machine learning (ML) techniques are enjoying rapidly increasing adoption. However, designing and implementing the systems that support ML models in real-world deployments remains a significant obstacle, in large part due to the radically different development and deployment profile of modern ML methods, and the range of practical concerns that come with broader adoption. We propose to foster a ne… ▽ More

    Submitted 1 December, 2019; v1 submitted 29 March, 2019; originally announced April 2019.

  13. arXiv:1902.04553  [pdf, ps, other

    math.ST cs.LG stat.ML

    Maximum Likelihood Estimation for Learning Populations of Parameters

    Authors: Ramya Korlakai Vinayak, Weihao Kong, Gregory Valiant, Sham M. Kakade

    Abstract: Consider a setting with $N$ independent individuals, each with an unknown parameter, $p_i \in [0, 1]$ drawn from some unknown distribution $P^\star$. After observing the outcomes of $t$ independent Bernoulli trials, i.e., $X_i \sim \text{Binomial}(t, p_i)$ per individual, our objective is to accurately estimate $P^\star$. This problem arises in numerous domains, including the social sciences, psyc… ▽ More

    Submitted 12 February, 2019; originally announced February 2019.