Zum Hauptinhalt springen

Showing 1–30 of 30 results for author: Bates, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.19605  [pdf, other

    stat.ME cs.LG

    Data-Adaptive Tradeoffs among Multiple Risks in Distribution-Free Prediction

    Authors: Drew T. Nguyen, Reese Pathak, Anastasios N. Angelopoulos, Stephen Bates, Michael I. Jordan

    Abstract: Decision-making pipelines are generally characterized by tradeoffs among various risk functions. It is often desirable to manage such tradeoffs in a data-adaptive manner. As we demonstrate, if this is done naively, state-of-the art uncertainty quantification methods can lead to significant violations of putative risk guarantees. To address this issue, we develop methods that permit valid control… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 27 pages, 10 figures

  2. arXiv:2402.01139  [pdf, other

    stat.ML cs.LG stat.ME

    Online conformal prediction with decaying step sizes

    Authors: Anastasios N. Angelopoulos, Rina Foygel Barber, Stephen Bates

    Abstract: We introduce a method for online conformal prediction with decaying step sizes. Like previous methods, ours possesses a retrospective guarantee of coverage for arbitrary sequences. However, unlike previous methods, we can simultaneously estimate a population quantile when it exists. Our theory and experiments indicate substantially improved practical properties: in particular, when the distributio… ▽ More

    Submitted 28 May, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  3. arXiv:2309.01837  [pdf, other

    cs.LG stat.ML

    Delegating Data Collection in Decentralized Machine Learning

    Authors: Nivasini Ananthakrishnan, Stephen Bates, Michael I. Jordan, Nika Haghtalab

    Abstract: Motivated by the emergence of decentralized machine learning (ML) ecosystems, we study the delegation of data collection. Taking the field of contract theory as our starting point, we design optimal and near-optimal contracts that deal with two fundamental information asymmetries that arise in decentralized ML: uncertainty in the assessment of model quality and uncertainty regarding the optimal pe… ▽ More

    Submitted 2 May, 2024; v1 submitted 4 September, 2023; originally announced September 2023.

  4. arXiv:2307.03748  [pdf, other

    stat.ME cs.GT cs.LG stat.ML

    Incentive-Theoretic Bayesian Inference for Collaborative Science

    Authors: Stephen Bates, Michael I. Jordan, Michael Sklar, Jake A. Soloff

    Abstract: Contemporary scientific research is a distributed, collaborative endeavor, carried out by teams of researchers, regulatory institutions, funding agencies, commercial partners, and scientific bodies, all interacting with each other and facing different incentives. To maintain scientific rigor, statistical methods should acknowledge this state of affairs. To this end, we study hypothesis testing whe… ▽ More

    Submitted 8 February, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

  5. arXiv:2306.09335  [pdf, other

    stat.ML cs.CV cs.LG stat.ME

    Class-Conditional Conformal Prediction with Many Classes

    Authors: Tiffany Ding, Anastasios N. Angelopoulos, Stephen Bates, Michael I. Jordan, Ryan J. Tibshirani

    Abstract: Standard conformal prediction methods provide a marginal coverage guarantee, which means that for a random test point, the conformal prediction set contains the true label with a user-specified probability. In many classification problems, we would like to obtain a stronger guarantee--that for test points of a specific class, the prediction set contains the true label with the same user-chosen pro… ▽ More

    Submitted 27 October, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

  6. arXiv:2305.14595  [pdf, other

    cs.LG cs.CY cs.GT

    Operationalizing Counterfactual Metrics: Incentives, Ranking, and Information Asymmetry

    Authors: Serena Wang, Stephen Bates, P. M. Aronow, Michael I. Jordan

    Abstract: From the social sciences to machine learning, it has been well documented that metrics to be optimized are not always aligned with social welfare. In healthcare, Dranove et al. (2003) showed that publishing surgery mortality metrics actually harmed the welfare of sicker patients by increasing provider selection behavior. We analyze the incentive misalignments that arise from such average treated o… ▽ More

    Submitted 29 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  7. arXiv:2303.09045  [pdf

    cs.LG cs.CR

    Web and Mobile Platforms for Managing Elections based on IoT And Machine Learning Algorithms

    Authors: G. M. I. K. Galagoda, W. M. C. A. Karunarathne, R. S. Bates, K. M. H. V. P. Gangathilaka, Kanishka Yapa, Erandika Gamage

    Abstract: The global pandemic situation has severely affected all countries. As a result, almost all countries had to adjust to online technologies to continue their processes. In addition, Sri Lanka is yearly spending ten billion on elections. We have examined a proper way of minimizing the cost of hosting these events online. To solve the existing problems and increase the time potency and cost reduction… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

    Journal ref: International Journal of Engineering Applied Sciences and Technology, 2022, Vol 7, No 7, 29-35

  8. arXiv:2301.09633  [pdf, other

    stat.ML cs.AI cs.LG q-bio.QM stat.ME

    Prediction-Powered Inference

    Authors: Anastasios N. Angelopoulos, Stephen Bates, Clara Fannjiang, Michael I. Jordan, Tijana Zrnic

    Abstract: Prediction-powered inference is a framework for performing valid statistical inference when an experimental dataset is supplemented with predictions from a machine-learning system. The framework yields simple algorithms for computing provably valid confidence intervals for quantities such as means, quantiles, and linear and logistic regression coefficients, without making any assumptions on the ma… ▽ More

    Submitted 9 November, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: Code is available at https://github.com/aangelopoulos/ppi_py

  9. arXiv:2211.05732  [pdf, other

    cs.GT cs.AI cs.LG econ.TH

    The Sample Complexity of Online Contract Design

    Authors: Banghua Zhu, Stephen Bates, Zhuoran Yang, Yixin Wang, Jiantao Jiao, Michael I. Jordan

    Abstract: We study the hidden-action principal-agent problem in an online setting. In each round, the principal posts a contract that specifies the payment to the agent based on each outcome. The agent then makes a strategic choice of action that maximizes her own utility, but the action is not directly observable by the principal. The principal observes the outcome and receives utility from the agent's cho… ▽ More

    Submitted 19 May, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

  10. arXiv:2209.14295  [pdf, other

    cs.LG cs.AI math.ST stat.ME stat.ML

    Conformal Prediction is Robust to Dispersive Label Noise

    Authors: Shai Feldman, Bat-Sheva Einbinder, Stephen Bates, Anastasios N. Angelopoulos, Asaf Gendler, Yaniv Romano

    Abstract: We study the robustness of conformal prediction, a powerful tool for uncertainty quantification, to label noise. Our analysis tackles both regression and classification problems, characterizing when and how it is possible to construct uncertainty sets that correctly cover the unobserved noiseless ground truth labels. We further extend our theory and formulate the requirements for correctly control… ▽ More

    Submitted 19 September, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

  11. arXiv:2208.02814  [pdf, other

    stat.ME cs.AI cs.LG math.ST stat.ML

    Conformal Risk Control

    Authors: Anastasios N. Angelopoulos, Stephen Bates, Adam Fisch, Lihua Lei, Tal Schuster

    Abstract: We extend conformal prediction to control the expected value of any monotone loss function. The algorithm generalizes split conformal prediction together with its coverage guarantee. Like conformal prediction, the conformal risk control procedure is tight up to an $\mathcal{O}(1/n)$ factor. We also introduce extensions of the idea to distribution shift, quantile risk control, multiple and adversar… ▽ More

    Submitted 29 April, 2023; v1 submitted 4 August, 2022; originally announced August 2022.

    Comments: Code available at https://github.com/aangelopoulos/conformal-risk

  12. arXiv:2207.10074  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Semantic uncertainty intervals for disentangled latent spaces

    Authors: Swami Sankaranarayanan, Anastasios N. Angelopoulos, Stephen Bates, Yaniv Romano, Phillip Isola

    Abstract: Meaningful uncertainty quantification in computer vision requires reasoning about semantic information -- say, the hair color of the person in a photo or the location of a car on the street. To this end, recent breakthroughs in generative modeling allow us to represent semantic information in disentangled latent spaces, but providing uncertainties on the semantic latent variables has remained chal… ▽ More

    Submitted 30 November, 2022; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: Accepted to NeurIPS 2022. Project page: https://swamiviv.github.io/semantic_uncertainty_intervals/

  13. arXiv:2207.01609  [pdf, other

    cs.IR cs.LG stat.ML

    Recommendation Systems with Distribution-Free Reliability Guarantees

    Authors: Anastasios N. Angelopoulos, Karl Krauth, Stephen Bates, Yixin Wang, Michael I. Jordan

    Abstract: When building recommendation systems, we seek to output a helpful set of items to the user. Under the hood, a ranking model predicts which of two candidate items is better, and we must distill these pairwise comparisons into the user-facing output. However, a learned ranking model is never perfect, so taking its predictions at face value gives no guarantee that the user-facing output is reliable.… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

  14. arXiv:2206.02757  [pdf, other

    cs.LG cs.AI stat.ML

    Robust Calibration with Multi-domain Temperature Scaling

    Authors: Yaodong Yu, Stephen Bates, Yi Ma, Michael I. Jordan

    Abstract: Uncertainty quantification is essential for the reliable deployment of machine learning models to high-stakes application domains. Uncertainty quantification is all the more challenging when training distribution and test distribution are different, even the distribution shifts are mild. Despite the ubiquity of distribution shifts in real-world applications, existing uncertainty quantification app… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

  15. arXiv:2205.09095  [pdf, other

    cs.LG stat.ML

    Achieving Risk Control in Online Learning Settings

    Authors: Shai Feldman, Liran Ringel, Stephen Bates, Yaniv Romano

    Abstract: To provide rigorous uncertainty quantification for online learning models, we develop a framework for constructing uncertainty sets that provably control risk -- such as coverage of confidence intervals, false negative rate, or F1 score -- in the online setting. This extends conformal prediction to apply to a larger class of online learning problems. Our method guarantees risk control at any user-… ▽ More

    Submitted 27 January, 2023; v1 submitted 18 May, 2022; originally announced May 2022.

  16. arXiv:2205.06812  [pdf, other

    cs.GT cs.LG cs.MA math.ST stat.ME

    Principal-Agent Hypothesis Testing

    Authors: Stephen Bates, Michael I. Jordan, Michael Sklar, Jake A. Soloff

    Abstract: Consider the relationship between a regulator (the principal) and an experimenter (the agent) such as a pharmaceutical company. The pharmaceutical company wishes to sell a drug for profit, whereas the regulator wishes to allow only efficacious drugs to be marketed. The efficacy of the drug is not known to the regulator, so the pharmaceutical company must run a costly trial to prove efficacy to the… ▽ More

    Submitted 15 April, 2024; v1 submitted 13 May, 2022; originally announced May 2022.

  17. arXiv:2202.05265  [pdf, other

    cs.LG cs.CV eess.IV q-bio.QM stat.ML

    Image-to-Image Regression with Distribution-Free Uncertainty Quantification and Applications in Imaging

    Authors: Anastasios N Angelopoulos, Amit P Kohli, Stephen Bates, Michael I Jordan, Jitendra Malik, Thayer Alshaabi, Srigokul Upadhyayula, Yaniv Romano

    Abstract: Image-to-image regression is an important learning task, used frequently in biological imaging. Current algorithms, however, do not generally offer statistical guarantees that protect against a model's mistakes and hallucinations. To address this, we develop uncertainty quantification techniques with rigorous statistical guarantees for image-to-image regression problems. In particular, we show how… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: Code available at https://github.com/aangelopoulos/im2im-uq

  18. arXiv:2202.03613  [pdf, other

    cs.LG q-bio.QM stat.ME

    Conformal prediction for the design problem

    Authors: Clara Fannjiang, Stephen Bates, Anastasios N. Angelopoulos, Jennifer Listgarten, Michael I. Jordan

    Abstract: Many applications of machine learning methods involve an iterative protocol in which data are collected, a model is trained, and then outputs of that model are used to choose what data to consider next. For example, one data-driven approach for designing proteins is to train a regression model to predict the fitness of protein sequences, then use it to propose new sequences believed to exhibit gre… ▽ More

    Submitted 31 May, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: for associated code, see https://github.com/clarafy/conformal-for-design

    Journal ref: Proc. Natl. Acad. Sci. 119 (43) e2204569119 (2022)

  19. arXiv:2201.10547  [pdf, other

    cs.LG cs.AI cs.MA

    Optimal Data Selection: An Online Distributed View

    Authors: Mariel Werner, Anastasios Angelopoulos, Stephen Bates, Michael I. Jordan

    Abstract: The blessing of ubiquitous data also comes with a curse: the communication, storage, and labeling of massive, mostly redundant datasets. We seek to solve this problem at its core, collecting only valuable data and throwing out the rest via submodular maximization. Specifically, we develop algorithms for the online and distributed version of the problem, where data selection occurs in an uncoordina… ▽ More

    Submitted 14 December, 2023; v1 submitted 25 January, 2022; originally announced January 2022.

  20. arXiv:2110.01052  [pdf, other

    cs.LG cs.AI cs.CV stat.ME stat.ML

    Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control

    Authors: Anastasios N. Angelopoulos, Stephen Bates, Emmanuel J. Candès, Michael I. Jordan, Lihua Lei

    Abstract: We introduce a framework for calibrating machine learning models so that their predictions satisfy explicit, finite-sample statistical guarantees. Our calibration algorithms work with any underlying model and (unknown) data-generating distribution and do not require model refitting. The framework addresses, among other examples, false discovery rate control in multi-label classification, intersect… ▽ More

    Submitted 29 September, 2022; v1 submitted 3 October, 2021; originally announced October 2021.

    Comments: Code available at https://github.com/aangelopoulos/ltt

  21. arXiv:2110.00816  [pdf, other

    cs.LG

    Calibrated Multiple-Output Quantile Regression with Representation Learning

    Authors: Shai Feldman, Stephen Bates, Yaniv Romano

    Abstract: We develop a method to generate predictive regions that cover a multivariate response variable with a user-specified probability. Our work is composed of two components. First, we use a deep generative model to learn a representation of the response that has a unimodal distribution. Existing multiple-output quantile regression approaches are effective in such cases, so we apply them on the learned… ▽ More

    Submitted 23 December, 2022; v1 submitted 2 October, 2021; originally announced October 2021.

  22. arXiv:2109.13412  [pdf, other

    cs.LG cs.CV

    Discriminative Attribution from Counterfactuals

    Authors: Nils Eckstein, Alexander S. Bates, Gregory S. X. E. Jefferis, Jan Funke

    Abstract: We present a method for neural network interpretability by combining feature attribution with counterfactual explanations to generate attribution maps that highlight the most discriminative features between pairs of classes. We show that this method can be used to quantitatively evaluate the performance of feature attribution methods in an objective manner, thus preventing potential observer bias.… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

  23. arXiv:2107.07511  [pdf, other

    cs.LG cs.AI math.ST stat.ME stat.ML

    A Gentle Introduction to Conformal Prediction and Distribution-Free Uncertainty Quantification

    Authors: Anastasios N. Angelopoulos, Stephen Bates

    Abstract: Black-box machine learning models are now routinely used in high-risk settings, like medical diagnostics, which demand uncertainty quantification to avoid consequential model failures. Conformal prediction is a user-friendly paradigm for creating statistically rigorous uncertainty sets/intervals for the predictions of such models. Critically, the sets are valid in a distribution-free sense: they p… ▽ More

    Submitted 7 December, 2022; v1 submitted 15 July, 2021; originally announced July 2021.

    Comments: Blog and tutorial video at http://angelopoulos.ai/blog/posts/gentle-intro/ ; Code is available at https://github.com/aangelopoulos/conformal-prediction

  24. arXiv:2106.12012  [pdf, other

    cs.LG cs.DC stat.ML

    Test-time Collective Prediction

    Authors: Celestine Mendler-Dünner, Wenshuo Guo, Stephen Bates, Michael I. Jordan

    Abstract: An increasingly common setting in machine learning involves multiple parties, each with their own data, who want to jointly make predictions on future test points. Agents wish to benefit from the collective expertise of the full set of agents to make better predictions than they would individually, but may not be willing to release their data or model parameters. In this work, we explore a decentr… ▽ More

    Submitted 22 June, 2021; originally announced June 2021.

  25. arXiv:2106.00394  [pdf, other

    cs.LG

    Improving Conditional Coverage via Orthogonal Quantile Regression

    Authors: Shai Feldman, Stephen Bates, Yaniv Romano

    Abstract: We develop a method to generate prediction intervals that have a user-specified coverage level across all regions of feature-space, a property called conditional coverage. A typical approach to this task is to estimate the conditional quantiles with quantile regression -- it is well-known that this leads to correct coverage in the large-sample limit, although it may not be accurate in finite sampl… ▽ More

    Submitted 2 October, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: 20 pages, 5 figures

  26. arXiv:2102.06202  [pdf, other

    cs.LG cs.AI cs.CR stat.ME stat.ML

    Private Prediction Sets

    Authors: Anastasios N. Angelopoulos, Stephen Bates, Tijana Zrnic, Michael I. Jordan

    Abstract: In real-world settings involving consequential decision-making, the deployment of machine learning systems generally requires both reliable uncertainty quantification and protection of individuals' privacy. We present a framework that treats these two desiderata jointly. Our framework is based on conformal prediction, a methodology that augments predictive models to return prediction sets that pro… ▽ More

    Submitted 3 March, 2024; v1 submitted 11 February, 2021; originally announced February 2021.

    Comments: Code available at https://github.com/aangelopoulos/private_prediction_sets

    Journal ref: Harvard Data Science Review, 4(2). 2022

  27. arXiv:2101.02703  [pdf, other

    cs.LG cs.AI cs.CV stat.ME stat.ML

    Distribution-Free, Risk-Controlling Prediction Sets

    Authors: Stephen Bates, Anastasios Angelopoulos, Lihua Lei, Jitendra Malik, Michael I. Jordan

    Abstract: While improving prediction accuracy has been the focus of machine learning in recent years, this alone does not suffice for reliable decision-making. Deploying learning systems in consequential settings also requires calibrating and communicating the uncertainty of predictions. To convey instance-wise uncertainty for prediction tasks, we show how to generate set-valued predictions from a black-box… ▽ More

    Submitted 4 August, 2021; v1 submitted 7 January, 2021; originally announced January 2021.

    Comments: Project website available at http://www.angelopoulos.ai/blog/posts/rcps/ and codebase available at https://github.com/aangelopoulos/rcps

  28. arXiv:2009.14193  [pdf, other

    cs.CV math.ST stat.ML

    Uncertainty Sets for Image Classifiers using Conformal Prediction

    Authors: Anastasios Angelopoulos, Stephen Bates, Jitendra Malik, Michael I. Jordan

    Abstract: Convolutional image classifiers can achieve high predictive accuracy, but quantifying their uncertainty remains an unresolved challenge, hindering their deployment in consequential settings. Existing uncertainty quantification techniques, such as Platt scaling, attempt to calibrate the network's probability estimates, but they do not have formal guarantees. We present an algorithm that modifies an… ▽ More

    Submitted 3 September, 2022; v1 submitted 29 September, 2020; originally announced September 2020.

    Comments: ICLR 2021 Spotlight, https://openreview.net/forum?id=eNdiU_DbM9 . Project website at https://people.eecs.berkeley.edu/~angelopoulos/blog/posts/conformal-classification/ . Codebase at https://github.com/aangelopoulos/conformal_classification

  29. arXiv:2006.04292  [pdf, other

    stat.ML cs.LG stat.ME

    Achieving Equalized Odds by Resampling Sensitive Attributes

    Authors: Yaniv Romano, Stephen Bates, Emmanuel J. Candès

    Abstract: We present a flexible framework for learning predictive models that approximately satisfy the equalized odds notion of fairness. This is achieved by introducing a general discrepancy functional that rigorously quantifies violations of this criterion. This differentiable functional is used as a penalty driving the model parameters towards equalized odds. To rigorously evaluate fitted models, we dev… ▽ More

    Submitted 7 June, 2020; originally announced June 2020.

    Comments: 14 pages, 4 figures

  30. arXiv:1704.04760  [pdf

    cs.AR cs.LG cs.NE

    In-Datacenter Performance Analysis of a Tensor Processing Unit

    Authors: Norman P. Jouppi, Cliff Young, Nishant Patil, David Patterson, Gaurav Agrawal, Raminder Bajwa, Sarah Bates, Suresh Bhatia, Nan Boden, Al Borchers, Rick Boyle, Pierre-luc Cantin, Clifford Chao, Chris Clark, Jeremy Coriell, Mike Daley, Matt Dau, Jeffrey Dean, Ben Gelb, Tara Vazir Ghaemmaghami, Rajendra Gottipati, William Gulland, Robert Hagmann, C. Richard Ho, Doug Hogberg , et al. (50 additional authors not shown)

    Abstract: Many architects believe that major improvements in cost-energy-performance must now come from domain-specific hardware. This paper evaluates a custom ASIC---called a Tensor Processing Unit (TPU)---deployed in datacenters since 2015 that accelerates the inference phase of neural networks (NN). The heart of the TPU is a 65,536 8-bit MAC matrix multiply unit that offers a peak throughput of 92 TeraOp… ▽ More

    Submitted 16 April, 2017; originally announced April 2017.

    Comments: 17 pages, 11 figures, 8 tables. To appear at the 44th International Symposium on Computer Architecture (ISCA), Toronto, Canada, June 24-28, 2017