Zum Hauptinhalt springen

Showing 1–50 of 88 results for author: Jaillet, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.10066  [pdf, ps, other

    cs.GT econ.TH math.OC

    Near-Optimal Mechanisms for Resource Allocation Without Monetary Transfers

    Authors: Moise Blanchard, Patrick Jaillet

    Abstract: We study the problem in which a central planner sequentially allocates a single resource to multiple strategic agents using their utility reports at each round, but without using any monetary transfers. We consider general agent utility distributions and two standard settings: a finite horizon $T$ and an infinite horizon with $γ$ discounts. We provide general tools to characterize the convergence… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  2. arXiv:2407.17112  [pdf, other

    cs.LG cs.AI stat.ML

    Neural Dueling Bandits

    Authors: Arun Verma, Zhongxiang Dai, Xiaoqiang Lin, Patrick Jaillet, Bryan Kian Hsiang Low

    Abstract: Contextual dueling bandit is used to model the bandit problems, where a learner's goal is to find the best arm for a given context using observed noisy preference feedback over the selected arms for the past contexts. However, existing algorithms assume the reward function is linear, which can be complex and non-linear in many real-life applications like online recommendations or ranking web searc… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: Accepted at ICML 2024 Workshop on Foundations of Reinforcement Learning and Control

  3. arXiv:2406.03682  [pdf, other

    cs.LG

    A Universal Class of Sharpness-Aware Minimization Algorithms

    Authors: Behrooz Tahmasebi, Ashkan Soleymani, Dara Bahri, Stefanie Jegelka, Patrick Jaillet

    Abstract: Recently, there has been a surge in interest in developing optimization algorithms for overparameterized models as achieving generalization is believed to require algorithms with suitable biases. This interest centers on minimizing sharpness of the original loss function; the Sharpness-Aware Minimization (SAM) algorithm has proven effective. However, most literature only considers a few sharpness… ▽ More

    Submitted 10 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: ICML 2024. Code is available at http://github.com/dbahri/universal_sam

  4. arXiv:2405.17346  [pdf, other

    cs.LG cs.AI

    Prompt Optimization with Human Feedback

    Authors: Xiaoqiang Lin, Zhongxiang Dai, Arun Verma, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low

    Abstract: Large language models (LLMs) have demonstrated remarkable performances in various tasks. However, the performance of LLMs heavily depends on the input prompt, which has given rise to a number of recent works on prompt optimization. However, previous works often require the availability of a numeric score to assess the quality of every prompt. Unfortunately, when a human user interacts with a black… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Preprint, 18 pages

  5. arXiv:2405.16122  [pdf, other

    cs.AI cs.CL cs.LG stat.ML

    Prompt Optimization with EASE? Efficient Ordering-aware Automated Selection of Exemplars

    Authors: Zhaoxuan Wu, Xiaoqiang Lin, Zhongxiang Dai, Wenyang Hu, Yao Shu, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low

    Abstract: Large language models (LLMs) have shown impressive capabilities in real-world applications. The capability of in-context learning (ICL) allows us to adapt an LLM to downstream tasks by including input-label exemplars in the prompt without model fine-tuning. However, the quality of these exemplars in the prompt greatly impacts performance, highlighting the need for an effective automated exemplar s… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 23 pages, 1 figure, 23 tables

  6. arXiv:2404.01676  [pdf, other

    cs.LG

    Incentives in Private Collaborative Machine Learning

    Authors: Rachael Hwee Ling Sim, Yehong Zhang, Trong Nghia Hoang, Xinyi Xu, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Collaborative machine learning involves training models on data from multiple parties but must incentivize their participation. Existing data valuation methods fairly value and reward each party based on shared data or model parameters but neglect the privacy risks involved. To address this, we introduce differential privacy (DP) as an incentive. Each party can select its required DP guarantee and… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted to NeurIPS 2023

  7. arXiv:2402.08533  [pdf, other

    cs.GT math.OC

    Grace Period is All You Need: Individual Fairness without Revenue Loss in Revenue Management

    Authors: Patrick Jaillet, Chara Podimata, Zijie Zhou

    Abstract: Imagine you and a friend purchase identical items at a store, yet only your friend received a discount. Would your friend's discount make you feel unfairly treated by the store? And would you be less willing to purchase from that store again in the future? Based on a large-scale online survey that we ran on Prolific, it turns out that the answers to the above questions are positive. Motivated by t… ▽ More

    Submitted 17 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  8. arXiv:2312.13130  [pdf, ps, other

    stat.ML cs.LG

    Distribution-Dependent Rates for Multi-Distribution Learning

    Authors: Rafael Hanashiro, Patrick Jaillet

    Abstract: To address the needs of modeling uncertainty in sensitive machine learning applications, the setup of distributionally robust optimization (DRO) seeks good performance uniformly across a variety of tasks. The recent multi-distribution learning (MDL) framework tackles this objective in a dynamic interaction with the environment, where the learner has sampling access to each target distribution. Dra… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

  9. arXiv:2312.04073  [pdf, other

    cs.GT cs.MA

    Information Design for Hybrid Work under Infectious Disease Transmission Risk

    Authors: Sohil Shah, Saurabh Amin, Patrick Jaillet

    Abstract: We study a planner's provision of information to manage workplace occupancy when strategic workers (agents) face risk of infectious disease transmission. The planner implements an information mechanism to signal information about the underlying risk of infection at the workplace. Agents update their belief over the risk parameter using this information and choose to work in-person or remotely. We… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  10. arXiv:2311.06012  [pdf, other

    cs.LG

    Doubly Robust Structure Identification from Temporal Data

    Authors: Emmanouil Angelis, Francesco Quinzan, Ashkan Soleymani, Patrick Jaillet, Stefan Bauer

    Abstract: Learning the causes of time-series data is a fundamental task in many applications, spanning from finance to earth sciences or bio-medical applications. Common approaches for this task are based on vector auto-regression, and they do not take into account unknown confounding between potential causes. However, in settings with many potential causes and noisy data, these approaches may be substantia… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

  11. arXiv:2311.01195  [pdf, other

    cs.LG cs.AI

    Batch Bayesian Optimization for Replicable Experimental Design

    Authors: Zhongxiang Dai, Quoc Phong Nguyen, Sebastian Shenghong Tay, Daisuke Urano, Richalynn Leong, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Many real-world experimental design problems (a) evaluate multiple experimental conditions in parallel and (b) replicate each condition multiple times due to large and heteroscedastic observation noise. Given a fixed total budget, this naturally induces a trade-off between evaluating more unique conditions while replicating each of them fewer times vs. evaluating fewer unique conditions and replic… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted to NeurIPS 2023

  12. arXiv:2310.07884  [pdf, ps, other

    cs.DS

    Secretary Problems with Random Number of Candidates: How Prior Distributional Information Helps

    Authors: Junhui Zhang, Patrick Jaillet

    Abstract: We study variants of the secretary problem, where $N$, the number of candidates, is a random variable, and the decision maker wants to maximize the probability of success -- picking the largest number among the $N$ candidates -- using only the relative ranks of the candidates revealed so far. We consider three forms of prior information about $\mathbf p$, the probability distribution of $N$. In… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  13. arXiv:2310.05373  [pdf, other

    cs.LG cs.AI

    Quantum Bayesian Optimization

    Authors: Zhongxiang Dai, Gregory Kang Ruey Lau, Arun Verma, Yao Shu, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Kernelized bandits, also known as Bayesian optimization (BO), has been a prevalent method for optimizing complicated black-box reward functions. Various BO algorithms have been theoretically shown to enjoy upper bounds on their cumulative regret which are sub-linear in the number T of iterations, and a regret lower bound of Omega(sqrt(T)) has been derived which represents the unavoidable regrets f… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  14. arXiv:2310.02905  [pdf, other

    cs.LG cs.AI cs.CL

    Use Your INSTINCT: INSTruction optimization for LLMs usIng Neural bandits Coupled with Transformers

    Authors: Xiaoqiang Lin, Zhaoxuan Wu, Zhongxiang Dai, Wenyang Hu, Yao Shu, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low

    Abstract: Large language models (LLMs) have shown remarkable instruction-following capabilities and achieved impressive performances in various applications. However, the performances of LLMs depend heavily on the instructions given to them, which are typically manually tuned with substantial human efforts. Recent work has used the query-efficient Bayesian optimization (BO) algorithm to automatically optimi… ▽ More

    Submitted 23 June, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

    Comments: Accepted to ICML 2024

  15. arXiv:2307.03994  [pdf, other

    cs.GT econ.TH

    Market Design for Dynamic Pricing and Pooling in Capacitated Networks

    Authors: Saurabh Amin, Patrick Jaillet, Haripriya Pulyassary, Manxi Wu

    Abstract: We study a market mechanism that sets edge prices to incentivize strategic agents to organize trips that efficiently share limited network capacity. This market allows agents to form groups to share trips, make decisions on departure times and route choices, and make payments to cover edge prices and other costs. We develop a new approach to analyze the existence and computation of market equilibr… ▽ More

    Submitted 1 November, 2023; v1 submitted 8 July, 2023; originally announced July 2023.

  16. arXiv:2306.12282  [pdf, other

    cs.DS cs.LG math.OC

    Online Resource Allocation with Convex-set Machine-Learned Advice

    Authors: Negin Golrezaei, Patrick Jaillet, Zijie Zhou

    Abstract: Decision-makers often have access to a machine-learned prediction about demand, referred to as advice, which can potentially be utilized in online decision-making processes for resource allocation. However, exploiting such advice poses challenges due to its potential inaccuracy. To address this issue, we propose a framework that enhances online resource allocation decisions with potentially unreli… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: 74 pages, 5 figures

  17. arXiv:2306.10096  [pdf, ps, other

    math.OC cs.CC cs.DS cs.LG stat.ML

    Memory-Constrained Algorithms for Convex Optimization via Recursive Cutting-Planes

    Authors: Moïse Blanchard, Junhui Zhang, Patrick Jaillet

    Abstract: We propose a family of recursive cutting-plane algorithms to solve feasibility problems with constrained memory, which can also be used for first-order convex optimization. Precisely, in order to find a point within a ball of radius $ε$ with a separation oracle in dimension $d$ -- or to minimize $1$-Lipschitz convex functions to accuracy $ε$ over the unit ball -- our algorithms use… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

  18. arXiv:2306.07024  [pdf, other

    cs.LG stat.ME

    DRCFS: Doubly Robust Causal Feature Selection

    Authors: Francesco Quinzan, Ashkan Soleymani, Patrick Jaillet, Cristian R. Rojas, Stefan Bauer

    Abstract: Knowing the features of a complex system that are highly relevant to a particular target variable is of fundamental interest in many areas of science. Existing approaches are often limited to linear settings, sometimes lack guarantees, and in most cases, do not scale to the problem at hand, in particular to images. We propose DRCFS, a doubly robust feature selection method for identifying the caus… ▽ More

    Submitted 5 July, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

  19. arXiv:2302.07186  [pdf, ps, other

    stat.ML cs.LG math.ST

    Adversarial Rewards in Universal Learning for Contextual Bandits

    Authors: Moise Blanchard, Steve Hanneke, Patrick Jaillet

    Abstract: We study the fundamental limits of learning in contextual bandits, where a learner's rewards depend on their actions and a known context, which extends the canonical multi-armed bandit to the case where side-information is available. We are interested in universally consistent algorithms, which achieve sublinear regret compared to any measurable fixed policy, without any function class restriction… ▽ More

    Submitted 12 June, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

  20. arXiv:2302.06916  [pdf, other

    cs.LG stat.ML

    Effective Dimension in Bandit Problems under Censorship

    Authors: Gauthier Guinet, Saurabh Amin, Patrick Jaillet

    Abstract: In this paper, we study both multi-armed and contextual bandit problems in censored environments. Our goal is to estimate the performance loss due to censorship in the context of classical algorithms designed for uncensored environments. Our main contributions include the introduction of a broad class of censorship models and their analysis in terms of the effective dimension of the problem -- a n… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: 45 pages, 5 figures, NeurIPS 2022

    Journal ref: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  21. arXiv:2302.04963  [pdf, ps, other

    cs.LG cs.CC cs.DS math.OC stat.ML

    Quadratic Memory is Necessary for Optimal Query Complexity in Convex Optimization: Center-of-Mass is Pareto-Optimal

    Authors: Moïse Blanchard, Junhui Zhang, Patrick Jaillet

    Abstract: We give query complexity lower bounds for convex optimization and the related feasibility problem. We show that quadratic memory is necessary to achieve the optimal oracle complexity for first-order convex optimization. In particular, this shows that center-of-mass cutting-planes algorithms in dimension $d$ which use $\tilde O(d^2)$ memory and $\tilde O(d)$ queries are Pareto-optimal for both conv… ▽ More

    Submitted 18 May, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

  22. arXiv:2302.01523  [pdf, other

    cs.GT cs.LG

    Multi-channel Autobidding with Budget and ROI Constraints

    Authors: Yuan Deng, Negin Golrezaei, Patrick Jaillet, Jason Cheuk Nam Liang, Vahab Mirrokni

    Abstract: In digital online advertising, advertisers procure ad impressions simultaneously on multiple platforms, or so-called channels, such as Google Ads, Meta Ads Manager, etc., each of which consists of numerous ad auctions. We study how an advertiser maximizes total conversion (e.g. ad clicks) while satisfying aggregate return-on-investment (ROI) and budget constraints across all channels. In practice,… ▽ More

    Submitted 14 June, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

  23. arXiv:2301.00241  [pdf, ps, other

    stat.ML cs.LG math.ST

    Contextual Bandits and Optimistically Universal Learning

    Authors: Moise Blanchard, Steve Hanneke, Patrick Jaillet

    Abstract: We consider the contextual bandit problem on general action and context spaces, where the learner's rewards depend on their selected actions and an observable context. This generalizes the standard multi-armed bandit to the case where side information is available, e.g., patients' records or customers' history, which allows for personalized treatment. We focus on consistency -- vanishing regret co… ▽ More

    Submitted 31 December, 2022; originally announced January 2023.

  24. arXiv:2211.11065  [pdf, ps, other

    cs.DM math.PR

    Additional Results and Extensions for the paper "Probabilistic bounds on the $k-$Traveling Salesman Problem and the Traveling Repairman Problem''

    Authors: Moïse Blanchard, Alexandre Jacquillat, Patrick Jaillet

    Abstract: This technical report provides additional results for the main paper ``Probabilistic bounds on the $k-$Traveling Salesman Problem ($k-$TSP) and the Traveling Repairman Problem (TRP)''. For the $k-$TSP, we extend the probabilistic bounds derived in the main paper to the case of distributions with general densities. For the TRP, we propose a utility-based notion of fairness and derive constant-facto… ▽ More

    Submitted 20 November, 2022; originally announced November 2022.

  25. arXiv:2211.11063  [pdf, other

    cs.DM math.PR

    Probabilistic bounds on the $k-$Traveling Salesman Problem and the Traveling Repairman Problem

    Authors: Moïse Blanchard, Alexandre Jacquillat, Patrick Jaillet

    Abstract: The $k-$traveling salesman problem ($k$-TSP) seeks a tour of minimal length that visits a subset of $k\leq n$ points. The traveling repairman problem (TRP) seeks a complete tour with minimal latency. This paper provides constant-factor probabilistic approximations of both problems. We first show that the optimal length of the $k$-TSP path grows at a rate of… ▽ More

    Submitted 20 November, 2022; originally announced November 2022.

  26. arXiv:2210.06850  [pdf, other

    cs.LG cs.AI

    Sample-Then-Optimize Batch Neural Thompson Sampling

    Authors: Zhongxiang Dai, Yao Shu, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Bayesian optimization (BO), which uses a Gaussian process (GP) as a surrogate to model its objective function, is popular for black-box optimization. However, due to the limitations of GPs, BO underperforms in some problems such as those with categorical, high-dimensional or image inputs. To this end, recent works have used the highly expressive neural networks (NNs) as the surrogate model and der… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted to 36th Conference on Neural Information Processing Systems (NeurIPS 2022), Extended version with proofs and additional experimental details and results, 30 pages

  27. arXiv:2209.04748  [pdf, other

    cs.GT

    Individual Welfare Guarantees in the Autobidding World with Machine-learned Advice

    Authors: Yuan Deng, Negin Golrezaei, Patrick Jaillet, Jason Cheuk Nam Liang, Vahab Mirrokni

    Abstract: Online advertising channels have commonly focused on maximizing total advertiser value (or welfare) to enhance long-run retention and channel healthiness. Previous literature has studied auction design by incorporating machine learning predictions on advertiser values (also known as machine-learned advice) through various forms to improve total welfare. Yet, such improvements could come at the cos… ▽ More

    Submitted 14 June, 2023; v1 submitted 10 September, 2022; originally announced September 2022.

  28. arXiv:2208.09611  [pdf, other

    cs.LG

    Weighted Maximum Entropy Inverse Reinforcement Learning

    Authors: The Viet Bui, Tien Mai, Patrick Jaillet

    Abstract: We study inverse reinforcement learning (IRL) and imitation learning (IM), the problems of recovering a reward or policy function from expert's demonstrated trajectories. We propose a new way to improve the learning process by adding a weight function to the maximum entropy framework, with the motivation of having the ability to learn and recover the stochasticity (or the bounded rationality) of t… ▽ More

    Submitted 20 August, 2022; originally announced August 2022.

  29. arXiv:2206.06872  [pdf, other

    cs.LG cs.AI

    On Provably Robust Meta-Bayesian Optimization

    Authors: Zhongxiang Dai, Yizhou Chen, Haibin Yu, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Bayesian optimization (BO) has become popular for sequential optimization of black-box functions. When BO is used to optimize a target function, we often have access to previous evaluations of potentially related functions. This begs the question as to whether we can leverage these previous experiences to accelerate the current BO task through meta-learning (meta-BO), while ensuring robustness aga… ▽ More

    Submitted 15 June, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: Accepted to 38th Conference on Uncertainty in Artificial Intelligence (UAI 2022), Extended version with proofs and additional experimental details and results, 31 pages

  30. arXiv:2205.14309  [pdf, other

    cs.LG cs.AI

    Federated Neural Bandits

    Authors: Zhongxiang Dai, Yao Shu, Arun Verma, Flint Xiaofeng Fan, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Recent works on neural contextual bandits have achieved compelling performances due to their ability to leverage the strong representation power of neural networks (NNs) for reward prediction. Many applications of contextual bandits involve multiple agents who collaborate without sharing raw observations, thus giving rise to the setting of federated contextual bandits. Existing works on federated… ▽ More

    Submitted 28 February, 2023; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: ICLR 2023. Code: https://github.com/daizhongxiang/Federated-Neural-Bandits

  31. arXiv:2205.02732  [pdf, other

    cs.MA cs.GT

    Optimal Information Provision for Strategic Hybrid Workers

    Authors: Sohil Shah, Saurabh Amin, Patrick Jaillet

    Abstract: We study the problem of information provision by a strategic central planner who can publicly signal about an uncertain infectious risk parameter. Signalling leads to an updated public belief over the parameter, and agents then make equilibrium choices on whether to work remotely or in-person. The planner maintains a set of desirable outcomes for each realization of the uncertain parameter and see… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

  32. arXiv:2203.05067  [pdf, ps, other

    cs.LG stat.ML

    Universal Regression with Adversarial Responses

    Authors: Moïse Blanchard, Patrick Jaillet

    Abstract: We provide algorithms for regression with adversarial responses under large classes of non-i.i.d. instance sequences, on general separable metric spaces, with provably minimal assumptions. We also give characterizations of learnability in this regression context. We consider universal consistency which asks for strong consistency of a learner without restrictions on the value responses. Our analys… ▽ More

    Submitted 9 June, 2023; v1 submitted 9 March, 2022; originally announced March 2022.

  33. arXiv:2202.13597  [pdf, other

    cs.LG stat.ML

    Rectified Max-Value Entropy Search for Bayesian Optimization

    Authors: Quoc Phong Nguyen, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Although the existing max-value entropy search (MES) is based on the widely celebrated notion of mutual information, its empirical performance can suffer due to two misconceptions whose implications on the exploration-exploitation trade-off are investigated in this paper. These issues are essential in the development of future acquisition functions and the improvement of the existing ones as they… ▽ More

    Submitted 28 February, 2022; originally announced February 2022.

  34. arXiv:2112.15364  [pdf, other

    cs.LG cs.AI

    Robust Entropy-regularized Markov Decision Processes

    Authors: Tien Mai, Patrick Jaillet

    Abstract: Stochastic and soft optimal policies resulting from entropy-regularized Markov decision processes (ER-MDP) are desirable for exploration and imitation learning applications. Motivated by the fact that such policies are sensitive with respect to the state transition probabilities, and the estimation of these probabilities may be inaccurate, we study a robust version of the ER-MDP model, where the s… ▽ More

    Submitted 31 December, 2021; originally announced December 2021.

  35. arXiv:2110.14153  [pdf, other

    cs.LG cs.CR

    Differentially Private Federated Bayesian Optimization with Distributed Exploration

    Authors: Zhongxiang Dai, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Bayesian optimization (BO) has recently been extended to the federated learning (FL) setting by the federated Thompson sampling (FTS) algorithm, which has promising applications such as federated hyperparameter tuning. However, FTS is not equipped with a rigorous privacy guarantee which is an important consideration in FL. Recent works have incorporated differential privacy (DP) into the training… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: Accepted to 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Extended version with proofs and additional experimental details and results, 29 pages

  36. arXiv:2107.14465  [pdf, other

    cs.LG cs.AI stat.ML

    Trusted-Maximizers Entropy Search for Efficient Bayesian Optimization

    Authors: Quoc Phong Nguyen, Zhaoxuan Wu, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Information-based Bayesian optimization (BO) algorithms have achieved state-of-the-art performance in optimizing a black-box objective function. However, they usually require several approximations or simplifying assumptions (without clearly understanding their effects on the BO performance) and/or their generalization to batch BO is computationally unwieldy, especially with an increasing batch si… ▽ More

    Submitted 30 July, 2021; originally announced July 2021.

    Comments: Published as a conference paper at UAI 2021

  37. arXiv:2107.07725  [pdf, other

    cs.GT

    Learning to Price against a Budget and ROI Constrained Buyer

    Authors: Negin Golrezaei, Patrick Jaillet, Jason Cheuk Nam Liang, Vahab Mirrokni

    Abstract: Internet advertisers (buyers) repeatedly procure ad impressions from ad platforms (sellers) with the aim to maximize total conversion (i.e. ad value) while respecting both budget and return-on-investment (ROI) constraints for efficient utilization of limited monetary resources. Facing such a constrained buyer who aims to learn her optimal strategy to acquire impressions, we study from a seller's p… ▽ More

    Submitted 7 February, 2023; v1 submitted 16 July, 2021; originally announced July 2021.

  38. arXiv:2105.06126  [pdf, other

    cs.LG

    Value-at-Risk Optimization with Gaussian Processes

    Authors: Quoc Phong Nguyen, Zhongxiang Dai, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Value-at-risk (VaR) is an established measure to assess risks in critical real-world applications with random environmental factors. This paper presents a novel VaR upper confidence bound (V-UCB) algorithm for maximizing the VaR of a black-box objective function with the first no-regret guarantee. To realize this, we first derive a confidence bound of VaR and then prove the existence of values of… ▽ More

    Submitted 13 May, 2021; originally announced May 2021.

  39. arXiv:2104.08472  [pdf, other

    cs.LG

    Convolutional Normalizing Flows for Deep Gaussian Processes

    Authors: Haibin Yu, Dapeng Liu, Yizhou Chen, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Deep Gaussian processes (DGPs), a hierarchical composition of GP models, have successfully boosted the expressive power of their single-layer counterpart. However, it is impossible to perform exact inference in DGPs, which has motivated the recent development of variational inference-based methods. Unfortunately, either these methods yield a biased posterior belief or it is difficult to evaluate t… ▽ More

    Submitted 26 May, 2021; v1 submitted 17 April, 2021; originally announced April 2021.

    Comments: To appear in Proceedings of the International Joint Conference on Neural Networks 2021 (IJCNN'21). arXiv admin note: text overlap with arXiv:1910.11998

  40. arXiv:2102.09132  [pdf, other

    cs.GT

    Efficient Carpooling and Toll Pricing for Autonomous Transportation

    Authors: Saurabh Amin, Patrick Jaillet, Manxi Wu

    Abstract: In this paper, we address the existence and computation of competitive equilibrium in the transportation market for autonomous carpooling first proposed by [Ostrovsky and Schwarz, 2019]. At equilibrium, the market organizes carpooled trips over a transportation network in a socially optimal manner and sets the corresponding payments for individual riders and toll prices on edges. The market outcom… ▽ More

    Submitted 17 February, 2021; originally announced February 2021.

  41. arXiv:2012.10695  [pdf, other

    cs.LG stat.ML

    An Information-Theoretic Framework for Unifying Active Learning Problems

    Authors: Quoc Phong Nguyen, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: This paper presents an information-theoretic framework for unifying active learning problems: level set estimation (LSE), Bayesian optimization (BO), and their generalized variant. We first introduce a novel active learning criterion that subsumes an existing LSE algorithm and achieves state-of-the-art performance in LSE problems with a continuous input domain. Then, by exploiting the relationship… ▽ More

    Submitted 19 December, 2020; originally announced December 2020.

    Comments: 35th AAAI Conference on Artificial Intelligence (AAAI 2021), Extended version with derivations, 12 pages

  42. arXiv:2012.10688  [pdf, other

    cs.LG stat.ML

    Top-$k$ Ranking Bayesian Optimization

    Authors: Quoc Phong Nguyen, Sebastian Tay, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: This paper presents a novel approach to top-$k$ ranking Bayesian optimization (top-$k$ ranking BO) which is a practical and significant generalization of preferential BO to handle top-$k$ ranking and tie/indifference observations. We first design a surrogate model that is not only capable of catering to the above observations, but is also supported by a classic random utility model. Another equall… ▽ More

    Submitted 19 December, 2020; originally announced December 2020.

    Comments: 35th AAAI Conference on Artificial Intelligence (AAAI 2021), Extended version with derivations, 13 pages

  43. arXiv:2011.03653  [pdf, other

    cs.GT

    No-regret Learning in Price Competitions under Consumer Reference Effects

    Authors: Negin Golrezaei, Patrick Jaillet, Jason Cheuk Nam Liang

    Abstract: We study long-run market stability for repeated price competitions between two firms, where consumer demand depends on firms' posted prices and consumers' price expectations called reference prices. Consumers' reference prices vary over time according to a memory-based dynamic, which is a weighted average of all historical prices. We focus on the setting where firms are not aware of demand functio… ▽ More

    Submitted 21 March, 2022; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: Includes minor correction to statement of Theorem 5.1 and relevant assumptions in NeurIPS camera ready version

  44. arXiv:2010.12883  [pdf, other

    cs.LG stat.ML

    Variational Bayesian Unlearning

    Authors: Quoc Phong Nguyen, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: This paper studies the problem of approximately unlearning a Bayesian model from a small subset of the training data to be erased. We frame this problem as one of minimizing the Kullback-Leibler divergence between the approximate posterior belief of model parameters after directly unlearning from erased data vs. the exact posterior belief from retraining with remaining data. Using the variational… ▽ More

    Submitted 24 October, 2020; originally announced October 2020.

    Comments: 34th Annual Conference on Neural Information Processing Systems (NeurIPS 2020), Extended version with proofs, 22 pages

  45. arXiv:2010.10154  [pdf, other

    cs.LG cs.AI

    Federated Bayesian Optimization via Thompson Sampling

    Authors: Zhongxiang Dai, Kian Hsiang Low, Patrick Jaillet

    Abstract: Bayesian optimization (BO) is a prominent approach to optimizing expensive-to-evaluate black-box functions. The massive computational capability of edge devices such as mobile phones, coupled with privacy concerns, has led to a surging interest in federated learning (FL) which focuses on collaborative training of deep neural networks (DNNs) via first-order optimization techniques. However, some co… ▽ More

    Submitted 22 October, 2020; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: Accepted to 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Extended version with proofs and additional experimental details and results, 25 pages

  46. arXiv:2009.07925  [pdf, other

    cs.AI cs.DS

    Competitive Ratios for Online Multi-capacity Ridesharing

    Authors: Meghna Lowalekar, Pradeep Varakantham, Patrick Jaillet

    Abstract: In multi-capacity ridesharing, multiple requests (e.g., customers, food items, parcels) with different origin and destination pairs travel in one resource. In recent years, online multi-capacity ridesharing services (i.e., where assignments are made online) like Uber-pool, foodpanda, and on-demand shuttles have become hugely popular in transportation, food delivery, logistics and other domains. Th… ▽ More

    Submitted 16 September, 2020; originally announced September 2020.

    Comments: 28 pages, 4 Figures

  47. arXiv:2009.06051  [pdf, other

    cs.AI

    Zone pAth Construction (ZAC) based Approaches for Effective Real-Time Ridesharing

    Authors: Meghna Lowalekar, Pradeep Varakantham, Patrick Jaillet

    Abstract: Real-time ridesharing systems such as UberPool, Lyft Line, GrabShare have become hugely popular as they reduce the costs for customers, improve per trip revenue for drivers and reduce traffic on the roads by grouping customers with similar itineraries. The key challenge in these systems is to group the "right" requests to travel together in the "right" available vehicles in real-time, so that the… ▽ More

    Submitted 13 September, 2020; originally announced September 2020.

    Comments: 48 pages, 22 figures

  48. arXiv:2008.08048  [pdf, other

    stat.ME cs.LG econ.EM

    Learning Structure in Nested Logit Models

    Authors: Youssef M. Aboutaleb, Moshe Ben-Akiva, Patrick Jaillet

    Abstract: This paper introduces a new data-driven methodology for nested logit structure discovery. Nested logit models allow the modeling of positive correlations between the error terms of the utility specifications of the different alternatives in a discrete choice scenario through the specification of a nesting structure. Current nested logit model estimation practices require an a priori specification… ▽ More

    Submitted 18 August, 2020; originally announced August 2020.

  49. arXiv:2008.07820  [pdf, other

    math.OC cs.LG econ.EM

    A Relation Analysis of Markov Decision Process Frameworks

    Authors: Tien Mai, Patrick Jaillet

    Abstract: We study the relation between different Markov Decision Process (MDP) frameworks in the machine learning and econometrics literatures, including the standard MDP, the entropy and general regularized MDP, and stochastic MDP, where the latter is based on the assumption that the reward function is stochastic and follows a given distribution. We show that the entropy-regularized MDP is equivalent to a… ▽ More

    Submitted 18 August, 2020; originally announced August 2020.

  50. arXiv:2006.16679  [pdf, other

    cs.LG cs.AI cs.GT stat.ML

    R2-B2: Recursive Reasoning-Based Bayesian Optimization for No-Regret Learning in Games

    Authors: Zhongxiang Dai, Yizhou Chen, Kian Hsiang Low, Patrick Jaillet, Teck-Hua Ho

    Abstract: This paper presents a recursive reasoning formalism of Bayesian optimization (BO) to model the reasoning process in the interactions between boundedly rational, self-interested agents with unknown, complex, and costly-to-evaluate payoff functions in repeated games, which we call Recursive Reasoning-Based BO (R2-B2). Our R2-B2 algorithm is general in that it does not constrain the relationship amon… ▽ More

    Submitted 30 June, 2020; originally announced June 2020.

    Comments: Accepted to 37th International Conference on Machine Learning (ICML 2020), Extended version with proofs and additional experimental details and results, 27 pages