Showing 1–33 of 33 results for author: Frazier, P I

Searching in archive cs.
  1. arXiv:2406.07866  [pdf, other]

    cs.LG math.OC

    Asymptotically Optimal Regret for Black-Box Predict-then-Optimize

    Authors: Samuel Tan, Peter I. Frazier

    Abstract: We consider the predict-then-optimize paradigm for decision-making in which a practitioner (1) trains a supervised learning model on historical data of decisions, contexts, and rewards, and then (2) uses the resulting model to make future binary decisions for new contexts by finding the decision that maximizes the model's predicted reward. This approach is common in industry. Past analysis assumes…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 15 pages, 2 figures, 3 tables

  2. arXiv:2311.02146  [pdf, other]

    stat.ML cs.LG math.OC

    Bayesian Optimization of Function Networks with Partial Evaluations

    Authors: Poompol Buathong, Jiayue Wan, Raul Astudillo, Samuel Daulton, Maximilian Balandat, Peter I. Frazier

    Abstract: Bayesian optimization is a powerful framework for optimizing functions that are expensive or time-consuming to evaluate. Recent work has considered Bayesian optimization of function networks (BOFN), where the objective function is given by a network of functions, each taking as input the output of previous nodes in the network as well as additional parameters. Leveraging this network structure has…

    Submitted 12 June, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: 34 pages, 15 figures, 3 tables

  3. arXiv:2303.15746  [pdf, other]

    cs.LG stat.ML

    qEUBO: A Decision-Theoretic Acquisition Function for Preferential Bayesian Optimization

    Authors: Raul Astudillo, Zhiyuan Jerry Lin, Eytan Bakshy, Peter I. Frazier

    Abstract: Preferential Bayesian optimization (PBO) is a framework for optimizing a decision maker's latent utility function using preference feedback. This work introduces the expected utility of the best option (qEUBO) as a novel acquisition function for PBO. When the decision maker's responses are noise-free, we show that qEUBO is one-step Bayes optimal and thus equivalent to the popular knowledge gradien…

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: In Proceedings of the 26th International Conference on Artificial Intelligence and Statistics (AISTATS) 2023

  4. arXiv:2301.12366  [pdf, other]

    cs.LG cs.AI math.OC math.ST

    Smooth Non-Stationary Bandits

    Authors: Su Jia, Qian Xie, Nathan Kallus, Peter I. Frazier

    Abstract: In many applications of online decision making, the environment is non-stationary and it is therefore crucial to use bandit algorithms that handle changes. Most existing approaches are designed to protect against non-smooth changes, constrained only by total variation or Lipschitzness over time, where they guarantee $\tilde{\Theta}(T^{2/3})$ regret. However, in practice environments are often changing…

    Submitted 7 June, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

    Comments: Accepted by ICML 2023

  5. arXiv:2205.09679  [pdf, ps, other]

    math.OC cs.GT

    Dynamic Pricing Provides Robust Equilibria in Stochastic Ridesharing Networks

    Authors: J. Massey Cashore, Peter I. Frazier, Eva Tardos

    Abstract: Ridesharing markets are complex: drivers are strategic, rider demand and driver availability are stochastic, and complex city-scale phenomena like weather induce large scale correlation across space and time. At the same time, past work has focused on a subset of these challenges. We propose a model of ridesharing networks with strategic drivers, spatiotemporal dynamics, and stochasticity. Support…

    Submitted 19 May, 2022; originally announced May 2022.

  6. arXiv:2203.15853  [pdf, other]

    cs.LG math.OC

    Near-optimality for infinite-horizon restless bandits with many arms

    Authors: Xiangyu Zhang, Peter I. Frazier

    Abstract: Restless bandits are an important class of problems with applications in recommender systems, active learning, revenue management and other areas. We consider infinite-horizon discounted restless bandits with many arms where a fixed proportion of arms may be pulled in each period and where arms share a finite state space. Although an average-case-optimal policy can be computed via stochastic dynam…

    Submitted 29 March, 2022; originally announced March 2022.

  7. arXiv:2203.11382  [pdf, other]

    cs.LG math.OC stat.ML

    Preference Exploration for Efficient Bayesian Optimization with Multiple Outcomes

    Authors: Zhiyuan Jerry Lin, Raul Astudillo, Peter I. Frazier, Eytan Bakshy

    Abstract: We consider Bayesian optimization of expensive-to-evaluate experiments that generate vector-valued outcomes over which a decision-maker (DM) has preferences. These preferences are encoded by a utility function that is not known in closed form but can be estimated by asking the DM to express preferences over pairs of outcome vectors. To address this problem, we develop Bayesian optimization with pr…

    Submitted 21 March, 2022; originally announced March 2022.

    Journal ref: AISTATS 2022

  8. arXiv:2201.00272  [pdf, other]

    cs.LG math.OC stat.ML

    Thinking inside the box: A tutorial on grey-box Bayesian optimization

    Authors: Raul Astudillo, Peter I. Frazier

    Abstract: Bayesian optimization (BO) is a framework for global optimization of expensive-to-evaluate objective functions. Classical BO methods assume that the objective function is a black box. However, internal information about objective function computation is often available. For example, when optimizing a manufacturing line's throughput with simulation, we observe the number of parts waiting at each wo…

    Submitted 1 January, 2022; originally announced January 2022.

    Comments: Published as an advanced tutorial in the proceedings of the 2021 Winter Simulation Conference

  9. arXiv:2112.15311  [pdf, other]

    cs.LG math.OC stat.ML

    Bayesian Optimization of Function Networks

    Authors: Raul Astudillo, Peter I. Frazier

    Abstract: We consider Bayesian optimization of the output of a network of functions, where each function takes as input the output of its parent nodes, and where the network takes significant time to evaluate. Such problems arise, for example, in reinforcement learning, engineering design, and manufacturing. While the standard Bayesian optimization approach observes only the final output, our approach deliv…

    Submitted 31 December, 2021; originally announced December 2021.

    Comments: In Advances in Neural Information Processing Systems, 2021

  10. arXiv:2112.02833  [pdf, other]

    stat.ML cs.AI cs.LG

    Two-step Lookahead Bayesian Optimization with Inequality Constraints

    Authors: Yunxiang Zhang, Xiangyu Zhang, Peter I. Frazier

    Abstract: Recent advances in computationally efficient non-myopic Bayesian optimization (BO) improve query efficiency over traditional myopic methods like expected improvement while only modestly increasing computational cost. These advances have been largely limited, however, to unconstrained optimization. For constrained optimization, the few existing non-myopic BO methods require heavy computation. For i…

    Submitted 6 December, 2021; originally announced December 2021.

  11. arXiv:2111.06537  [pdf, other]

    cs.LG math.OC stat.ML

    Multi-Step Budgeted Bayesian Optimization with Unknown Evaluation Costs

    Authors: Raul Astudillo, Daniel R. Jiang, Maximilian Balandat, Eytan Bakshy, Peter I. Frazier

    Abstract: Bayesian optimization (BO) is a sample-efficient approach to optimizing costly-to-evaluate black-box functions. Most BO methods ignore how evaluation costs may vary over the optimization domain. However, these costs can be highly heterogeneous and are often unknown in advance. This occurs in many practical settings, such as hyperparameter tuning of machine learning algorithms or physics-based simu…

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: In Advances in Neural Information Processing Systems, 2021

  12. arXiv:2107.11911  [pdf, other]

    math.OC cs.LG math.PR

    Restless Bandits with Many Arms: Beating the Central Limit Theorem

    Authors: Xiangyu Zhang, Peter I. Frazier

    Abstract: We consider finite-horizon restless bandits with multiple pulls per period, which play an important role in recommender systems, active learning, revenue management, and many other areas. While an optimal policy can be computed, in principle, using dynamic programming, the computation required scales exponentially in the number of arms $N$. Thus, there is substantial value in understanding the per…

    Submitted 25 July, 2021; originally announced July 2021.

  13. arXiv:1911.05934  [pdf, ps, other]

    stat.ML cs.LG math.OC

    Multi-Attribute Bayesian Optimization With Interactive Preference Learning

    Authors: Raul Astudillo, Peter I. Frazier

    Abstract: We consider black-box global optimization of time-consuming-to-evaluate functions on behalf of a decision-maker (DM) whose preferences must be learned. Each feasible design is associated with a time-consuming-to-evaluate vector of attributes and each vector of attributes is assigned a utility by the DM's utility function, which may be learned approximately using preferences expressed over pairs of…

    Submitted 3 March, 2020; v1 submitted 13 November, 2019; originally announced November 2019.

    Comments: In Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

  14. arXiv:1906.01537  [pdf, ps, other]

    stat.ML cs.LG math.OC

    Bayesian Optimization of Composite Functions

    Authors: Raul Astudillo, Peter I. Frazier

    Abstract: We consider optimization of composite objective functions, i.e., of the form $f(x)=g(h(x))$, where $h$ is a black-box derivative-free expensive-to-evaluate function with vector-valued outputs, and $g$ is a cheap-to-evaluate real-valued function. While these problems can be solved with standard Bayesian optimization, we propose a novel approach that exploits the composite structure of the objective…

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: In Proceedings of the 36th International Conference on Machine Learning, PMLR 97:354-363, 2019

    Journal ref: In Proceedings of the 36th International Conference on Machine Learning, PMLR 97:354-363, 2019

  15. arXiv:1903.04703  [pdf, other]

    cs.LG math.OC stat.ME stat.ML

    Practical Multi-fidelity Bayesian Optimization for Hyperparameter Tuning

    Authors: Jian Wu, Saul Toscano-Palmerin, Peter I. Frazier, Andrew Gordon Wilson

    Abstract: Bayesian optimization is popular for optimizing time-consuming black-box objectives. Nonetheless, for hyperparameter tuning in deep neural networks, the time required to evaluate the validation error for even a few hyperparameter settings remains a bottleneck. Multi-fidelity optimization promises relief using cheaper proxies to such objectives --- for example, validation error for a network traine…

    Submitted 11 March, 2019; originally announced March 2019.

  16. arXiv:1807.02811  [pdf, other]

    stat.ML cs.LG math.OC

    A Tutorial on Bayesian Optimization

    Authors: Peter I. Frazier

    Abstract: Bayesian optimization is an approach to optimizing objective functions that take a long time (minutes or hours) to evaluate. It is best-suited for optimization over continuous domains of less than 20 dimensions, and tolerates stochastic noise in function evaluations. It builds a surrogate for the objective and quantifies the uncertainty in that surrogate using a Bayesian machine learning technique…

    Submitted 8 July, 2018; originally announced July 2018.

  17. arXiv:1803.08661  [pdf, other]

    cs.LG stat.ML

    Bayesian Optimization with Expensive Integrands

    Authors: Saul Toscano-Palmerin, Peter I. Frazier

    Abstract: We propose a Bayesian optimization algorithm for objective functions that are sums or integrals of expensive-to-evaluate functions, allowing noisy evaluations. These objective functions arise in multi-task Bayesian optimization for tuning machine learning hyperparameters, optimization via simulation, and sequential design of experiments with random environmental conditions. Our method is average-c…

    Submitted 23 March, 2018; originally announced March 2018.

  18. arXiv:1707.06541   

    stat.ML cs.AI cs.LG math.OC math.PR

    Discretization-free Knowledge Gradient Methods for Bayesian Optimization

    Authors: Jian Wu, Peter I. Frazier

    Abstract: This paper studies Bayesian ranking and selection (R&S) problems with correlated prior beliefs and continuous domains, i.e. Bayesian optimization (BO). Knowledge gradient methods [Frazier et al., 2008, 2009] have been widely studied for discrete R&S problems, which sample the one-step Bayes-optimal point. When used over continuous domains, previous work on the knowledge gradient [Scott et al., 201…

    Submitted 26 July, 2017; v1 submitted 20 July, 2017; originally announced July 2017.

    Comments: This paper, which combines and extends two conference papers (arXiv:1703.04389, arXiv:1606.04414), has been withdrawn by the authors because it was submitted prematurely before proper attribution could be provided

  19. arXiv:1706.04304  [pdf, other]

    cs.LG

    Dueling Bandits With Weak Regret

    Authors: Bangrui Chen, Peter I. Frazier

    Abstract: We consider online content recommendation with implicit feedback through pairwise comparisons, formalized as the so-called dueling bandit problem. We study the dueling bandit problem in the Condorcet winner setting, and consider two notions of regret: the more well-studied strong regret, which is 0 only when both arms pulled are the Condorcet winner; and the less well-studied weak regret, which is…

    Submitted 13 June, 2017; originally announced June 2017.

  20. arXiv:1703.04389  [pdf, other]

    stat.ML cs.AI cs.LG math.OC

    Bayesian Optimization with Gradients

    Authors: Jian Wu, Matthias Poloczek, Andrew Gordon Wilson, Peter I. Frazier

    Abstract: Bayesian optimization has been successful at global optimization of expensive-to-evaluate multimodal objective functions. However, unlike most optimization methods, Bayesian optimization typically does not use derivative information. In this paper we show how Bayesian optimization can exploit derivative information to decrease the number of objective function evaluations required for good performa…

    Submitted 6 February, 2018; v1 submitted 13 March, 2017; originally announced March 2017.

    Comments: Advances in Neural Information Processing Systems 30 (NIPS), 2017

    Journal ref: Advances in Neural Information Processing Systems 30 (NIPS), 2017

  21. arXiv:1702.07694  [pdf, other]

    stat.ML cs.IT cs.LG

    Bayes-Optimal Entropy Pursuit for Active Choice-Based Preference Learning

    Authors: Stephen N. Pallone, Peter I. Frazier, Shane G. Henderson

    Abstract: We analyze the problem of learning a single user's preferences in an active learning setting, sequentially and adaptively querying the user over a finite time horizon. Learning is conducted via choice-based queries, where the user selects her preferred option among a small subset of offered alternatives. These queries have been shown to be a robust and efficient way to learn an individual's prefer…

    Submitted 24 February, 2017; originally announced February 2017.

  22. arXiv:1608.03585  [pdf, other]

    stat.ML cs.LG stat.AP

    Warm Starting Bayesian Optimization

    Authors: Matthias Poloczek, Jialei Wang, Peter I. Frazier

    Abstract: We develop a framework for warm-starting Bayesian optimization, that reduces the solution time required to solve an optimization problem that is one in a sequence of related problems. This is useful when optimizing the output of a stochastic simulator that fails to provide derivative information, for which Bayesian optimization methods are well-suited. Solving sequences of related optimization pro…

    Submitted 11 August, 2016; originally announced August 2016.

    Comments: To Appear in the Proc. of the 2016 Winter Simulation Conference

  23. arXiv:1607.03195  [pdf, other]

    math.OC cs.LG stat.CO

    Multi-Step Bayesian Optimization for One-Dimensional Feasibility Determination

    Authors: J. Massey Cashore, Lemuel Kumarga, Peter I. Frazier

    Abstract: Bayesian optimization methods allocate limited sampling budgets to maximize expensive-to-evaluate functions. One-step-lookahead policies are often used, but computing optimal multi-step-lookahead policies remains a challenge. We consider a specialized Bayesian optimization problem: finding the superlevel set of an expensive one-dimensional function, with a Markov process prior. We compute the Baye…

    Submitted 11 July, 2016; originally announced July 2016.

  24. arXiv:1606.04414  [pdf, other]

    stat.ML cs.AI cs.LG

    The Parallel Knowledge Gradient Method for Batch Bayesian Optimization

    Authors: Jian Wu, Peter I. Frazier

    Abstract: In many applications of black-box optimization, one can evaluate multiple points simultaneously, e.g. when evaluating the performances of several different neural network architectures in a parallel computing environment. In this paper, we develop a novel batch Bayesian optimization algorithm --- the parallel knowledge gradient method. By construction, this method provides the one-step Bayes-optim…

    Submitted 22 April, 2018; v1 submitted 14 June, 2016; originally announced June 2016.

    Comments: Minor edits and typo fixes. Please cite "J. Wu and P. Frazier. The parallel knowledge gradient method for batch bayesian optimization. In Advances In Neural Information Processing Systems, pp. 3126-3134. 2016"

  25. arXiv:1605.09088  [pdf, other]

    cs.LG

    The Bayesian Linear Information Filtering Problem

    Authors: Bangrui Chen, Peter I. Frazier

    Abstract: We present a Bayesian sequential decision-making formulation of the information filtering problem, in which an algorithm presents items (news articles, scientific papers, tweets) arriving in a stream, and learns relevance from user feedback on presented items. We model user preferences using a Bayesian linear model, similar in spirit to a Bayesian linear bandit. We compute a computational upper bo…

    Submitted 22 October, 2016; v1 submitted 29 May, 2016; originally announced May 2016.

  26. arXiv:1605.08838  [pdf, other]

    cs.LG

    Dueling Bandits with Dependent Arms

    Authors: Bangrui Chen, Peter I. Frazier

    Abstract: We study dueling bandits with weak utility-based regret when preferences over arms have a total order and carry observable feature vectors. The order is assumed to be determined by these feature vectors, an unknown preference vector, and a known utility function. This structure introduces dependence between preferences for pairs of arms, and allows learning about the preference over one pair of ar…

    Submitted 14 June, 2017; v1 submitted 27 May, 2016; originally announced May 2016.

  27. arXiv:1602.02338  [pdf, other]

    cs.LG math.OC stat.ML

    Stratified Bayesian Optimization

    Authors: Saul Toscano-Palmerin, Peter I. Frazier

    Abstract: We consider derivative-free black-box global optimization of expensive noisy functions, when most of the randomness in the objective is produced by a few influential scalar random inputs. We present a new Bayesian global optimization algorithm, called Stratified Bayesian Optimization (SBO), which uses this strong dependence to improve performance. Our algorithm is similar in spirit to stratificati…

    Submitted 20 February, 2016; v1 submitted 6 February, 2016; originally announced February 2016.

  28. arXiv:1512.09204  [pdf, other]

    cs.LG cs.AI stat.ML

    Bayes-Optimal Effort Allocation in Crowdsourcing: Bounds and Index Policies

    Authors: Weici Hu, Peter I. Frazier

    Abstract: We consider effort allocation in crowdsourcing, where we wish to assign labeling tasks to imperfect homogeneous crowd workers to maximize overall accuracy in a continuous-time Bayesian setting, subject to budget and time constraints. The Bayes-optimal policy for this problem is the solution to a partially observable Markov decision process, but the curse of dimensionality renders the computation i…

    Submitted 30 December, 2015; originally announced December 2015.

  29. arXiv:1505.06538  [pdf, other]

    stat.ML cs.LG cs.SI

    Clustering via Content-Augmented Stochastic Blockmodels

    Authors: J. Massey Cashore, Xiaoting Zhao, Alexander A. Alemi, Yujia Liu, Peter I. Frazier

    Abstract: Much of the data being created on the web contains interactions between users and items. Stochastic blockmodels, and other methods for community detection and clustering of bipartite graphs, can infer latent user communities and latent item clusters from this interaction data. These methods, however, typically ignore the items' contents and the information they provide about item clusters, despite…

    Submitted 25 May, 2015; originally announced May 2015.

  30. arXiv:1410.7852  [pdf, other]

    cs.LG cs.IR math.OC

    A Markov Decision Process Analysis of the Cold Start Problem in Bayesian Information Filtering

    Authors: Xiaoting Zhao, Peter I. Frazier

    Abstract: We consider the information filtering problem, in which we face a stream of items, and must decide which ones to forward to a user to maximize the number of relevant items shown, minus a penalty for each irrelevant item shown. Forwarding decisions are made separately in a personalized way for each user. We focus on the cold-start setting for this problem, in which we have limited historical data o…

    Submitted 28 October, 2014; originally announced October 2014.

    Comments: 12 pages, 9 figures

  31. arXiv:1407.8186  [pdf, other]

    math.OC cs.IR

    Exploration vs. Exploitation in the Information Filtering Problem

    Authors: Xiaoting Zhao, Peter I. Frazier

    Abstract: We consider information filtering, in which we face a stream of items too voluminous to process by hand (e.g., scientific articles, blog posts, emails), and must rely on a computer system to automatically filter out irrelevant items. Such systems face the exploration vs. exploitation tradeoff, in which it may be beneficial to present an item despite a low probability of relevance, just to learn ab…

    Submitted 8 February, 2015; v1 submitted 30 July, 2014; originally announced July 2014.

    Comments: 36 pages, 5 figures

  32. arXiv:1407.4446  [pdf, other]

    cs.IT cs.LG math.OC math.ST stat.ML

    Probabilistic Group Testing under Sum Observations: A Parallelizable 2-Approximation for Entropy Loss

    Authors: Weidong Han, Purnima Rajan, Peter I. Frazier, Bruno M. Jedynak

    Abstract: We consider the problem of group testing with sum observations and noiseless answers, in which we aim to locate multiple objects by querying the number of objects in each of a sequence of chosen sets. We study a probabilistic setting with entropy loss, in which we assume a joint Bayesian prior density on the locations of the objects and seek to choose the sets queried to minimize the expected entr…

    Submitted 22 September, 2015; v1 submitted 16 July, 2014; originally announced July 2014.

  33. arXiv:1407.2676  [pdf, other]

    math.OC cs.AI cs.LG eess.SY stat.ML

    A New Optimal Stepsize For Approximate Dynamic Programming

    Authors: Ilya O. Ryzhov, Peter I. Frazier, Warren B. Powell

    Abstract: Approximate dynamic programming (ADP) has proven itself in a wide range of applications spanning large-scale transportation problems, health care, revenue management, and energy systems. The design of effective ADP algorithms has many dimensions, but one crucial factor is the stepsize rule used to update a value function approximation. Many operations research applications are computationally inte…

    Submitted 13 July, 2014; v1 submitted 9 July, 2014; originally announced July 2014.

    Comments: Matlab files are included with the paper source