Zum Hauptinhalt springen

Showing 1–15 of 15 results for author: Shilton, A

.
  1. arXiv:2405.15254  [pdf, other

    stat.ML cs.AI cs.LG

    Novel Kernel Models and Exact Representor Theory for Neural Networks Beyond the Over-Parameterized Regime

    Authors: Alistair Shilton, Sunil Gupta, Santu Rana, Svetha Venkatesh

    Abstract: This paper presents two models of neural-networks and their training applicable to neural networks of arbitrary width, depth and topology, assuming only finite-energy neural activations; and a novel representor theory for neural networks in terms of a matrix-valued kernel. The first model is exact (un-approximated) and global, casting the neural network as an elements in a reproducing kernel Banac… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2402.17343  [pdf, other

    cs.LG stat.ML

    Enhanced Bayesian Optimization via Preferential Modeling of Abstract Properties

    Authors: Arun Kumar A V, Alistair Shilton, Sunil Gupta, Santu Rana, Stewart Greenhill, Svetha Venkatesh

    Abstract: Experimental (design) optimization is a key driver in designing and discovering new products and processes. Bayesian Optimization (BO) is an effective tool for optimizing expensive and black-box experimental design processes. While Bayesian optimization is a principled data-driven approach to experimental optimization, it learns everything from scratch and could greatly benefit from the expertise… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 19 Pages, 6 Figures

  3. arXiv:2402.03243  [pdf, other

    cs.LG

    PINN-BO: A Black-box Optimization Algorithm using Physics-Informed Neural Networks

    Authors: Dat Phan-Trong, Hung The Tran, Alistair Shilton, Sunil Gupta

    Abstract: Black-box optimization is a powerful approach for discovering global optima in noisy and expensive black-box functions, a problem widely encountered in real-world scenarios. Recently, there has been a growing interest in leveraging domain knowledge to enhance the efficacy of machine learning methods. Partial Differential Equations (PDEs) often provide an effective means for elucidating the fundame… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  4. arXiv:2303.01684  [pdf, other

    cs.LG cs.AI

    BO-Muse: A human expert and AI teaming framework for accelerated experimental design

    Authors: Sunil Gupta, Alistair Shilton, Arun Kumar A V, Shannon Ryan, Majid Abdolshah, Hung Le, Santu Rana, Julian Berk, Mahad Rashid, Svetha Venkatesh

    Abstract: In this paper we introduce BO-Muse, a new approach to human-AI teaming for the optimization of expensive black-box functions. Inspired by the intrinsic difficulty of extracting expert knowledge and distilling it back into AI models and by observations of human behavior in real-world experimental design, our algorithm lets the human expert take the lead in the experimental process. The human expert… ▽ More

    Submitted 30 March, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: 34 Pages, 7 Figures and 5 Tables

  5. arXiv:2302.00205  [pdf, other

    stat.ML cs.LG

    Gradient Descent in Neural Networks as Sequential Learning in RKBS

    Authors: Alistair Shilton, Sunil Gupta, Santu Rana, Svetha Venkatesh

    Abstract: The study of Neural Tangent Kernels (NTKs) has provided much needed insight into convergence and generalization properties of neural networks in the over-parametrized (wide) limit by approximating the network using a first-order Taylor expansion with respect to its weights in the neighborhood of their initialization values. This allows neural network training to be analyzed from the perspective of… ▽ More

    Submitted 31 January, 2023; originally announced February 2023.

  6. arXiv:2009.03543  [pdf, other

    cs.LG stat.ML

    Sequential Subspace Search for Functional Bayesian Optimization Incorporating Experimenter Intuition

    Authors: Alistair Shilton, Sunil Gupta, Santu Rana, Svetha Venkatesh

    Abstract: We propose an algorithm for Bayesian functional optimisation - that is, finding the function to optimise a process - guided by experimenter beliefs and intuitions regarding the expected characteristics (length-scale, smoothness, cyclicity etc.) of the optimal solution encoded into the covariance function of a Gaussian Process. Our algorithm generates a sequence of finite-dimensional random subspac… ▽ More

    Submitted 8 September, 2020; originally announced September 2020.

  7. arXiv:2007.07459  [pdf, other

    stat.ML cs.LG

    From deep to Shallow: Equivalent Forms of Deep Networks in Reproducing Kernel Krein Space and Indefinite Support Vector Machines

    Authors: Alistair Shilton, Sunil Gupta, Santu Rana, Svetha Venkatesh

    Abstract: In this paper we explore a connection between deep networks and learning in reproducing kernel Krein space. Our approach is based on the concept of push-forward - that is, taking a fixed non-linear transform on a linear projection and converting it to a linear projection on the output of a fixed non-linear transform, pushing the weights forward through the non-linearity. Applying this repeatedly f… ▽ More

    Submitted 8 September, 2020; v1 submitted 14 July, 2020; originally announced July 2020.

  8. arXiv:1911.12473  [pdf, other

    cs.LG cs.AI stat.ML

    Bayesian Optimization for Categorical and Category-Specific Continuous Inputs

    Authors: Dang Nguyen, Sunil Gupta, Santu Rana, Alistair Shilton, Svetha Venkatesh

    Abstract: Many real-world functions are defined over both categorical and category-specific continuous variables and thus cannot be optimized by traditional Bayesian optimization (BO) methods. To optimize such functions, we propose a new method that formulates the problem as a multi-armed bandit problem, wherein each category corresponds to an arm with its reward distribution centered around the optimum of… ▽ More

    Submitted 27 November, 2019; originally announced November 2019.

    Comments: To appear at AAAI 2020

  9. arXiv:1909.03600  [pdf, other

    cs.LG math.OC stat.ML

    Cost-aware Multi-objective Bayesian optimisation

    Authors: Majid Abdolshah, Alistair Shilton, Santu Rana, Sunil Gupta, Svetha Venkatesh

    Abstract: The notion of expense in Bayesian optimisation generally refers to the uniformly expensive cost of function evaluations over the whole search space. However, in some scenarios, the cost of evaluation for black-box objective functions is non-uniform since different inputs from search space may incur different costs for function evaluations. We introduce a cost-aware multi-objective Bayesian optimis… ▽ More

    Submitted 8 September, 2019; originally announced September 2019.

  10. arXiv:1902.07846  [pdf, other

    stat.ML cs.LG

    Stable Bayesian Optimisation via Direct Stability Quantification

    Authors: Alistair Shilton, Sunil Gupta, Santu Rana, Svetha Venkatesh, Majid Abdolshah, Dang Nguyen

    Abstract: In this paper we consider the problem of finding stable maxima of expensive (to evaluate) functions. We are motivated by the optimisation of physical and industrial processes where, for some input ranges, small and unavoidable variations in inputs lead to unacceptably large variation in outputs. Our approach uses multiple gradient Gaussian Process models to estimate the probability that worst-case… ▽ More

    Submitted 20 February, 2019; originally announced February 2019.

  11. arXiv:1902.04228  [pdf, other

    cs.LG cs.AI stat.ML

    Multi-objective Bayesian optimisation with preferences over objectives

    Authors: Majid Abdolshah, Alistair Shilton, Santu Rana, Sunil Gupta, Svetha Venkatesh

    Abstract: We present a multi-objective Bayesian optimisation algorithm that allows the user to express preference-order constraints on the objectives of the type "objective A is more important than objective B". These preferences are defined based on the stability of the obtained solutions with respect to preferred objective functions. Rather than attempting to find a representative subset of the complete P… ▽ More

    Submitted 12 November, 2019; v1 submitted 11 February, 2019; originally announced February 2019.

  12. arXiv:1805.07852  [pdf, other

    stat.ML cs.LG

    Accelerated Bayesian Optimization throughWeight-Prior Tuning

    Authors: Alistair Shilton, Sunil Gupta, Santu Rana, Pratibha Vellanki, Laurence Park, Cheng Li, Svetha Venkatesh, Alessandra Sutti, David Rubin, Thomas Dorin, Alireza Vahid, Murray Height, Teo Slezak

    Abstract: Bayesian optimization (BO) is a widely-used method for optimizing expensive (to evaluate) problems. At the core of most BO methods is the modeling of the objective function using a Gaussian Process (GP) whose covariance is selected from a set of standard covariance functions. From a weight-space view, this models the objective as a linear function in a feature space implied by the given covariance… ▽ More

    Submitted 6 February, 2020; v1 submitted 20 May, 2018; originally announced May 2018.

    Journal ref: PMLR 108:635-645, 2020

  13. arXiv:1802.05400  [pdf

    stat.ML

    High Dimensional Bayesian Optimization Using Dropout

    Authors: Cheng Li, Sunil Gupta, Santu Rana, Vu Nguyen, Svetha Venkatesh, Alistair Shilton

    Abstract: Scaling Bayesian optimization to high dimensions is challenging task as the global optimization of high-dimensional acquisition function can be expensive and often infeasible. Existing methods depend either on limited active variables or the additive form of the objective function. We propose a new method for high-dimensional Bayesian optimization, that uses a dropout strategy to optimize only a s… ▽ More

    Submitted 14 February, 2018; originally announced February 2018.

    Comments: 7 pages; Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence 2017

  14. arXiv:1802.05370  [pdf, other

    stat.ML

    Covariance Function Pre-Training with m-Kernels for Accelerated Bayesian Optimisation

    Authors: Alistair Shilton, Sunil Gupta, Santu Rana, Pratibha Vellanki, Cheng Li, Laurence Park, Svetha Venkatesh, Alessandra Sutti, David Rubin, Thomas Dorin, Alireza Vahid, Murray Height

    Abstract: The paper presents a novel approach to direct covariance function learning for Bayesian optimisation, with particular emphasis on experimental design problems where an existing corpus of condensed knowledge is present. The method presented borrows techniques from reproducing kernel Banach space theory (specifically m-kernels) and leverages them to convert (or re-weight) existing covariance functio… ▽ More

    Submitted 12 March, 2018; v1 submitted 14 February, 2018; originally announced February 2018.

  15. arXiv:1106.4613  [pdf, ps, other

    hep-ph physics.data-an

    Fast supersymmetry phenomenology at the Large Hadron Collider using machine learning techniques

    Authors: A. Buckley, A. Shilton, M. J. White

    Abstract: A pressing problem for supersymmetry (SUSY) phenomenologists is how to incorporate Large Hadron Collider search results into parameter fits designed to measure or constrain the SUSY parameters. Owing to the computational expense of fully simulating lots of points in a generic SUSY space to aid the calculation of the likelihoods, the limits published by experimental collaborations are frequently in… ▽ More

    Submitted 7 July, 2011; v1 submitted 22 June, 2011; originally announced June 2011.

    Comments: 20 pages, 7 figures, replaced to correct author contact details