Zum Hauptinhalt springen

Showing 1–50 of 75 results for author: Osborne, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.10971  [pdf, other

    cs.LG

    Walking the Values in Bayesian Inverse Reinforcement Learning

    Authors: Ondrej Bajgar, Alessandro Abate, Konstantinos Gatsis, Michael A. Osborne

    Abstract: The goal of Bayesian inverse reinforcement learning (IRL) is recovering a posterior distribution over reward functions using a set of demonstrations from an expert optimizing for a reward unknown to the learner. The resulting posterior over rewards can then be used to synthesize an apprentice policy that performs well on the same or a similar task. A key challenge in Bayesian IRL is bridging the c… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: Published at the 40th Conference on Uncertainty in Artificial Intelligence (UAI 2024)

  2. arXiv:2404.12219  [pdf, other

    cs.LG math.NA stat.ML

    A Quadrature Approach for General-Purpose Batch Bayesian Optimization via Probabilistic Lifting

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Saad Hamid, Harald Oberhauser, Michael A. Osborne

    Abstract: Parallelisation in Bayesian optimisation is a common strategy but faces several challenges: the need for flexibility in acquisition functions and kernel choices, flexibility dealing with discrete and continuous variables simultaneously, model misspecification, and lastly fast massive parallelisation. To address these challenges, we introduce a versatile and modular framework for batch Bayesian opt… ▽ More

    Submitted 19 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: This work is the journal extension of the workshop paper (arXiv:2301.11832) and AISTATS paper (arXiv:2306.05843). 48 pages, 11 figures

    MSC Class: 62C10; 62F15

  3. arXiv:2403.08501  [pdf, other

    cs.CY

    Governing Through the Cloud: The Intermediary Role of Compute Providers in AI Regulation

    Authors: Lennart Heim, Tim Fist, Janet Egan, Sihao Huang, Stephen Zekany, Robert Trager, Michael A Osborne, Noa Zilberman

    Abstract: As jurisdictions around the world take their first steps toward regulating the most powerful AI systems, such as the EU AI Act and the US Executive Order 14110, there is a growing need for effective enforcement mechanisms that can verify compliance and respond to violations. We argue that compute providers should have legal obligations and ethical responsibilities associated with AI development an… ▽ More

    Submitted 26 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: v2: Fixing affiliations, formatting errors, and vector graphics

  4. arXiv:2402.01632  [pdf, other

    cs.LG stat.ML

    Beyond Lengthscales: No-regret Bayesian Optimisation With Unknown Hyperparameters Of Any Type

    Authors: Juliusz Ziomek, Masaki Adachi, Michael A. Osborne

    Abstract: Bayesian optimisation requires fitting a Gaussian process model, which in turn requires specifying hyperparameters - most of the theoretical literature assumes those hyperparameters are known. The commonly used maximum likelihood estimator for hyperparameters of the Gaussian process is consistent only if the data fills the space uniformly, which does not have to be the case in Bayesian optimisatio… ▽ More

    Submitted 13 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  5. arXiv:2402.00809  [pdf, other

    cs.LG stat.ML

    Position: Bayesian Deep Learning is Needed in the Age of Large-Scale AI

    Authors: Theodore Papamarkou, Maria Skoularidou, Konstantina Palla, Laurence Aitchison, Julyan Arbel, David Dunson, Maurizio Filippone, Vincent Fortuin, Philipp Hennig, José Miguel Hernández-Lobato, Aliaksandr Hubin, Alexander Immer, Theofanis Karaletsos, Mohammad Emtiyaz Khan, Agustinus Kristiadi, Yingzhen Li, Stephan Mandt, Christopher Nemeth, Michael A. Osborne, Tim G. J. Rudner, David Rügamer, Yee Whye Teh, Max Welling, Andrew Gordon Wilson, Ruqi Zhang

    Abstract: In the current landscape of deep learning research, there is a predominant emphasis on achieving high predictive accuracy in supervised tasks involving large image and language datasets. However, a broader perspective reveals a multitude of overlooked metrics, tasks, and data types, such as uncertainty, active and continual learning, and scientific data, that demand attention. Bayesian deep learni… ▽ More

    Submitted 6 August, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  6. arXiv:2310.17273  [pdf, other

    cs.LG cs.HC stat.ML

    Looping in the Human Collaborative and Explainable Bayesian Optimization

    Authors: Masaki Adachi, Brady Planden, David A. Howey, Michael A. Osborne, Sebastian Orbell, Natalia Ares, Krikamol Muandet, Siu Lun Chau

    Abstract: Like many optimizers, Bayesian optimization often falls short of gaining user trust due to opacity. While attempts have been made to develop human-centric optimizers, they typically assume user knowledge is well-specified and error-free, employing users mainly as supervisors of the optimization process. We relax these assumptions and propose a more balanced human-AI partnership with our Collaborat… ▽ More

    Submitted 29 February, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted at AISTATS 2024, 24 pages, 11 figures

    MSC Class: 62C10; 62F15

  7. arXiv:2306.05843  [pdf, other

    cs.LG cs.AI math.NA stat.CO stat.ML

    Adaptive Batch Sizes for Active Learning A Probabilistic Numerics Approach

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Xingchen Wan, Vu Nguyen, Harald Oberhauser, Michael A. Osborne

    Abstract: Active learning parallelization is widely used, but typically relies on fixing the batch size throughout experimentation. This fixed approach is inefficient because of a dynamic trade-off between cost and speed -- larger batches are more costly, smaller batches lead to slower wall-clock run-times -- and the trade-off may change over the run (larger batches are often preferable earlier). To address… ▽ More

    Submitted 21 February, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted at AISTATS 2024. 33 pages, 6 figures

    MSC Class: 62C10; 62F15

  8. arXiv:2306.05304  [pdf, other

    cs.LG cs.AI stat.ML

    Bayesian Optimisation of Functions on Graphs

    Authors: Xingchen Wan, Pierre Osselin, Henry Kenlay, Binxin Ru, Michael A. Osborne, Xiaowen Dong

    Abstract: The increasing availability of graph-structured data motivates the task of optimising over functions defined on the node set of graphs. Traditional graph search algorithms can be applied in this case, but they may be sample-inefficient and do not make use of information about the function values; on the other hand, Bayesian optimisation is a class of promising black-box solvers with superior sampl… ▽ More

    Submitted 29 October, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023. 11 pages, 11 figures, 1 table (29 pages, 31 figures, 1 table including references and appendices)

  9. arXiv:2303.08874  [pdf, other

    stat.ML cs.LG

    Bayesian Quadrature for Neural Ensemble Search

    Authors: Saad Hamid, Xingchen Wan, Martin Jørgensen, Binxin Ru, Michael Osborne

    Abstract: Ensembling can improve the performance of Neural Networks, but existing approaches struggle when the architecture likelihood surface has dispersed, narrow peaks. Furthermore, existing methods construct equally weighted ensembles, and this is likely to be vulnerable to the failure modes of the weaker architectures. By viewing ensembling as approximately marginalising over architectures we construct… ▽ More

    Submitted 17 March, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

  10. arXiv:2301.11832  [pdf, other

    cs.LG math.NA stat.CO stat.ML

    SOBER: Highly Parallel Bayesian Optimization and Bayesian Quadrature over Discrete and Mixed Spaces

    Authors: Masaki Adachi, Satoshi Hayakawa, Saad Hamid, Martin Jørgensen, Harald Oberhauser, Micheal A. Osborne

    Abstract: Batch Bayesian optimisation and Bayesian quadrature have been shown to be sample-efficient methods of performing optimisation and quadrature where expensive-to-evaluate objective functions can be queried in parallel. However, current methods do not scale to large batch sizes -- a frequent desideratum in practice (e.g. drug discovery or simulation-based inference). We present a novel algorithm, SOB… ▽ More

    Submitted 5 July, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: 34 pages, 12 figures

    MSC Class: 62C10; 62F15

  11. arXiv:2212.13936  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    On Pathologies in KL-Regularized Reinforcement Learning from Expert Demonstrations

    Authors: Tim G. J. Rudner, Cong Lu, Michael A. Osborne, Yarin Gal, Yee Whye Teh

    Abstract: KL-regularized reinforcement learning from expert demonstrations has proved successful in improving the sample efficiency of deep reinforcement learning algorithms, allowing them to be applied to challenging physical real-world tasks. However, we show that KL-regularized reinforcement learning with behavioral reference policies derived from expert demonstrations can suffer from pathological traini… ▽ More

    Submitted 28 December, 2022; originally announced December 2022.

    Comments: Published in Advances in Neural Information Processing Systems 34 (NeurIPS 2021)

  12. arXiv:2210.17299  [pdf, other

    stat.ME cs.LG eess.SY physics.chem-ph

    Bayesian Model Selection of Lithium-Ion Battery Models via Bayesian Quadrature

    Authors: Masaki Adachi, Yannick Kuhn, Birger Horstmann, Arnulf Latz, Michael A. Osborne, David A. Howey

    Abstract: A wide variety of battery models are available, and it is not always obvious which model `best' describes a dataset. This paper presents a Bayesian model selection approach using Bayesian quadrature. The model evidence is adopted as the selection metric, choosing the simplest model that describes the data, in the spirit of Occam's razor. However, estimating this requires integral computations over… ▽ More

    Submitted 5 April, 2023; v1 submitted 28 October, 2022; originally announced October 2022.

    Comments: 11 pages, 2 figures, accepted at IFAC2023

    MSC Class: 62C10; 62F15

    Journal ref: IFAC-PapersOnLine, 56, 10521, 2023

  13. arXiv:2210.10199  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Bayesian Optimization over Discrete and Mixed Spaces via Probabilistic Reparameterization

    Authors: Samuel Daulton, Xingchen Wan, David Eriksson, Maximilian Balandat, Michael A. Osborne, Eytan Bakshy

    Abstract: Optimizing expensive-to-evaluate black-box functions of discrete (and potentially continuous) design parameters is a ubiquitous problem in scientific and engineering applications. Bayesian optimization (BO) is a popular, sample-efficient method that leverages a probabilistic surrogate model and an acquisition function (AF) to select promising designs to evaluate. However, maximizing the AF over mi… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: To appear in Advances in Neural Information Processing Systems 35, 2022. Code available at: https://github.com/facebookresearch/bo_pr

  14. arXiv:2210.01633  [pdf, other

    cs.LG

    Log-Linear-Time Gaussian Processes Using Binary Tree Kernels

    Authors: Michael K. Cohen, Samuel Daulton, Michael A. Osborne

    Abstract: Gaussian processes (GPs) produce good probabilistic models of functions, but most GP kernels require $O((n+m)n^2)$ time, where $n$ is the number of data points and $m$ the number of predictive locations. We present a new kernel that allows for Gaussian process regression in $O((n+m)\log(n+m))$ time. Our "binary tree" kernel places all data points on the leaves of a binary tree, with the kernel dep… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022; 9 pages + appendices

    Journal ref: Adv.Neur.Info.Proc.Sys. 35 (2022) 8118-8129

  15. arXiv:2209.00343  [pdf, other

    stat.ML cs.LG

    Bézier Gaussian Processes for Tall and Wide Data

    Authors: Martin Jørgensen, Michael A. Osborne

    Abstract: Modern approximations to Gaussian processes are suitable for "tall data", with a cost that scales well in the number of observations, but under-performs on ``wide data'', scaling poorly in the number of input features. That is, as the number of input features grows, good predictive performance requires the number of summarising variables, and their associated cost, to grow rapidly. We introduce a… ▽ More

    Submitted 13 October, 2022; v1 submitted 1 September, 2022; originally announced September 2022.

  16. arXiv:2208.00853  [pdf, other

    cs.SE eess.SY

    Guidance on the Safety Assurance of Autonomous Systems in Complex Environments (SACE)

    Authors: Richard Hawkins, Matt Osborne, Mike Parsons, Mark Nicholson, John McDermid, Ibrahim Habli

    Abstract: Autonomous systems (AS) are systems that have the capability to take decisions free from direct human control. AS are increasingly being considered for adoption for applications where their behaviour may cause harm, such as when used for autonomous driving, medical applications or in domestic environments. For such applications, being able to ensure and demonstrate (assure) the safety of the opera… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

    ACM Class: D.2.0

  17. arXiv:2207.09405  [pdf, other

    cs.LG cs.AI

    Bayesian Generational Population-Based Training

    Authors: Xingchen Wan, Cong Lu, Jack Parker-Holder, Philip J. Ball, Vu Nguyen, Binxin Ru, Michael A. Osborne

    Abstract: Reinforcement learning (RL) offers the potential for training generally capable agents that can interact autonomously in the real world. However, one key limitation is the brittleness of RL algorithms to core hyperparameters and network architecture choice. Furthermore, non-stationarities such as evolving training data and increased agent complexity mean that different hyperparameters and architec… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: AutoML Conference 2022. 10 pages, 4 figure, 3 tables (28 pages, 10 figures, 7 tables including references and appendices)

  18. arXiv:2206.04779  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations

    Authors: Cong Lu, Philip J. Ball, Tim G. J. Rudner, Jack Parker-Holder, Michael A. Osborne, Yee Whye Teh

    Abstract: Offline reinforcement learning has shown great promise in leveraging large pre-collected datasets for policy learning, allowing agents to forgo often-expensive online data collection. However, offline reinforcement learning from visual observations with continuous action spaces remains under-explored, with a limited understanding of the key challenges in this complex domain. In this paper, we esta… ▽ More

    Submitted 6 July, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: Published at TMLR, 2023

  19. arXiv:2206.04734  [pdf, other

    cs.LG math.NA stat.CO stat.ML

    Fast Bayesian Inference with Batch Bayesian Quadrature via Kernel Recombination

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Harald Oberhauser, Michael A. Osborne

    Abstract: Calculation of Bayesian posteriors and model evidences typically requires numerical integration. Bayesian quadrature (BQ), a surrogate-model-based approach to numerical integration, is capable of superb sample efficiency, but its lack of parallelisation has hindered its practical applications. In this work, we propose a parallelised (batch) BQ method, employing techniques from kernel quadrature, t… ▽ More

    Submitted 27 January, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 38 pages, 6 figures

    MSC Class: 62C10; 62F15

    Journal ref: NeurIPS 35, 16533--16547 (2022)

  20. arXiv:2202.07549  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Robust Multi-Objective Bayesian Optimization Under Input Noise

    Authors: Samuel Daulton, Sait Cakmak, Maximilian Balandat, Michael A. Osborne, Enlu Zhou, Eytan Bakshy

    Abstract: Bayesian optimization (BO) is a sample-efficient approach for tuning design parameters to optimize expensive-to-evaluate, black-box performance metrics. In many manufacturing processes, the design parameters are subject to random input noise, resulting in a product that is often less performant than expected. Although BO methods have been proposed for optimizing a single objective under input nois… ▽ More

    Submitted 3 June, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: To appear at ICML 2022. 36 pages. Code is available at https://github.com/facebookresearch/robust_mobo

  21. arXiv:2112.01379  [pdf, other

    cs.SI

    Sentinel node approach to monitoring online COVID-19 misinformation

    Authors: Matthew T. Osborne, Samuel S. Malloy, Erik C. Nisbet, Robert M. Bond, Joseph H. Tien

    Abstract: Understanding how different online communities engage with COVID-19 misinformation is critical for public health response, as misinformation confined to a small, isolated community of users poses a different public health risk than misinformation being consumed by a large population spanning many diverse communities. Here we take a longitudinal approach that leverages tools from network science to… ▽ More

    Submitted 2 December, 2021; originally announced December 2021.

  22. arXiv:2111.11285  [pdf, other

    cond-mat.mes-hall cs.LG

    Bridging the reality gap in quantum devices with physics-aware machine learning

    Authors: D. L. Craig, H. Moon, F. Fedele, D. T. Lennon, B. Van Straaten, F. Vigneau, L. C. Camenzind, D. M. Zumbühl, G. A. D. Briggs, M. A. Osborne, D. Sejdinovic, N. Ares

    Abstract: The discrepancies between reality and simulation impede the optimisation and scalability of solid-state quantum devices. Disorder induced by the unpredictable distribution of material defects is one of the major contributions to the reality gap. We bridge this gap using physics-aware machine learning, in particular, using an approach combining a physical model, deep learning, Gaussian random field… ▽ More

    Submitted 22 November, 2021; originally announced November 2021.

  23. arXiv:2111.02842  [pdf, other

    stat.ML cs.AI cs.CR cs.LG

    Adversarial Attacks on Graph Classification via Bayesian Optimisation

    Authors: Xingchen Wan, Henry Kenlay, Binxin Ru, Arno Blaas, Michael A. Osborne, Xiaowen Dong

    Abstract: Graph neural networks, a popular class of models effective in a wide range of graph-based learning tasks, have been shown to be vulnerable to adversarial attacks. While the majority of the literature focuses on such vulnerability in node-level classification tasks, little effort has been dedicated to analysing adversarial attacks on graph-level classification, an important problem with numerous re… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021. 11 pages, 8 figures, 2 tables (24 pages, 17 figures, 8 tables including references and appendices)

  24. arXiv:2110.12087  [pdf, other

    cs.LG stat.ML

    Gaussian Process Sampling and Optimization with Approximate Upper and Lower Bounds

    Authors: Vu Nguyen, Marc Peter Deisenroth, Michael A. Osborne

    Abstract: Many functions have approximately-known upper and/or lower bounds, potentially aiding the modeling of such functions. In this paper, we introduce Gaussian process models for functions where such bounds are (approximately) known. More specifically, we propose the first use of such bounds to improve Gaussian process (GP) posterior sampling and Bayesian optimization (BO). That is, we transform a GP m… ▽ More

    Submitted 19 October, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: 20 pages

  25. arXiv:2110.04135  [pdf, other

    cs.LG cs.AI

    Revisiting Design Choices in Offline Model-Based Reinforcement Learning

    Authors: Cong Lu, Philip J. Ball, Jack Parker-Holder, Michael A. Osborne, Stephen J. Roberts

    Abstract: Offline reinforcement learning enables agents to leverage large pre-collected datasets of environment transitions to learn control policies, circumventing the need for potentially expensive or unsafe online data collection. Significant progress has been made recently in offline model-based reinforcement learning, approaches which leverage a learned dynamics model. This typically involves construct… ▽ More

    Submitted 16 March, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

    Comments: Spotlight @ ICLR 2022; Spotlight @ RL4RealLife Workshop ICML2021

  26. arXiv:2107.12975  [pdf, other

    cond-mat.mes-hall cs.LG quant-ph

    Cross-architecture Tuning of Silicon and SiGe-based Quantum Devices Using Machine Learning

    Authors: B. Severin, D. T. Lennon, L. C. Camenzind, F. Vigneau, F. Fedele, D. Jirovec, A. Ballabio, D. Chrastina, G. Isella, M. de Kruijf, M. J. Carballido, S. Svab, A. V. Kuhlmann, F. R. Braakman, S. Geyer, F. N. M. Froning, H. Moon, M. A. Osborne, D. Sejdinovic, G. Katsaros, D. M. Zumbühl, G. A. D. Briggs, N. Ares

    Abstract: The potential of Si and SiGe-based devices for the scaling of quantum circuits is tainted by device variability. Each device needs to be tuned to operation conditions. We give a key step towards tackling this variability with an algorithm that, without modification, is capable of tuning a 4-gate Si FinFET, a 5-gate GeSi nanowire and a 7-gate SiGe heterostructure double quantum dot device from scra… ▽ More

    Submitted 27 July, 2021; originally announced July 2021.

  27. arXiv:2107.01959  [pdf, other

    cs.LG stat.ML

    Universal Approximation of Functions on Sets

    Authors: Edward Wagstaff, Fabian B. Fuchs, Martin Engelcke, Michael A. Osborne, Ingmar Posner

    Abstract: Modelling functions of sets, or equivalently, permutation-invariant functions, is a long-standing challenge in machine learning. Deep Sets is a popular method which is known to be a universal approximator for continuous set functions. We provide a theoretical analysis of Deep Sets which shows that this universal approximation property is only guaranteed if the model's latent space is sufficiently… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: 54 pages, 13 figures

  28. arXiv:2106.07452  [pdf, other

    stat.ML cs.LG

    Marginalising over Stationary Kernels with Bayesian Quadrature

    Authors: Saad Hamid, Sebastian Schulze, Michael A. Osborne, Stephen J. Roberts

    Abstract: Marginalising over families of Gaussian Process kernels produces flexible model classes with well-calibrated uncertainty estimates. Existing approaches require likelihood evaluations of many kernels, rendering them prohibitively expensive for larger datasets. We propose a Bayesian Quadrature scheme to make this marginalisation more efficient and thereby more practical. Through use of the maximum m… ▽ More

    Submitted 15 March, 2023; v1 submitted 14 June, 2021; originally announced June 2021.

  29. arXiv:2102.07188  [pdf, other

    stat.ML cs.LG

    Think Global and Act Local: Bayesian Optimisation over High-Dimensional Categorical and Mixed Search Spaces

    Authors: Xingchen Wan, Vu Nguyen, Huong Ha, Binxin Ru, Cong Lu, Michael A. Osborne

    Abstract: High-dimensional black-box optimisation remains an important yet notoriously challenging problem. Despite the success of Bayesian optimisation methods on continuous domains, domains that are categorical, or that mix continuous and categorical variables, remain challenging. We propose a novel solution -- we combine local optimisation with a tailored kernel design, effectively handling high-dimensio… ▽ More

    Submitted 10 June, 2021; v1 submitted 14 February, 2021; originally announced February 2021.

    Comments: ICML 2021. 9 page, 6 figures (26 pages, 16 figures, 2 tables including references and appendices)

  30. arXiv:2010.15750  [pdf, other

    cs.LG

    Gaussian Process Bandit Optimization of the Thermodynamic Variational Objective

    Authors: Vu Nguyen, Vaden Masrani, Rob Brekelmans, Michael A. Osborne, Frank Wood

    Abstract: Achieving the full promise of the Thermodynamic Variational Objective (TVO), a recently proposed variational lower bound on the log evidence involving a one-dimensional Riemann integral approximation, requires choosing a "schedule" of sorted discretization points. This paper introduces a bespoke Gaussian process bandit optimization method for automatically choosing these points. Our approach not o… ▽ More

    Submitted 20 November, 2020; v1 submitted 29 October, 2020; originally announced October 2020.

    Comments: NeurIPS 2020

  31. arXiv:2009.14825  [pdf, other

    cond-mat.mes-hall cs.LG quant-ph

    Deep Reinforcement Learning for Efficient Measurement of Quantum Devices

    Authors: V. Nguyen, S. B. Orbell, D. T. Lennon, H. Moon, F. Vigneau, L. C. Camenzind, L. Yu, D. M. Zumbühl, G. A. D. Briggs, M. A. Osborne, D. Sejdinovic, N. Ares

    Abstract: Deep reinforcement learning is an emerging machine learning approach which can teach a computer to learn from their actions and rewards similar to the way humans learn from experience. It offers many advantages in automating decision processes to navigate large parameter spaces. This paper proposes a novel approach to the efficient measurement of quantum devices based on deep reinforcement learnin… ▽ More

    Submitted 30 September, 2020; originally announced September 2020.

  32. arXiv:2006.07593  [pdf, other

    cs.LG cs.NE stat.ML

    Optimal Transport Kernels for Sequential and Parallel Neural Architecture Search

    Authors: Vu Nguyen, Tam Le, Makoto Yamada, Michael A Osborne

    Abstract: Neural architecture search (NAS) automates the design of deep neural networks. One of the main challenges in searching complex and non-continuous architectures is to compare the similarity of networks that the conventional Euclidean metric may fail to capture. Optimal transport (OT) is resilient to such complex structure by considering the minimal cost for transporting a network into another. Howe… ▽ More

    Submitted 10 June, 2021; v1 submitted 13 June, 2020; originally announced June 2020.

    Comments: 23 pages, camera ready ICML2021

  33. arXiv:2006.07556  [pdf, other

    cs.LG stat.ML

    Interpretable Neural Architecture Search via Bayesian Optimisation with Weisfeiler-Lehman Kernels

    Authors: Binxin Ru, Xingchen Wan, Xiaowen Dong, Michael Osborne

    Abstract: Current neural architecture search (NAS) strategies focus only on finding a single, good, architecture. They offer little insight into why a specific network is performing well, or how we should modify the architecture if we want further improvements. We propose a Bayesian optimisation (BO) approach for NAS that combines the Weisfeiler-Lehman graph kernel with a Gaussian process surrogate. Our met… ▽ More

    Submitted 19 February, 2021; v1 submitted 13 June, 2020; originally announced June 2020.

    Comments: ICLR 2021. 9 pages, 5 figures, 1 table (23 pages, 14 figures and 3 tables including references and appendices)

  34. arXiv:2001.02589  [pdf, other

    cond-mat.mes-hall cs.LG quant-ph

    Machine learning enables completely automatic tuning of a quantum device faster than human experts

    Authors: H. Moon, D. T. Lennon, J. Kirkpatrick, N. M. van Esbroeck, L. C. Camenzind, Liuqi Yu, F. Vigneau, D. M. Zumbühl, G. A. D. Briggs, M. A Osborne, D. Sejdinovic, E. A. Laird, N. Ares

    Abstract: Device variability is a bottleneck for the scalability of semiconductor quantum devices. Increasing device control comes at the cost of a large parameter space that has to be explored in order to find the optimal operating conditions. We demonstrate a statistical tuning algorithm that navigates this entire parameter space, using just a few modelling assumptions, in the search for specific electron… ▽ More

    Submitted 8 January, 2020; originally announced January 2020.

  35. arXiv:1912.09068  [pdf, other

    stat.ML cs.LG

    A Maximum Entropy approach to Massive Graph Spectra

    Authors: Diego Granziol, Robin Ru, Stefan Zohren, Xiaowen Dong, Michael Osborne, Stephen Roberts

    Abstract: Graph spectral techniques for measuring graph similarity, or for learning the cluster number, require kernel smoothing. The choice of kernel function and bandwidth are typically chosen in an ad-hoc manner and heavily affect the resulting output. We prove that kernel smoothing biases the moments of the spectral density. We propose an information theoretically optimal approach to learn a smooth grap… ▽ More

    Submitted 19 December, 2019; originally announced December 2019.

    Comments: 12 pages. 9 Figures

  36. arXiv:1909.09593  [pdf, other

    cs.LG stat.ML

    Bayesian Optimization for Iterative Learning

    Authors: Vu Nguyen, Sebastian Schulze, Michael A Osborne

    Abstract: The performance of deep (reinforcement) learning systems crucially depends on the choice of hyperparameters. Their tuning is notoriously expensive, typically requiring an iterative training process to run for numerous steps to convergence. Traditional tuning algorithms only consider the final performance of hyperparameters acquired after many expensive iterations and ignore intermediate informatio… ▽ More

    Submitted 16 January, 2021; v1 submitted 20 September, 2019; originally announced September 2019.

    Comments: Camera ready NeurIPS 2020

  37. arXiv:1909.03019  [pdf, other

    cs.AI cs.RO eess.SP eess.SY

    Towards Integrating Formal Verification of Autonomous Robots with Battery Prognostics and Health Management

    Authors: Xingyu Zhao, Matt Osborne, Jenny Lantair, Valentin Robu, David Flynn, Xiaowei Huang, Michael Fisher, Fabio Papacchini, Angelo Ferrando

    Abstract: The battery is a key component of autonomous robots. Its performance limits the robot's safety and reliability. Unlike liquid-fuel, a battery, as a chemical device, exhibits complicated features, including (i) capacity fade over successive recharges and (ii) increasing discharge rate as the state of charge (SOC) goes down for a given power demand. Existing formal verification studies of autonomous… ▽ More

    Submitted 22 August, 2019; originally announced September 2019.

    Journal ref: Proceedings of 17th International Conference on Software Engineering and Formal Methods (SEFM 2019), Oslo, Norway (September 2019)

  38. arXiv:1908.08258  [pdf, ps, other

    stat.ML cs.LG

    Adaptive Configuration Oracle for Online Portfolio Selection Methods

    Authors: Favour M. Nyikosa, Michael A. Osborne, Stephen J. Roberts

    Abstract: Financial markets are complex environments that produce enormous amounts of noisy and non-stationary data. One fundamental problem is online portfolio selection, the goal of which is to exploit this data to sequentially select portfolios of assets to achieve positive investment outcomes while managing risks. Various algorithms have been proposed for solving this problem in fields such as finance,… ▽ More

    Submitted 22 August, 2019; originally announced August 2019.

    MSC Class: 62P30 ACM Class: G.3

  39. arXiv:1907.00865  [pdf, other

    stat.ML cs.LG

    Radial Bayesian Neural Networks: Beyond Discrete Support In Large-Scale Bayesian Deep Learning

    Authors: Sebastian Farquhar, Michael Osborne, Yarin Gal

    Abstract: We propose Radial Bayesian Neural Networks (BNNs): a variational approximate posterior for BNNs which scales well to large models while maintaining a distribution over weight-space with full support. Other scalable Bayesian deep learning methods, like MC dropout or deep ensembles, have discrete support-they assign zero probability to almost all of the weight-space. Unlike these discrete support me… ▽ More

    Submitted 31 May, 2021; v1 submitted 1 July, 2019; originally announced July 2019.

    Journal ref: AI Stats, PMLR 108:1352-1362, 2020

  40. arXiv:1906.08878  [pdf, other

    stat.ML cs.LG

    Bayesian Optimisation over Multiple Continuous and Categorical Inputs

    Authors: Binxin Ru, Ahsan S. Alvi, Vu Nguyen, Michael A. Osborne, Stephen J Roberts

    Abstract: Efficient optimisation of black-box problems that comprise both continuous and categorical inputs is important, yet poses significant challenges. We propose a new approach, Continuous and Categorical Bayesian Optimisation (CoCaBO), which combines the strengths of multi-armed bandits and Bayesian optimisation to select values for both categorical and continuous inputs. We model this mixed-type spac… ▽ More

    Submitted 9 August, 2020; v1 submitted 20 June, 2019; originally announced June 2019.

    Comments: 16 pages

  41. arXiv:1906.01101  [pdf, other

    stat.ML cs.LG

    MEMe: An Accurate Maximum Entropy Method for Efficient Approximations in Large-Scale Machine Learning

    Authors: Diego Granziol, Binxin Ru, Stefan Zohren, Xiaowen Doing, Michael Osborne, Stephen Roberts

    Abstract: Efficient approximation lies at the heart of large-scale machine learning problems. In this paper, we propose a novel, robust maximum entropy algorithm, which is capable of dealing with hundreds of moments and allows for computationally efficient approximations. We showcase the usefulness of the proposed method, its equivalence to constrained Bayesian variational inference and demonstrate its supe… ▽ More

    Submitted 3 June, 2019; originally announced June 2019.

    Comments: 18 pages, 3 figures, Published at Entropy 2019: Special Issue Entropy Based Inference and Optimization in Machine Learning

    Journal ref: MEMe: An Accurate Maximum Entropy Method for Efficient Approximations in Large-Scale Machine Learning. Entropy, 21(6), 551 (2019)

  42. arXiv:1905.02685  [pdf, other

    stat.ML cs.LG

    Knowing The What But Not The Where in Bayesian Optimization

    Authors: Vu Nguyen, Michael A. Osborne

    Abstract: Bayesian optimization has demonstrated impressive success in finding the optimum input x* and output f* = f(x*) = max f(x) of a black-box function f. In some applications, however, the optimum output f* is known in advance and the goal is to find the corresponding optimum input x*. In this paper, we consider a new setting in BO in which the knowledge of the optimum output f* is available. Our goal… ▽ More

    Submitted 14 August, 2020; v1 submitted 7 May, 2019; originally announced May 2019.

    Comments: 16 pages

    Journal ref: International Conference on Machine Learning (ICML) 2020

  43. arXiv:1902.09724  [pdf, other

    cs.LG stat.ML

    Automated Model Selection with Bayesian Quadrature

    Authors: Henry Chai, Jean-Francois Ton, Roman Garnett, Michael A. Osborne

    Abstract: We present a novel technique for tailoring Bayesian quadrature (BQ) to model selection. The state-of-the-art for comparing the evidence of multiple models relies on Monte Carlo methods, which converge slowly and are unreliable for computationally expensive models. Previous research has shown that BQ offers sample efficiency superior to Monte Carlo in computing the evidence of an individual model.… ▽ More

    Submitted 1 March, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

    Comments: 10 pages, 5 figures. Currently in submission to ICML 2019

  44. arXiv:1902.08480  [pdf, other

    cs.LG math.DS stat.ML

    AReS and MaRS - Adversarial and MMD-Minimizing Regression for SDEs

    Authors: Gabriele Abbati, Philippe Wenk, Michael A Osborne, Andreas Krause, Bernhard Schölkopf, Stefan Bauer

    Abstract: Stochastic differential equations are an important modeling class in many disciplines. Consequently, there exist many methods relying on various discretization and numerical integration schemes. In this paper, we propose a novel, probabilistic model for estimating the drift and diffusion given noisy observations of the underlying stochastic system. Using state-of-the-art adversarial and moment mat… ▽ More

    Submitted 28 May, 2019; v1 submitted 22 February, 2019; originally announced February 2019.

    Comments: Published at the Thirty-sixth International Conference on Machine Learning (ICML 2019)

  45. arXiv:1902.06278  [pdf, other

    cs.LG math.DS stat.ML

    ODIN: ODE-Informed Regression for Parameter and State Inference in Time-Continuous Dynamical Systems

    Authors: Philippe Wenk, Gabriele Abbati, Michael A Osborne, Bernhard Schölkopf, Andreas Krause, Stefan Bauer

    Abstract: Parameter inference in ordinary differential equations is an important problem in many applied sciences and in engineering, especially in a data-scarce setting. In this work, we introduce a novel generative modeling approach based on constrained Gaussian processes and leverage it to build a computationally and data efficient algorithm for state and parameter inference. In an extensive set of exper… ▽ More

    Submitted 5 December, 2019; v1 submitted 17 February, 2019; originally announced February 2019.

    Comments: Published at the Thirty-fourth AAAI Conference on Artificial Intelligence

  46. arXiv:1901.10452  [pdf, other

    stat.ML cs.AI cs.LG

    Asynchronous Batch Bayesian Optimisation with Improved Local Penalisation

    Authors: Ahsan S. Alvi, Binxin Ru, Jan Calliess, Stephen J. Roberts, Michael A. Osborne

    Abstract: Batch Bayesian optimisation (BO) has been successfully applied to hyperparameter tuning using parallel computing, but it is wasteful of resources: workers that complete jobs ahead of others are left idle. We address this problem by developing an approach, Penalising Locally for Asynchronous Bayesian Optimisation on $k$ workers (PLAyBOOK), for asynchronous parallel BO. We demonstrate empirically th… ▽ More

    Submitted 28 May, 2019; v1 submitted 29 January, 2019; originally announced January 2019.

    Comments: Camera-ready version after incorporating reviewers' suggestions

  47. arXiv:1901.09006  [pdf, other

    cs.LG cs.AI cs.NE cs.RO stat.ML

    On the Limitations of Representing Functions on Sets

    Authors: Edward Wagstaff, Fabian B. Fuchs, Martin Engelcke, Ingmar Posner, Michael Osborne

    Abstract: Recent work on the representation of functions on sets has considered the use of summation in a latent space to enforce permutation invariance. In particular, it has been conjectured that the dimension of this latent space may remain fixed as the cardinality of the sets under consideration increases. However, we demonstrate that the analysis leading to this conjecture requires mappings which are h… ▽ More

    Submitted 7 October, 2019; v1 submitted 25 January, 2019; originally announced January 2019.

    Comments: Published at the International Conference on Machine Learning (2019)

  48. arXiv:1812.01553  [pdf, other

    stat.ML cs.LG stat.CO

    Batch Selection for Parallelisation of Bayesian Quadrature

    Authors: Ed Wagstaff, Saad Hamid, Michael Osborne

    Abstract: Integration over non-negative integrands is a central problem in machine learning (e.g. for model averaging, (hyper-)parameter marginalisation, and computing posterior predictive distributions). Bayesian Quadrature is a probabilistic numerical integration technique that performs promisingly when compared to traditional Markov Chain Monte Carlo methods. However, in contrast to easily-parallelised M… ▽ More

    Submitted 4 December, 2018; originally announced December 2018.

  49. arXiv:1811.10275  [pdf, ps, other

    stat.CO cs.LG math.NA stat.ML

    Rejoinder for "Probabilistic Integration: A Role in Statistical Computation?"

    Authors: Francois-Xavier Briol, Chris J. Oates, Mark Girolami, Michael A. Osborne, Dino Sejdinovic

    Abstract: This article is the rejoinder for the paper "Probabilistic Integration: A Role in Statistical Computation?" to appear in Statistical Science with discussion. We would first like to thank the reviewers and many of our colleagues who helped shape this paper, the editor for selecting our paper for discussion, and of course all of the discussants for their thoughtful, insightful and constructive comme… ▽ More

    Submitted 26 November, 2018; originally announced November 2018.

    Comments: Accepted to Statistical Science

  50. arXiv:1811.09960  [pdf, other

    stat.ML cs.AI cs.CY cs.LG

    Intersectionality: Multiple Group Fairness in Expectation Constraints

    Authors: Jack Fitzsimons, Michael Osborne, Stephen Roberts

    Abstract: Group fairness is an important concern for machine learning researchers, developers, and regulators. However, the strictness to which models must be constrained to be considered fair is still under debate. The focus of this work is on constraining the expected outcome of subpopulations in kernel regression and, in particular, decision tree regression, with application to random forests, boosted tr… ▽ More

    Submitted 25 November, 2018; originally announced November 2018.

    Comments: NeurIPS (previously NIPS) 2018, Workshop on Ethical, Social and Governance Issues in AI