Zum Hauptinhalt springen

Showing 1–50 of 112 results for author: Turner, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.04745  [pdf, other

    cs.AI physics.ao-ph

    AI for operational methane emitter monitoring from space

    Authors: Anna Vaughan, Gonzalo Mateo-Garcia, Itziar Irakulis-Loitxate, Marc Watine, Pablo Fernandez-Poblaciones, Richard E. Turner, James Requeima, Javier Gorroño, Cynthia Randles, Manfredi Caltagirone, Claudio Cifarelli

    Abstract: Mitigating methane emissions is the fastest way to stop global warming in the short-term and buy humanity time to decarbonise. Despite the demonstrated ability of remote sensing instruments to detect methane plumes, no system has been available to routinely monitor and act on these events. We present MARS-S2L, an automated AI-driven methane emitter monitoring system for Sentinel-2 and Landsat sate… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

  2. arXiv:2407.16526  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Imperfect Vision Encoders: Efficient and Robust Tuning for Vision-Language Models

    Authors: Aristeidis Panos, Rahaf Aljundi, Daniel Olmeda Reino, Richard E Turner

    Abstract: Vision language models (VLMs) demonstrate impressive capabilities in visual question answering and image captioning, acting as a crucial link between visual and language models. However, existing open-source VLMs heavily rely on pretrained and frozen vision encoders (such as CLIP). Despite CLIP's robustness across diverse domains, it still exhibits non-negligible image understanding errors. These… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  3. arXiv:2406.13493  [pdf, other

    cs.LG stat.ML

    In-Context In-Context Learning with Transformer Neural Processes

    Authors: Matthew Ashman, Cristiana Diaconu, Adrian Weller, Richard E. Turner

    Abstract: Neural processes (NPs) are a powerful family of meta-learning models that seek to approximate the posterior predictive map of the ground-truth stochastic process from which each dataset in a meta-dataset is sampled. There are many cases in which practitioners, besides having access to the dataset of interest, may also have access to other datasets that share similarities with it. In this case, int… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  4. arXiv:2406.13488  [pdf, other

    stat.ML cs.LG

    Approximately Equivariant Neural Processes

    Authors: Matthew Ashman, Cristiana Diaconu, Adrian Weller, Wessel Bruinsma, Richard E. Turner

    Abstract: Equivariant deep learning architectures exploit symmetries in learning problems to improve the sample efficiency of neural-network-based models and their ability to generalise. However, when modelling real-world data, learning problems are often not exactly equivariant, but only approximately. For example, when estimating the global temperature field from weather station observations, local topogr… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  5. arXiv:2406.13151  [pdf, other

    stat.ML cs.LG stat.CO

    von Mises Quasi-Processes for Bayesian Circular Regression

    Authors: Yarden Cohen, Alexandre Khae Wu Navarro, Jes Frellsen, Richard E. Turner, Raziel Riemer, Ari Pakman

    Abstract: The need for regression models to predict circular values arises in many scientific fields. In this work we explore a family of expressive and interpretable distributions over circle-valued random functions related to Gaussian processes targeting two Euclidean dimensions conditioned on the unit circle. The resulting probability model has connections with continuous spin models in statistical physi… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Contribution to the Structured Probabilistic Inference & Generative Modeling workshop of ICML 2024

  6. arXiv:2406.12409  [pdf, other

    stat.ML cs.LG

    Translation Equivariant Transformer Neural Processes

    Authors: Matthew Ashman, Cristiana Diaconu, Junhyuck Kim, Lakee Sivaraya, Stratis Markou, James Requeima, Wessel P. Bruinsma, Richard E. Turner

    Abstract: The effectiveness of neural processes (NPs) in modelling posterior prediction maps -- the mapping from data to posterior predictive distributions -- has significantly improved since their inception. This improvement can be attributed to two principal factors: (1) advancements in the architecture of permutation invariant set functions, which are intrinsic to all NPs; and (2) leveraging symmetries p… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  7. arXiv:2406.08569  [pdf, other

    cs.LG cs.CR stat.ML

    Noise-Aware Differentially Private Regression via Meta-Learning

    Authors: Ossi Räisä, Stratis Markou, Matthew Ashman, Wessel P. Bruinsma, Marlon Tobaben, Antti Honkela, Richard E. Turner

    Abstract: Many high-stakes applications require machine learning models that protect user privacy and provide well-calibrated, accurate predictions. While Differential Privacy (DP) is the gold standard for protecting user privacy, standard DP mechanisms typically significantly impair performance. One approach to mitigating this issue is pre-training models on simulated data before DP learning on the private… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  8. arXiv:2406.01801  [pdf, other

    stat.ML cs.LG

    Fearless Stochasticity in Expectation Propagation

    Authors: Jonathan So, Richard E. Turner

    Abstract: Expectation propagation (EP) is a family of algorithms for performing approximate inference in probabilistic models. The updates of EP involve the evaluation of moments -- expectations of certain functions -- which can be estimated from Monte Carlo (MC) samples. However, the updates are not robust to MC noise when performed naively, and various prior works have attempted to address this issue in d… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  9. arXiv:2405.16541  [pdf, other

    stat.ML cs.LG

    Variance-Reducing Couplings for Random Features: Perspectives from Optimal Transport

    Authors: Isaac Reid, Stratis Markou, Krzysztof Choromanski, Richard E. Turner, Adrian Weller

    Abstract: Random features (RFs) are a popular technique to scale up kernel methods in machine learning, replacing exact kernel evaluations with stochastic Monte Carlo estimates. They underpin models as diverse as efficient transformers (by approximating attention) to sparse spectrum Gaussian processes (by approximating the covariance function). Efficiency can be further improved by speeding up the convergen… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  10. arXiv:2405.13063  [pdf, other

    physics.ao-ph cs.LG

    Aurora: A Foundation Model of the Atmosphere

    Authors: Cristian Bodnar, Wessel P. Bruinsma, Ana Lucic, Megan Stanley, Johannes Brandstetter, Patrick Garvan, Maik Riechert, Jonathan Weyn, Haiyu Dong, Anna Vaughan, Jayesh K. Gupta, Kit Tambiratnam, Alex Archibald, Elizabeth Heider, Max Welling, Richard E. Turner, Paris Perdikaris

    Abstract: Deep learning foundation models are revolutionizing many facets of science by leveraging vast amounts of data to learn general-purpose representations that can be adapted to tackle diverse downstream tasks. Foundation models hold the promise to also transform our ability to model our planet and its subsystems by exploiting the vast expanse of Earth system data. Here we introduce Aurora, a large-sc… ▽ More

    Submitted 28 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

  11. arXiv:2405.12856  [pdf, other

    stat.ML cs.CL cs.LG

    LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language

    Authors: James Requeima, John Bronskill, Dami Choi, Richard E. Turner, David Duvenaud

    Abstract: Machine learning practitioners often face significant challenges in formally integrating their prior knowledge and beliefs into predictive models, limiting the potential for nuanced and context-aware analyses. Moreover, the expertise needed to integrate this prior knowledge into probabilistic modeling typically limits the application of these models to specialists. Our goal is to build a regressio… ▽ More

    Submitted 25 May, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

  12. arXiv:2404.00411  [pdf, other

    physics.ao-ph cs.LG

    Aardvark weather: end-to-end data-driven weather forecasting

    Authors: Anna Vaughan, Stratis Markou, Will Tebbutt, James Requeima, Wessel P. Bruinsma, Tom R. Andersson, Michael Herzog, Nicholas D. Lane, Matthew Chantry, J. Scott Hosking, Richard E. Turner

    Abstract: Weather forecasting is critical for a range of human activities including transportation, agriculture, industry, as well as the safety of the general public. Machine learning models have the potential to transform the complex weather prediction pipeline, but current approaches still rely on numerical weather prediction (NWP) systems, limiting forecast speed and accuracy. Here we demonstrate that a… ▽ More

    Submitted 13 July, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

  13. arXiv:2403.12977  [pdf, other

    cs.CV cs.LG eess.IV stat.AP

    SportsNGEN: Sustained Generation of Realistic Multi-player Sports Gameplay

    Authors: Lachlan Thorpe, Lewis Bawden, Karanjot Vendal, John Bronskill, Richard E. Turner

    Abstract: We present a transformer decoder based sports simulation engine, SportsNGEN, trained on sports player and ball tracking sequences, that is capable of generating sustained gameplay and accurately mimicking the decision making of real players. By training on a large database of professional tennis tracking data, we demonstrate that simulations produced by SportsNGEN can be used to predict the outcom… ▽ More

    Submitted 28 July, 2024; v1 submitted 9 February, 2024; originally announced March 2024.

  14. arXiv:2403.01946  [pdf, other

    cs.LG

    A Generative Model of Symmetry Transformations

    Authors: James Urquhart Allingham, Bruno Kacper Mlodozeniec, Shreyas Padhy, Javier Antorán, David Krueger, Richard E. Turner, Eric Nalisnick, José Miguel Hernández-Lobato

    Abstract: Correctly capturing the symmetry transformations of data can lead to efficient models with strong generalization capabilities, though methods incorporating symmetries often require prior knowledge. While recent advancements have been made in learning those symmetries directly from the dataset, most of this work has focused on the discriminative setting. In this paper, we take inspiration from grou… ▽ More

    Submitted 20 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  15. arXiv:2402.04384  [pdf, other

    cs.LG stat.ML

    Denoising Diffusion Probabilistic Models in Six Simple Steps

    Authors: Richard E. Turner, Cristiana-Diana Diaconu, Stratis Markou, Aliaksandra Shysheya, Andrew Y. K. Foong, Bruno Mlodozeniec

    Abstract: Denoising Diffusion Probabilistic Models (DDPMs) are a very popular class of deep generative model that have been successfully applied to a diverse range of problems including image and video generation, protein and material synthesis, weather forecasting, and neural surrogates of partial differential equations. Despite their ubiquity it is hard to find an introduction to DDPMs which is simple, co… ▽ More

    Submitted 10 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  16. arXiv:2402.03496  [pdf, other

    cs.LG math.OC

    Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective

    Authors: Wu Lin, Felix Dangel, Runa Eschenhagen, Juhan Bae, Richard E. Turner, Alireza Makhzani

    Abstract: Adaptive gradient optimizers like Adam(W) are the default training algorithms for many deep learning architectures, such as transformers. Their diagonal preconditioner is based on the gradient outer product which is incorporated into the parameter update via a square root. While these methods are often motivated as approximate second-order methods, the square root represents a fundamental differen… ▽ More

    Submitted 30 August, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: A long version of the ICML 2024 paper. Added root-free update schemes for n-dim tensor cases

  17. arXiv:2401.01855  [pdf, other

    cs.LG

    Transformer Neural Autoregressive Flows

    Authors: Massimiliano Patacchiola, Aliaksandra Shysheya, Katja Hofmann, Richard E. Turner

    Abstract: Density estimation, a central problem in machine learning, can be performed using Normalizing Flows (NFs). NFs comprise a sequence of invertible transformations, that turn a complex target distribution into a simple one, by exploiting the change of variables theorem. Neural Autoregressive Flows (NAFs) and Block Neural Autoregressive Flows (B-NAFs) are arguably the most perfomant members of the NF… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  18. arXiv:2312.05705  [pdf, other

    cs.LG stat.ML

    Structured Inverse-Free Natural Gradient: Memory-Efficient & Numerically-Stable KFAC

    Authors: Wu Lin, Felix Dangel, Runa Eschenhagen, Kirill Neklyudov, Agustinus Kristiadi, Richard E. Turner, Alireza Makhzani

    Abstract: Second-order methods such as KFAC can be useful for neural net training. However, they are often memory-inefficient since their preconditioning Kronecker factors are dense, and numerically unstable in low precision as they require matrix inversion or decomposition. These limitations render such methods unpopular for modern mixed-precision training. We address them by (i) formulating an inverse-fre… ▽ More

    Submitted 23 July, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

    Comments: A long version of the ICML 2024 paper, updated the text about a related work

  19. arXiv:2311.16849  [pdf, other

    stat.ML cs.LG

    Identifiable Feature Learning for Spatial Data with Nonlinear ICA

    Authors: Hermanni Hälvä, Jonathan So, Richard E. Turner, Aapo Hyvärinen

    Abstract: Recently, nonlinear ICA has surfaced as a popular alternative to the many heuristic models used in deep representation learning and disentanglement. An advantage of nonlinear ICA is that a sophisticated identifiability theory has been developed; in particular, it has been proven that the original components can be recovered under sufficiently strong latent dependencies. Despite this general theory… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: Work under review

  20. arXiv:2311.09848  [pdf, other

    cs.LG

    Diffusion-Augmented Neural Processes

    Authors: Lorenzo Bonito, James Requeima, Aliaksandra Shysheya, Richard E. Turner

    Abstract: Over the last few years, Neural Processes have become a useful modelling tool in many application areas, such as healthcare and climate sciences, in which data are scarce and prediction uncertainty estimates are indispensable. However, the current state of the art in the field (AR CNPs; Bruinsma et al., 2023) presents a few issues that prevent its widespread deployment. This work proposes an alter… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted to the NeurIPS 2023 Workshop on Diffusion Models

    ACM Class: I.2.6

  21. arXiv:2311.00636  [pdf, other

    cs.LG stat.ML

    Kronecker-Factored Approximate Curvature for Modern Neural Network Architectures

    Authors: Runa Eschenhagen, Alexander Immer, Richard E. Turner, Frank Schneider, Philipp Hennig

    Abstract: The core components of many modern neural network architectures, such as transformers, convolutional, or graph neural networks, can be expressed as linear layers with $\textit{weight-sharing}$. Kronecker-Factored Approximate Curvature (K-FAC), a second-order optimisation method, has shown promise to speed up neural network training and thereby reduce computational costs. However, there is currentl… ▽ More

    Submitted 11 January, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: NeurIPS 2023

  22. arXiv:2310.19932  [pdf, other

    cs.LG physics.ao-ph

    Sim2Real for Environmental Neural Processes

    Authors: Jonas Scholz, Tom R. Andersson, Anna Vaughan, James Requeima, Richard E. Turner

    Abstract: Machine learning (ML)-based weather models have recently undergone rapid improvements. These models are typically trained on gridded reanalysis data from numerical data assimilation systems. However, reanalysis data comes with limitations, such as assumptions about physical laws and low spatiotemporal resolution. The gap between reanalysis and reality has sparked growing interest in training ML mo… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: 4 pages, 3 figures, To be published in Tackling Climate Change with Machine Learning workshop at NeurIPS

  23. arXiv:2310.11837  [pdf, other

    stat.ML cs.LG

    Optimising Distributions with Natural Gradient Surrogates

    Authors: Jonathan So, Richard E. Turner

    Abstract: Natural gradient methods have been used to optimise the parameters of probability distributions in a variety of settings, often resulting in fast-converging procedures. Unfortunately, for many distributions of interest, computing the natural gradient has a number of challenges. In this work we propose a novel technique for tackling such issues, which involves reframing the optimisation as one with… ▽ More

    Submitted 4 March, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Journal ref: PMLR 238 (2024):2224-2232

  24. arXiv:2309.11608  [pdf, other

    cs.AI

    Dataset Factory: A Toolchain For Generative Computer Vision Datasets

    Authors: Daniel Kharitonov, Ryan Turner

    Abstract: Generative AI workflows heavily rely on data-centric tasks - such as filtering samples by annotation fields, vector distances, or scores produced by custom classifiers. At the same time, computer vision datasets are quickly approaching petabyte volumes, rendering data wrangling difficult. In addition, the iterative nature of data preparation necessitates robust dataset sharing and versioning mecha… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: Presented at the datacomp.ai workshop at ICCV 2023

  25. arXiv:2308.05732  [pdf, other

    cs.LG cs.AI

    PDE-Refiner: Achieving Accurate Long Rollouts with Neural PDE Solvers

    Authors: Phillip Lippe, Bastiaan S. Veeling, Paris Perdikaris, Richard E. Turner, Johannes Brandstetter

    Abstract: Time-dependent partial differential equations (PDEs) are ubiquitous in science and engineering. Recently, mostly due to the high computational cost of traditional solution techniques, deep neural network based surrogates have gained increased interest. The practical utility of such neural PDE solvers relies on their ability to provide accurate, stable predictions over long time horizons, which is… ▽ More

    Submitted 21 October, 2023; v1 submitted 10 August, 2023; originally announced August 2023.

    Comments: Project website: https://phlippe.github.io/PDERefiner/

  26. arXiv:2307.05431  [pdf, other

    stat.ML cs.LG

    Geometric Neural Diffusion Processes

    Authors: Emile Mathieu, Vincent Dutordoir, Michael J. Hutchinson, Valentin De Bortoli, Yee Whye Teh, Richard E. Turner

    Abstract: Denoising diffusion models have proven to be a flexible and effective paradigm for generative modelling. Their recent extension to infinite dimensional Euclidean spaces has allowed for the modelling of stochastic processes. However, many problems in the natural sciences incorporate symmetries and involve data living in non-Euclidean spaces. In this work, we extend the framework of diffusion models… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

  27. arXiv:2307.03093  [pdf, other

    cs.LG stat.ML

    Beyond Intuition, a Framework for Applying GPs to Real-World Data

    Authors: Kenza Tazi, Jihao Andreas Lin, Ross Viljoen, Alex Gardner, ST John, Hong Ge, Richard E. Turner

    Abstract: Gaussian Processes (GPs) offer an attractive method for regression over small, structured and correlated datasets. However, their deployment is hindered by computational costs and limited guidelines on how to apply GPs beyond simple low-dimensional datasets. We propose a framework to identify the suitability of GPs to a given problem and how to set up a robust and well-specified GP model. The guid… ▽ More

    Submitted 17 July, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: Accepted at the ICML Workshop on Structured Probabilistic Inference and Generative Modelling (2023)

  28. arXiv:2306.13554  [pdf, other

    cs.LG cs.AI

    Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation

    Authors: Massimiliano Patacchiola, Mingfei Sun, Katja Hofmann, Richard E. Turner

    Abstract: In this paper we explore few-shot imitation learning for control problems, which involves learning to imitate a target policy by accessing a limited set of offline rollouts. This setting has been relatively under-explored despite its relevance to robotics and control applications. State-of-the-art methods developed to tackle few-shot imitation rely on meta-learning, which is expensive to train as… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

  29. arXiv:2304.10557  [pdf, other

    cs.LG cs.AI

    An Introduction to Transformers

    Authors: Richard E. Turner

    Abstract: The transformer is a neural network component that can be used to learn useful representations of sequences or sets of data-points. The transformer has driven recent advances in natural language processing, computer vision, and spatio-temporal modelling. There are many introductions to transformers, but most do not contain precise mathematical descriptions of the architecture and the intuitions be… ▽ More

    Submitted 8 February, 2024; v1 submitted 20 April, 2023; originally announced April 2023.

  30. arXiv:2303.14468  [pdf, other

    stat.ML cs.LG

    Autoregressive Conditional Neural Processes

    Authors: Wessel P. Bruinsma, Stratis Markou, James Requiema, Andrew Y. K. Foong, Tom R. Andersson, Anna Vaughan, Anthony Buonomo, J. Scott Hosking, Richard E. Turner

    Abstract: Conditional neural processes (CNPs; Garnelo et al., 2018a) are attractive meta-learning models which produce well-calibrated predictions and are trainable via a simple maximum likelihood procedure. Although CNPs have many advantages, they are unable to model dependencies in their predictions. Various works propose solutions to this, but these come at the cost of either requiring approximate infere… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: 57 pages; accepted to the 11th International Conference on Learning Representations (ICLR 2023)

  31. arXiv:2303.13199  [pdf, other

    cs.CV

    First Session Adaptation: A Strong Replay-Free Baseline for Class-Incremental Learning

    Authors: Aristeidis Panos, Yuriko Kobe, Daniel Olmeda Reino, Rahaf Aljundi, Richard E. Turner

    Abstract: In Class-Incremental Learning (CIL) an image classification system is exposed to new classes in each learning session and must be updated incrementally. Methods approaching this problem have updated both the classification head and the feature extractor body at each session of CIL. In this work, we develop a baseline method, First Session Adaptation (FSA), that sheds light on the efficacy of exist… ▽ More

    Submitted 12 January, 2024; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: accepted at ICCV 23

  32. arXiv:2302.01190  [pdf, other

    stat.ML cs.CR cs.LG

    On the Efficacy of Differentially Private Few-shot Image Classification

    Authors: Marlon Tobaben, Aliaksandra Shysheya, John Bronskill, Andrew Paverd, Shruti Tople, Santiago Zanella-Beguelin, Richard E Turner, Antti Honkela

    Abstract: There has been significant recent progress in training differentially private (DP) models which achieve accuracy that approaches the best non-private models. These DP models are typically pretrained on large public datasets and then fine-tuned on private downstream datasets that are relatively large and similar in distribution to the pretraining data. However, in many applications including person… ▽ More

    Submitted 19 December, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: 49 pages, 24 figures; published in TMLR 12/2023 https://openreview.net/forum?id=hFsr59Imzm

    Journal ref: Transactions on Machine Learning Research, ISSN 2835-8856, 2023

  33. arXiv:2211.12990  [pdf, other

    cs.LG cs.CR

    Adversarial Attacks are a Surprisingly Strong Baseline for Poisoning Few-Shot Meta-Learners

    Authors: Elre T. Oldewage, John Bronskill, Richard E. Turner

    Abstract: This paper examines the robustness of deployed few-shot meta-learning systems when they are fed an imperceptibly perturbed few-shot dataset. We attack amortized meta-learners, which allows us to craft colluding sets of inputs that are tailored to fool the system's learning algorithm when used as training data. Jointly crafted adversarial inputs might be expected to synergistically manipulate a cla… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: Accepted at I Can't Believe It's Not Better Workshop, Neurips 2022

  34. arXiv:2211.10381  [pdf, other

    stat.ML cs.LG

    Environmental Sensor Placement with Convolutional Gaussian Neural Processes

    Authors: Tom R. Andersson, Wessel P. Bruinsma, Stratis Markou, James Requeima, Alejandro Coca-Castro, Anna Vaughan, Anna-Louise Ellis, Matthew A. Lazzara, Dani Jones, J. Scott Hosking, Richard E. Turner

    Abstract: Environmental sensors are crucial for monitoring weather conditions and the impacts of climate change. However, it is challenging to place sensors in a way that maximises the informativeness of their measurements, particularly in remote regions like Antarctica. Probabilistic machine learning models can suggest informative sensor placements by finding sites that maximally reduce prediction uncertai… ▽ More

    Submitted 15 May, 2023; v1 submitted 18 November, 2022; originally announced November 2022.

    Comments: Accepted in Environmental Data Science (Climate Informatics 2023 Special Issue)

  35. arXiv:2210.16568  [pdf, other

    stat.ML cs.LG

    Ice Core Dating using Probabilistic Programming

    Authors: Aditya Ravuri, Tom R. Andersson, Ieva Kazlauskaite, Will Tebbutt, Richard E. Turner, J. Scott Hosking, Neil D. Lawrence, Markus Kaiser

    Abstract: Ice cores record crucial information about past climate. However, before ice core data can have scientific value, the chronology must be inferred by estimating the age as a function of depth. Under certain conditions, chemicals locked in the ice display quasi-periodic cycles that delineate annual layers. Manually counting these noisy seasonal patterns to infer the chronology can be an imperfect an… ▽ More

    Submitted 29 October, 2022; originally announced October 2022.

  36. arXiv:2209.11595  [pdf, other

    cs.LG cs.CR stat.ML

    Differentially private partitioned variational inference

    Authors: Mikko A. Heikkilä, Matthew Ashman, Siddharth Swaroop, Richard E. Turner, Antti Honkela

    Abstract: Learning a privacy-preserving model from sensitive data which are distributed across multiple devices is an increasingly important problem. The problem is often formulated in the federated learning context, with the aim of learning a single global model while keeping the data distributed. Moreover, Bayesian learning is a popular approach for modelling, since it naturally supports reliable uncertai… ▽ More

    Submitted 18 April, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: Published in TMLR 04/2023: https://openreview.net/forum?id=55BcghgicI

    Journal ref: Transactions on Machine Learning Research, ISSN 2835-8856, 2023

  37. arXiv:2209.04947  [pdf, other

    cs.LG stat.AP stat.ML

    Kernel Learning for Explainable Climate Science

    Authors: Vidhi Lalchand, Kenza Tazi, Talay M. Cheema, Richard E. Turner, Scott Hosking

    Abstract: The Upper Indus Basin, Himalayas provides water for 270 million people and countless ecosystems. However, precipitation, a key component to hydrological modelling, is poorly understood in this area. A key challenge surrounding this uncertainty comes from the complex spatial-temporal distribution of precipitation across the basin. In this work we propose Gaussian processes with structured non-stati… ▽ More

    Submitted 16 July, 2023; v1 submitted 11 September, 2022; originally announced September 2022.

    Comments: 16th Bayesian Modelling Applications Workshop at UAI, 2022 (Eindhoven, Netherlands)

  38. arXiv:2209.00517  [pdf, other

    cs.LG cs.CV

    The Neural Process Family: Survey, Applications and Perspectives

    Authors: Saurav Jha, Dong Gong, Xuesong Wang, Richard E. Turner, Lina Yao

    Abstract: The standard approaches to neural network implementation yield powerful function approximation capabilities but are limited in their abilities to learn meta representations and reason probabilistic uncertainties in their predictions. Gaussian processes, on the other hand, adopt the Bayesian learning scheme to estimate such uncertainties but are constrained by their efficiency and approximation cap… ▽ More

    Submitted 2 October, 2023; v1 submitted 1 September, 2022; originally announced September 2022.

    Comments: Work under review

  39. arXiv:2209.00357  [pdf, other

    cs.SE

    Testing Causality in Scientific Modelling Software

    Authors: Andrew G. Clark, Michael Foster, Benedikt Prifling, Neil Walkinshaw, Robert M. Hierons, Volker Schmidt, Robert D. Turner

    Abstract: From simulating galaxy formation to viral transmission in a pandemic, scientific models play a pivotal role in developing scientific theories and supporting government policy decisions that affect us all. Given these critical applications, a poor modelling assumption or bug could have far-reaching consequences. However, scientific models possess several properties that make them notoriously diffic… ▽ More

    Submitted 30 June, 2023; v1 submitted 1 September, 2022; originally announced September 2022.

    ACM Class: D.2.5; I.6.4

  40. arXiv:2207.03227  [pdf, other

    cs.LG cs.AI stat.ML

    Challenges and Pitfalls of Bayesian Unlearning

    Authors: Ambrish Rawat, James Requeima, Wessel Bruinsma, Richard Turner

    Abstract: Machine unlearning refers to the task of removing a subset of training data, thereby removing its contributions to a trained model. Approximate unlearning are one class of methods for this task which avoid the need to retrain the model from scratch on the retained data. Bayes' rule can be used to cast approximate unlearning as an inference problem where the objective is to obtain the updated poste… ▽ More

    Submitted 13 September, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

    Comments: 5 pages, 3 figures, Updatable ML (UpML) Workshop, International Conference on Machine Learning (ICML) 2022

  41. arXiv:2206.09843  [pdf, other

    cs.CV cs.LG

    Contextual Squeeze-and-Excitation for Efficient Few-Shot Image Classification

    Authors: Massimiliano Patacchiola, John Bronskill, Aliaksandra Shysheya, Katja Hofmann, Sebastian Nowozin, Richard E. Turner

    Abstract: Recent years have seen a growth in user-centric applications that require effective knowledge transfer across tasks in the low-data regime. An example is personalization, where a pretrained system is adapted by learning on small amounts of labeled data belonging to a specific user. This setting requires high accuracy under low computational complexity, therefore the Pareto frontier of accuracy vs.… ▽ More

    Submitted 11 January, 2023; v1 submitted 20 June, 2022; originally announced June 2022.

    Comments: Advances in Neural Information Processing Systems (NeurIPS 2022)

  42. arXiv:2206.08671  [pdf, other

    stat.ML cs.CV cs.LG

    FiT: Parameter Efficient Few-shot Transfer Learning for Personalized and Federated Image Classification

    Authors: Aliaksandra Shysheya, John Bronskill, Massimiliano Patacchiola, Sebastian Nowozin, Richard E Turner

    Abstract: Modern deep learning systems are increasingly deployed in situations such as personalization and federated learning where it is necessary to support i) learning on small amounts of data, and ii) communication efficient distributed training protocols. In this work, we develop FiLM Transfer (FiT) which fulfills these requirements in the image classification setting by combining ideas from transfer l… ▽ More

    Submitted 2 February, 2023; v1 submitted 17 June, 2022; originally announced June 2022.

    Journal ref: The Eleventh International Conference on Learning Representations (ICLR 2023)

  43. arXiv:2205.08875  [pdf, other

    cs.LG cs.CY

    Multi-disciplinary fairness considerations in machine learning for clinical trials

    Authors: Isabel Chien, Nina Deliu, Richard E. Turner, Adrian Weller, Sofia S. Villar, Niki Kilbertus

    Abstract: While interest in the application of machine learning to improve healthcare has grown tremendously in recent years, a number of barriers prevent deployment in medical practice. A notable concern is the potential to exacerbate entrenched biases and existing health disparities in society. The area of fairness in machine learning seeks to address these issues of equity; however, appropriate approache… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: Appeared at ACM FAccT 2022

  44. Visualization for Epidemiological Modelling: Challenges, Solutions, Reflections & Recommendations

    Authors: Jason Dykes, Alfie Abdul-Rahman, Daniel Archambault, Benjamin Bach, Rita Borgo, Min Chen, Jessica Enright, Hui Fang, Elif E. Firat, Euan Freeman, Tuna Gonen, Claire Harris, Radu Jianu, Nigel W. John, Saiful Khan, Andrew Lahiff, Robert S. Laramee, Louise Matthews, Sibylle Mohr, Phong H. Nguyen, Alma A. M. Rahat, Richard Reeve, Panagiotis D. Ritsos, Jonathan C. Roberts, Aidan Slingsby , et al. (8 additional authors not shown)

    Abstract: We report on an ongoing collaboration between epidemiological modellers and visualization researchers by documenting and reflecting upon knowledge constructs -- a series of ideas, approaches and methods taken from existing visualization research and practice -- deployed and developed to support modelling of the COVID-19 pandemic. Structured independent commentary on these efforts is synthesized th… ▽ More

    Submitted 20 June, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Journal ref: RSTA: Special Issue - Technical challenges of modelling real-life epidemics and examples of overcoming these, 380(2233) 2022

  45. arXiv:2203.08775  [pdf, other

    stat.ML cs.LG

    Practical Conditional Neural Processes Via Tractable Dependent Predictions

    Authors: Stratis Markou, James Requeima, Wessel P. Bruinsma, Anna Vaughan, Richard E. Turner

    Abstract: Conditional Neural Processes (CNPs; Garnelo et al., 2018a) are meta-learning models which leverage the flexibility of deep learning to produce well-calibrated predictions and naturally handle off-the-grid and missing data. CNPs scale to large datasets and train with ease. Due to these features, CNPs appear well-suited to tasks from environmental sciences or healthcare. Unfortunately, CNPs do not p… ▽ More

    Submitted 13 June, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: 23 pages; accepted to the 10th International Conference on Learning Representations (ICLR 2022)

  46. arXiv:2203.06997  [pdf, other

    stat.ML cs.LG eess.SP

    Modelling Non-Smooth Signals with Complex Spectral Structure

    Authors: Wessel P. Bruinsma, Martin Tegnér, Richard E. Turner

    Abstract: The Gaussian Process Convolution Model (GPCM; Tobar et al., 2015a) is a model for signals with complex spectral structure. A significant limitation of the GPCM is that it assumes a rapidly decaying spectrum: it can only model smooth signals. Moreover, inference in the GPCM currently requires (1) a mean-field assumption, resulting in poorly calibrated uncertainties, and (2) a tedious variational op… ▽ More

    Submitted 14 April, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: 30 pages, accepted to the 25th International Conference on Artificial Intelligence and Statistics (AISTATS) 2022

  47. arXiv:2202.12275  [pdf, other

    stat.ML cs.LG

    Partitioned Variational Inference: A Framework for Probabilistic Federated Learning

    Authors: Matthew Ashman, Thang D. Bui, Cuong V. Nguyen, Stratis Markou, Adrian Weller, Siddharth Swaroop, Richard E. Turner

    Abstract: The proliferation of computing devices has brought about an opportunity to deploy machine learning models on new problem domains using previously inaccessible data. Traditional algorithms for training such models often require data to be stored on a single machine with compute performed by a single node, making them unsuitable for decentralised training on multiple devices. This deficiency has mot… ▽ More

    Submitted 28 April, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:1811.11206

  48. FAIR Data Pipeline: provenance-driven data management for traceable scientific workflows

    Authors: Sonia Natalie Mitchell, Andrew Lahiff, Nathan Cummings, Jonathan Hollocombe, Bram Boskamp, Ryan Field, Dennis Reddyhoff, Kristian Zarebski, Antony Wilson, Bruno Viola, Martin Burke, Blair Archibald, Paul Bessell, Richard Blackwell, Lisa A Boden, Alys Brett, Sam Brett, Ruth Dundas, Jessica Enright, Alejandra N. Gonzalez-Beltran, Claire Harris, Ian Hinder, Christopher David Hughes, Martin Knight, Vino Mano , et al. (13 additional authors not shown)

    Abstract: Modern epidemiological analyses to understand and combat the spread of disease depend critically on access to, and use of, data. Rapidly evolving data, such as data streams changing during a disease outbreak, are particularly challenging. Data management is further complicated by data being imprecisely identified when used. Public trust in policy decisions resulting from such analyses is easily da… ▽ More

    Submitted 4 May, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

  49. arXiv:2108.09676  [pdf, other

    cs.LG stat.ML

    Efficient Gaussian Neural Processes for Regression

    Authors: Stratis Markou, James Requeima, Wessel Bruinsma, Richard Turner

    Abstract: Conditional Neural Processes (CNP; Garnelo et al., 2018) are an attractive family of meta-learning models which produce well-calibrated predictions, enable fast inference at test time, and are trainable via a simple maximum likelihood procedure. A limitation of CNPs is their inability to model dependencies in the outputs. This significantly hurts predictive performance and renders it impossible to… ▽ More

    Submitted 18 October, 2021; v1 submitted 22 August, 2021; originally announced August 2021.

    Comments: 6 pages

  50. arXiv:2107.01105  [pdf, other

    stat.ML cs.LG

    Memory Efficient Meta-Learning with Large Images

    Authors: John Bronskill, Daniela Massiceti, Massimiliano Patacchiola, Katja Hofmann, Sebastian Nowozin, Richard E. Turner

    Abstract: Meta learning approaches to few-shot classification are computationally efficient at test time, requiring just a few optimization steps or single forward pass to learn a new task, but they remain highly memory-intensive to train. This limitation arises because a task's entire support set, which can contain up to 1000 images, must be processed before an optimization step can be taken. Harnessing th… ▽ More

    Submitted 26 October, 2021; v1 submitted 2 July, 2021; originally announced July 2021.

    Journal ref: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)