Zum Hauptinhalt springen

Showing 1–18 of 18 results for author: Lawson, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.19230  [pdf, other

    stat.ML cs.LG

    Valid Conformal Prediction for Dynamic GNNs

    Authors: Ed Davis, Ian Gallagher, Daniel John Lawson, Patrick Rubin-Delanchy

    Abstract: Graph neural networks (GNNs) are powerful black-box models which have shown impressive empirical performance. However, without any form of uncertainty quantification, it can be difficult to trust such models in high-risk scenarios. Conformal prediction aims to address this problem, however, an assumption of exchangeability is required for its validity which has limited its applicability to static… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 17 pages, 8 figures

    MSC Class: 62H30

  2. arXiv:2312.06506  [pdf, ps, other

    cs.LO

    The Directed Van Kampen Theorem in Lean

    Authors: Henning Basold, Peter Bruin, Dominique Lawson

    Abstract: Directed topology is an area of mathematics with applications in concurrency. It extends the concept of a topological space by adding a notion of directedness, which restricts how paths can evolve through a space and enables thereby a faithful representation of computation with their direction. In this paper, we present a Lean formalisation of directed spaces and a Van Kampen theorem for them. Thi… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    MSC Class: 03B35 ACM Class: F.4.1; F.1.2

  3. arXiv:2311.09251  [pdf, other

    cs.SI cs.LG stat.ML

    A Simple and Powerful Framework for Stable Dynamic Network Embedding

    Authors: Ed Davis, Ian Gallagher, Daniel John Lawson, Patrick Rubin-Delanchy

    Abstract: In this paper, we address the problem of dynamic network embedding, that is, representing the nodes of a dynamic network as evolving vectors within a low-dimensional space. While the field of static network embedding is wide and established, the field of dynamic network embedding is comparatively in its infancy. We propose that a wide class of established static network embedding methods can be us… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 33 pages, 9 figures

    MSC Class: 62H15 (Primary) 62H30; 62M10; 62G99 (Secondary)

  4. arXiv:2308.14864  [pdf, other

    cs.LG cs.AI stat.ML

    NAS-X: Neural Adaptive Smoothing via Twisting

    Authors: Dieterich Lawson, Michael Li, Scott Linderman

    Abstract: Sequential latent variable models (SLVMs) are essential tools in statistics and machine learning, with applications ranging from healthcare to neuroscience. As their flexibility increases, analytic inference and model learning can become challenging, necessitating approximate methods. Here we introduce neural adaptive smoothing via twisting (NAS-X), a method that extends reweighted wake-sleep (RWS… ▽ More

    Submitted 30 October, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: Updating for clarity and adding new baselines

  5. arXiv:2303.07551  [pdf, other

    cs.LG cs.AI

    Merging Decision Transformers: Weight Averaging for Forming Multi-Task Policies

    Authors: Daniel Lawson, Ahmed H. Qureshi

    Abstract: Recent work has shown the promise of creating generalist, transformer-based, models for language, vision, and sequential decision-making problems. To create such models, we generally require centralized training objectives, data, and compute. It is of interest if we can more flexibly create generalist policies by merging together multiple, task-specific, individually trained policies. In this work… ▽ More

    Submitted 22 September, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

  6. arXiv:2303.01346  [pdf, other

    cs.RO cs.LG eess.SY

    Co-learning Planning and Control Policies Constrained by Differentiable Logic Specifications

    Authors: Zikang Xiong, Daniel Lawson, Joe Eappen, Ahmed H. Qureshi, Suresh Jagannathan

    Abstract: Synthesizing planning and control policies in robotics is a fundamental task, further complicated by factors such as complex logic specifications and high-dimensional robot dynamics. This paper presents a novel reinforcement learning approach to solving high-dimensional robot navigation tasks with complex logic specifications by co-learning planning and control policies. Notably, this approach sig… ▽ More

    Submitted 1 October, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

  7. arXiv:2211.06407  [pdf, other

    cs.RO cs.AI cs.LG

    Control Transformer: Robot Navigation in Unknown Environments through PRM-Guided Return-Conditioned Sequence Modeling

    Authors: Daniel Lawson, Ahmed H. Qureshi

    Abstract: Learning long-horizon tasks such as navigation has presented difficult challenges for successfully applying reinforcement learning to robotics. From another perspective, under known environments, sampling-based planning can robustly find collision-free paths in environments without learning. In this work, we propose Control Transformer that models return-conditioned sequences from low-level polici… ▽ More

    Submitted 13 July, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

  8. arXiv:2206.05952  [pdf, other

    cs.LG cs.AI stat.ML

    SIXO: Smoothing Inference with Twisted Objectives

    Authors: Dieterich Lawson, Allan Raventós, Andrew Warrington, Scott Linderman

    Abstract: Sequential Monte Carlo (SMC) is an inference algorithm for state space models that approximates the posterior by sampling from a sequence of target distributions. The target distributions are often chosen to be the filtering distributions, but these ignore information from future observations, leading to practical and theoretical limitations in inference and model learning. We introduce SIXO, a me… ▽ More

    Submitted 20 June, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: v2: Updates for clarity throughout. Results unchanged

  9. arXiv:2110.04629  [pdf, other

    cs.LG cs.AI stat.ML

    The Neural Testbed: Evaluating Joint Predictions

    Authors: Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Botao Hao, Morteza Ibrahimi, Dieterich Lawson, Xiuyuan Lu, Brendan O'Donoghue, Benjamin Van Roy

    Abstract: Predictive distributions quantify uncertainties ignored by point estimates. This paper introduces The Neural Testbed: an open-source benchmark for controlled and principled evaluation of agents that generate such predictions. Crucially, the testbed assesses agents not only on the quality of their marginal predictions per input, but also on their joint predictions across many inputs. We evaluate a… ▽ More

    Submitted 1 November, 2022; v1 submitted 9 October, 2021; originally announced October 2021.

  10. arXiv:1910.14265  [pdf, other

    cs.LG stat.ML

    Energy-Inspired Models: Learning with Sampler-Induced Distributions

    Authors: Dieterich Lawson, George Tucker, Bo Dai, Rajesh Ranganath

    Abstract: Energy-based models (EBMs) are powerful probabilistic models, but suffer from intractable sampling and density evaluation due to the partition function. As a result, inference in EBMs relies on approximate sampling algorithms, leading to a mismatch between the model and inference. Motivated by this, we consider the sampler-induced distribution as the model of interest and maximize the likelihood o… ▽ More

    Submitted 9 January, 2020; v1 submitted 31 October, 2019; originally announced October 2019.

    Comments: Presented at the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  11. arXiv:1810.04152  [pdf, other

    cs.LG stat.ML

    Doubly Reparameterized Gradient Estimators for Monte Carlo Objectives

    Authors: George Tucker, Dieterich Lawson, Shixiang Gu, Chris J. Maddison

    Abstract: Deep latent variable models have become a popular model choice due to the scalable learning algorithms introduced by (Kingma & Welling, 2013; Rezende et al., 2014). These approaches maximize a variational lower bound on the intractable log likelihood of the observed data. Burda et al. (2015) introduced a multi-sample variational bound, IWAE, that is at least as tight as the standard variational lo… ▽ More

    Submitted 19 November, 2018; v1 submitted 9 October, 2018; originally announced October 2018.

  12. arXiv:1706.06428  [pdf, other

    cs.CL cs.LG stat.ML

    An online sequence-to-sequence model for noisy speech recognition

    Authors: Chung-Cheng Chiu, Dieterich Lawson, Yuping Luo, George Tucker, Kevin Swersky, Ilya Sutskever, Navdeep Jaitly

    Abstract: Generative models have long been the dominant approach for speech recognition. The success of these models however relies on the use of sophisticated recipes and complicated machinery that is not easily accessible to non-practitioners. Recent innovations in Deep Learning have given rise to an alternative - discriminative models called Sequence-to-Sequence models, that can almost match the accuracy… ▽ More

    Submitted 16 June, 2017; originally announced June 2017.

    Comments: arXiv admin note: substantial text overlap with arXiv:1608.01281

  13. arXiv:1705.09279  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Filtering Variational Objectives

    Authors: Chris J. Maddison, Dieterich Lawson, George Tucker, Nicolas Heess, Mohammad Norouzi, Andriy Mnih, Arnaud Doucet, Yee Whye Teh

    Abstract: When used as a surrogate objective for maximum likelihood estimation in latent variable models, the evidence lower bound (ELBO) produces state-of-the-art results. Inspired by this, we consider the extension of the ELBO to a family of lower bounds defined by a particle filter's estimator of the marginal likelihood, the filtering variational objectives (FIVOs). FIVOs take the same arguments as the E… ▽ More

    Submitted 12 November, 2017; v1 submitted 25 May, 2017; originally announced May 2017.

  14. arXiv:1705.05524  [pdf, other

    cs.AI cs.LG stat.ML

    Learning Hard Alignments with Variational Inference

    Authors: Dieterich Lawson, Chung-Cheng Chiu, George Tucker, Colin Raffel, Kevin Swersky, Navdeep Jaitly

    Abstract: There has recently been significant interest in hard attention models for tasks such as object recognition, visual captioning and speech recognition. Hard attention can offer benefits over soft attention such as decreased computational cost, but training hard attention models can be difficult because of the discrete latent variables they introduce. Previous work used REINFORCE and Q-learning to ap… ▽ More

    Submitted 1 November, 2017; v1 submitted 16 May, 2017; originally announced May 2017.

  15. arXiv:1703.07370  [pdf, other

    cs.LG stat.ML

    REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models

    Authors: George Tucker, Andriy Mnih, Chris J. Maddison, Dieterich Lawson, Jascha Sohl-Dickstein

    Abstract: Learning in models with discrete latent variables is challenging due to high variance gradient estimators. Generally, approaches have relied on control variates to reduce the variance of the REINFORCE estimator. Recent work (Jang et al. 2016, Maddison et al. 2016) has taken a different approach, introducing a continuous relaxation of discrete variables to produce low-variance, but biased, gradient… ▽ More

    Submitted 6 November, 2017; v1 submitted 21 March, 2017; originally announced March 2017.

    Comments: NIPS 2017

  16. arXiv:1703.05820  [pdf, other

    cs.LG cs.AI

    Particle Value Functions

    Authors: Chris J. Maddison, Dieterich Lawson, George Tucker, Nicolas Heess, Arnaud Doucet, Andriy Mnih, Yee Whye Teh

    Abstract: The policy gradients of the expected return objective can react slowly to rare rewards. Yet, in some cases agents may wish to emphasize the low or high returns regardless of their probability. Borrowing from the economics and control literature, we review the risk-sensitive value function that arises from an exponential utility and illustrate its effects on an example. This risk-sensitive value fu… ▽ More

    Submitted 16 March, 2017; originally announced March 2017.

  17. arXiv:1702.07780  [pdf, other

    stat.ML cs.LG

    Changing Model Behavior at Test-Time Using Reinforcement Learning

    Authors: Augustus Odena, Dieterich Lawson, Christopher Olah

    Abstract: Machine learning models are often used at test-time subject to constraints and trade-offs not present at training-time. For example, a computer vision model operating on an embedded device may need to perform real-time inference, or a translation model operating on a cell phone may wish to bound its average compute time in order to be power-efficient. In this work we describe a mixture-of-experts… ▽ More

    Submitted 24 February, 2017; originally announced February 2017.

    Comments: Submitted to ICLR 2017 Workshop Track

  18. arXiv:1702.06914  [pdf, other

    cs.LG

    Training a Subsampling Mechanism in Expectation

    Authors: Colin Raffel, Dieterich Lawson

    Abstract: We describe a mechanism for subsampling sequences and show how to compute its expected output so that it can be trained with standard backpropagation. We test this approach on a simple toy problem and discuss its shortcomings.

    Submitted 7 April, 2017; v1 submitted 22 February, 2017; originally announced February 2017.

    Comments: Camera-ready version. Includes additional figures in an appendix