Zum Hauptinhalt springen

Showing 1–6 of 6 results for author: Midgley, L I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2308.10364  [pdf, other

    cs.LG physics.comp-ph

    SE(3) Equivariant Augmented Coupling Flows

    Authors: Laurence I. Midgley, Vincent Stimper, Javier Antorán, Emile Mathieu, Bernhard Schölkopf, José Miguel Hernández-Lobato

    Abstract: Coupling normalizing flows allow for fast sampling and density evaluation, making them the tool of choice for probabilistic modeling of physical systems. However, the standard coupling architecture precludes endowing flows that operate on the Cartesian coordinates of atoms with the SE(3) and permutation invariances of physical systems. This work proposes a coupling flow that preserves SE(3) and pe… ▽ More

    Submitted 5 March, 2024; v1 submitted 20 August, 2023; originally announced August 2023.

  2. arXiv:2306.09884  [pdf, other

    cs.LG cs.AI

    Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX

    Authors: Clément Bonnet, Daniel Luo, Donal Byrne, Shikha Surana, Sasha Abramowitz, Paul Duckworth, Vincent Coyette, Laurence I. Midgley, Elshadai Tegegn, Tristan Kalloniatis, Omayma Mahjoub, Matthew Macfarlane, Andries P. Smit, Nathan Grinsztajn, Raphael Boige, Cemlyn N. Waters, Mohamed A. Mimouni, Ulrich A. Mbou Sob, Ruan de Kock, Siddarth Singh, Daniel Furelos-Blanco, Victor Le, Arnu Pretorius, Alexandre Laterre

    Abstract: Open-source reinforcement learning (RL) environments have played a crucial role in driving progress in the development of AI algorithms. In modern RL research, there is a need for simulated environments that are performant, scalable, and modular to enable their utilization in a wider range of potential real-world applications. Therefore, we present Jumanji, a suite of diverse RL environments speci… ▽ More

    Submitted 15 March, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: 9 pages + 21 pages of appendices and references. Published at ICLR 2024

  3. arXiv:2211.04327  [pdf, other

    cs.LG cs.CE

    Synthesis of separation processes with reinforcement learning

    Authors: Stephan C. P. A. van Kalmthout, Laurence I. Midgley, Meik B. Franke

    Abstract: This paper shows the implementation of reinforcement learning (RL) in commercial flowsheet simulator software (Aspen Plus V12) for designing and optimising a distillation sequence. The aim of the SAC agent was to separate a hydrocarbon mixture in its individual components by utilising distillation. While doing so it tries to maximise the profit produced by the distillation sequence. All actions of… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

  4. arXiv:2208.01893  [pdf, other

    cs.LG q-bio.QM stat.ML

    Flow Annealed Importance Sampling Bootstrap

    Authors: Laurence Illing Midgley, Vincent Stimper, Gregor N. C. Simm, Bernhard Schölkopf, José Miguel Hernández-Lobato

    Abstract: Normalizing flows are tractable density models that can approximate complicated target distributions, e.g. Boltzmann distributions of physical systems. However, current methods for training flows either suffer from mode-seeking behavior, use samples from the target generated beforehand by expensive MCMC methods, or use stochastic losses that have high variance. To avoid these problems, we augment… ▽ More

    Submitted 7 March, 2023; v1 submitted 3 August, 2022; originally announced August 2022.

  5. arXiv:2111.11510  [pdf, other

    cs.LG cs.AI stat.ML

    Bootstrap Your Flow

    Authors: Laurence Illing Midgley, Vincent Stimper, Gregor N. C. Simm, José Miguel Hernández-Lobato

    Abstract: Normalizing flows are flexible, parameterized distributions that can be used to approximate expectations from intractable distributions via importance sampling. However, current flow-based approaches are limited on challenging targets where they either suffer from mode seeking behaviour or high variance in the training loss, or rely on samples from the target distribution, which may not be availab… ▽ More

    Submitted 14 March, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

  6. arXiv:2009.13265  [pdf, other

    cs.LG eess.SY

    Deep Reinforcement Learning for Process Synthesis

    Authors: Laurence Illing Midgley

    Abstract: This paper demonstrates the application of reinforcement learning (RL) to process synthesis by presenting Distillation Gym, a set of RL environments in which an RL agent is tasked with designing a distillation train, given a user defined multi-component feed stream. Distillation Gym interfaces with a process simulator (COCO and ChemSep) to simulate the environment. A demonstration of two distillat… ▽ More

    Submitted 23 September, 2020; originally announced September 2020.