Zum Hauptinhalt springen

Showing 1–16 of 16 results for author: Brehmer, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14995  [pdf, other

    cs.LG cs.NI eess.SP stat.ML

    Probabilistic and Differentiable Wireless Simulation with Geometric Transformers

    Authors: Thomas Hehn, Markus Peschl, Tribhuvanesh Orekondy, Arash Behboodi, Johann Brehmer

    Abstract: Modelling the propagation of electromagnetic signals is critical for designing modern communication systems. While there are precise simulators based on ray tracing, they do not lend themselves to solving inverse problems or the integration in an automated design loop. We propose to address these challenges through differentiable neural surrogates that exploit the geometric aspects of the problem.… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2405.14806  [pdf, other

    physics.data-an cs.LG hep-ph stat.ML

    Lorentz-Equivariant Geometric Algebra Transformers for High-Energy Physics

    Authors: Jonas Spinner, Victor Bresó, Pim de Haan, Tilman Plehn, Jesse Thaler, Johann Brehmer

    Abstract: Extracting scientific understanding from particle-physics experiments requires solving diverse learning problems with high precision and good data efficiency. We propose the Lorentz Geometric Algebra Transformer (L-GATr), a new multi-purpose architecture for high-energy physics. L-GATr represents high-energy data in a geometric algebra over four-dimensional space-time and is equivariant under Lore… ▽ More

    Submitted 9 July, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 10+12 pages, 5+2 figures, 2 tables, v2: Extend acknowledgements, add link to github repo

    Report number: MIT-CTP/5723

  3. arXiv:2312.03881  [pdf, other

    cs.LG cs.AI

    FoMo Rewards: Can we cast foundation models as reward functions?

    Authors: Ekdeep Singh Lubana, Johann Brehmer, Pim de Haan, Taco Cohen

    Abstract: We explore the viability of casting foundation models as generic reward functions for reinforcement learning. To this end, we propose a simple pipeline that interfaces an off-the-shelf vision model with a large language model. Specifically, given a trajectory of observations, we infer the likelihood of an instruction describing the task that the user wants an agent to perform. We show that this ge… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Accepted to NeurIPS FMDM workshop

  4. arXiv:2311.04744  [pdf, other

    cs.LG cs.AI

    Euclidean, Projective, Conformal: Choosing a Geometric Algebra for Equivariant Transformers

    Authors: Pim de Haan, Taco Cohen, Johann Brehmer

    Abstract: The Geometric Algebra Transformer (GATr) is a versatile architecture for geometric deep learning based on projective geometric algebra. We generalize this architecture into a blueprint that allows one to construct a scalable transformer architecture given any geometric (or Clifford) algebra. We study versions of this architecture for Euclidean, projective, and conformal algebras, all of which are… ▽ More

    Submitted 14 March, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: Accepted to AISTATS 2024

  5. arXiv:2305.18415  [pdf, other

    cs.LG cs.RO stat.ML

    Geometric Algebra Transformer

    Authors: Johann Brehmer, Pim de Haan, Sönke Behrends, Taco Cohen

    Abstract: Problems involving geometric data arise in physics, chemistry, robotics, computer vision, and many other fields. Such data can take numerous forms, for instance points, direction vectors, translations, or rotations, but to date there is no single architecture that can be applied to such a wide variety of geometric types while respecting their symmetries. In this paper we introduce the Geometric Al… ▽ More

    Submitted 20 November, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

    Comments: Published at NeurIPS 2023, implementation available at https://github.com/qualcomm-ai-research/geometric-algebra-transformer . v3: matches camera-ready version

  6. arXiv:2303.12410  [pdf, other

    cs.LG cs.RO stat.ML

    EDGI: Equivariant Diffusion for Planning with Embodied Agents

    Authors: Johann Brehmer, Joey Bose, Pim de Haan, Taco Cohen

    Abstract: Embodied agents operate in a structured world, often solving tasks with spatial, temporal, and permutation symmetries. Most algorithms for planning and model-based reinforcement learning (MBRL) do not take this rich geometric structure into account, leading to sample inefficiency and poor generalization. We introduce the Equivariant Diffuser for Generating Interactions (EDGI), an algorithm for MBR… ▽ More

    Submitted 19 October, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: Accepted at NeurIPS 2023. v2: matches camera-ready version

  7. arXiv:2211.02667  [pdf, other

    cs.LG stat.ML

    Deconfounding Imitation Learning with Variational Inference

    Authors: Risto Vuorio, Pim de Haan, Johann Brehmer, Hanno Ackermann, Daniel Dijkman, Taco Cohen

    Abstract: Standard imitation learning can fail when the expert demonstrators have different sensory inputs than the imitating agent. This is because partial observability gives rise to hidden confounders in the causal graph. In previous work, to work around the confounding problem, policies have been trained using query access to the expert's policy or inverse reinforcement learning (IRL). However, both app… ▽ More

    Submitted 25 August, 2024; v1 submitted 4 November, 2022; originally announced November 2022.

  8. arXiv:2203.16437  [pdf, other

    stat.ML cs.LG

    Weakly supervised causal representation learning

    Authors: Johann Brehmer, Pim de Haan, Phillip Lippe, Taco Cohen

    Abstract: Learning high-level causal representations together with a causal model from unstructured low-level data such as pixels is impossible from observational data alone. We prove under mild assumptions that this representation is however identifiable in a weakly supervised setting. This involves a dataset with paired samples before and after random, unknown interventions, but no further labels. We then… ▽ More

    Submitted 11 October, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: Published at NeurIPS 2022. v3: Experiments with higher-dimensional data and larger graphs, improved writing, and added references; matches camera-ready version

  9. arXiv:2112.11312  [pdf, other

    cs.LG cs.CV

    Implicit Neural Video Compression

    Authors: Yunfan Zhang, Ties van Rozendaal, Johann Brehmer, Markus Nagel, Taco Cohen

    Abstract: We propose a method to compress full-resolution video sequences with implicit neural representations. Each frame is represented as a neural network that maps coordinate positions to pixel values. We use a separate implicit network to modulate the coordinate inputs, which enables efficient motion compensation between frames. Together with a small residual network, this allows us to efficiently comp… ▽ More

    Submitted 21 December, 2021; originally announced December 2021.

  10. arXiv:2112.03235  [pdf, other

    cs.AI cs.CE cs.LG cs.MS

    Simulation Intelligence: Towards a New Generation of Scientific Methods

    Authors: Alexander Lavin, David Krakauer, Hector Zenil, Justin Gottschlich, Tim Mattson, Johann Brehmer, Anima Anandkumar, Sanjay Choudry, Kamil Rocki, Atılım Güneş Baydin, Carina Prunkl, Brooks Paige, Olexandr Isayev, Erik Peterson, Peter L. McMahon, Jakob Macke, Kyle Cranmer, Jiaxin Zhang, Haruko Wainwright, Adi Hanuka, Manuela Veloso, Samuel Assefa, Stephan Zheng, Avi Pfeffer

    Abstract: The original "Seven Motifs" set forth a roadmap of essential methods for the field of scientific computing, where a motif is an algorithmic method that captures a pattern of computation and data movement. We present the "Nine Motifs of Simulation Intelligence", a roadmap for the development and integration of the essential algorithms necessary for a merger of scientific computing, scientific simul… ▽ More

    Submitted 27 November, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

  11. arXiv:2111.10302  [pdf, other

    eess.IV cs.CV cs.LG

    Instance-Adaptive Video Compression: Improving Neural Codecs by Training on the Test Set

    Authors: Ties van Rozendaal, Johann Brehmer, Yunfan Zhang, Reza Pourreza, Auke Wiggers, Taco S. Cohen

    Abstract: We introduce a video compression algorithm based on instance-adaptive learning. On each video sequence to be transmitted, we finetune a pretrained compression model. The optimal parameters are transmitted to the receiver along with the latent code. By entropy-coding the parameter updates under a suitable mixture model prior, we ensure that the network parameters can be encoded efficiently. This in… ▽ More

    Submitted 23 June, 2023; v1 submitted 19 November, 2021; originally announced November 2021.

    Comments: Matches version published in TMLR

  12. arXiv:2011.08191  [pdf, other

    cs.AI cs.LG hep-ph

    Hierarchical clustering in particle physics through reinforcement learning

    Authors: Johann Brehmer, Sebastian Macaluso, Duccio Pappadopulo, Kyle Cranmer

    Abstract: Particle physics experiments often require the reconstruction of decay patterns through a hierarchical clustering of the observed final-state particles. We show that this task can be phrased as a Markov Decision Process and adapt reinforcement learning algorithms to solve it. In particular, we show that Monte-Carlo Tree Search guided by a neural policy can construct high-quality hierarchical clust… ▽ More

    Submitted 18 December, 2020; v1 submitted 16 November, 2020; originally announced November 2020.

    Comments: Accepted at the Machine Learning and the Physical Sciences workshop at NeurIPS 2020

  13. arXiv:2003.13913  [pdf, other

    stat.ML cs.LG

    Flows for simultaneous manifold learning and density estimation

    Authors: Johann Brehmer, Kyle Cranmer

    Abstract: We introduce manifold-learning flows (M-flows), a new class of generative models that simultaneously learn the data manifold as well as a tractable probability density on that manifold. Combining aspects of normalizing flows, GANs, autoencoders, and energy-based models, they have the potential to represent datasets with a manifold structure more faithfully and provide handles on dimensionality red… ▽ More

    Submitted 13 November, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: Code at https://github.com/johannbrehmer/manifold-flow , v2: multiple new experiments, v3: added comparison with probabilistic auto-encoder

  14. arXiv:1911.01429  [pdf, other

    stat.ML cs.LG stat.ME

    The frontier of simulation-based inference

    Authors: Kyle Cranmer, Johann Brehmer, Gilles Louppe

    Abstract: Many domains of science have developed complex simulations to describe phenomena of interest. While these simulations provide high-fidelity models, they are poorly suited for inference and lead to challenging inverse problems. We review the rapidly developing field of simulation-based inference and identify the forces giving new momentum to the field. Finally, we describe how the frontier is expan… ▽ More

    Submitted 2 April, 2020; v1 submitted 4 November, 2019; originally announced November 2019.

    Comments: 10 pages, 3 figures, proceedings for the Sackler Colloquia at the US National Academy of Sciences. v2: fixed typos. v3: clarified text, added references

  15. arXiv:1808.00973  [pdf, other

    stat.ML cs.LG hep-ph physics.data-an

    Likelihood-free inference with an improved cross-entropy estimator

    Authors: Markus Stoye, Johann Brehmer, Gilles Louppe, Juan Pavez, Kyle Cranmer

    Abstract: We extend recent work (Brehmer, et. al., 2018) that use neural networks as surrogate models for likelihood-free inference. As in the previous work, we exploit the fact that the joint likelihood ratio and joint score, conditioned on both observed and latent variables, can often be extracted from an implicit generative model or simulator to augment the training data for these surrogate models. We sh… ▽ More

    Submitted 2 August, 2018; originally announced August 2018.

    Comments: 8 pages, 3 figures

  16. arXiv:1805.12244  [pdf, other

    stat.ML cs.LG hep-ph physics.data-an

    Mining gold from implicit models to improve likelihood-free inference

    Authors: Johann Brehmer, Gilles Louppe, Juan Pavez, Kyle Cranmer

    Abstract: Simulators often provide the best description of real-world phenomena. However, they also lead to challenging inverse problems because the density they implicitly define is often intractable. We present a new suite of simulation-based inference techniques that go beyond the traditional Approximate Bayesian Computation approach, which struggles in a high-dimensional setting, and extend methods that… ▽ More

    Submitted 5 August, 2019; v1 submitted 30 May, 2018; originally announced May 2018.

    Comments: Code available at https://github.com/johannbrehmer/simulator-mining-example . v2: Fixed typos. v3: Expanded discussion, added Lotka-Volterra example. v4: Improved clarity