Skip to main content

Showing 1–38 of 38 results for author: Cranmer, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.06107  [pdf, other

    cs.LG cs.SC hep-ph hep-th stat.ML

    Transforming the Bootstrap: Using Transformers to Compute Scattering Amplitudes in Planar N = 4 Super Yang-Mills Theory

    Authors: Tianji Cai, Garrett W. Merz, François Charton, Niklas Nolte, Matthias Wilhelm, Kyle Cranmer, Lance J. Dixon

    Abstract: We pursue the use of deep learning methods to improve state-of-the-art computations in theoretical high-energy physics. Planar N = 4 Super Yang-Mills theory is a close cousin to the theory that describes Higgs boson production at the Large Hadron Collider; its scattering amplitudes are large mathematical expressions containing integer coefficients. In this paper, we apply Transformers to predict t… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 26+10 pages, 9 figures, 7 tables, application of machine learning aimed at physics and machine learning audience

    Report number: SLAC-PUB-17774

  2. arXiv:2401.08777  [pdf, other

    hep-ex cs.LG hep-ph physics.data-an

    Robust Anomaly Detection for Particle Physics Using Multi-Background Representation Learning

    Authors: Abhijith Gandrakota, Lily Zhang, Aahlad Puli, Kyle Cranmer, Jennifer Ngadiuba, Rajesh Ranganath, Nhan Tran

    Abstract: Anomaly, or out-of-distribution, detection is a promising tool for aiding discoveries of new particles or processes in particle physics. In this work, we identify and address two overlooked opportunities to improve anomaly detection for high-energy physics. First, rather than train a generative model on the single most dominant background process, we build detection algorithms using representation… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Report number: FERMILAB-PUB-23-675-CMS-CSAID

  3. Advances in machine-learning-based sampling motivated by lattice quantum chromodynamics

    Authors: Kyle Cranmer, Gurtej Kanwar, Sébastien Racanière, Danilo J. Rezende, Phiala E. Shanahan

    Abstract: Sampling from known probability distributions is a ubiquitous task in computational science, underlying calculations in domains from linguistics to biology and physics. Generative machine-learning (ML) models have emerged as a promising tool in this space, building on the success of this approach in applications such as image, text, and audio generation. Often, however, generative tasks in scienti… ▽ More

    Submitted 3 September, 2023; originally announced September 2023.

    Comments: 11 pages, 5 figures

    Journal ref: Nature Reviews Physics 5, 526-535 (2023)

  4. arXiv:2305.02402  [pdf, other

    hep-lat cond-mat.stat-mech cs.LG

    Normalizing flows for lattice gauge theory in arbitrary space-time dimension

    Authors: Ryan Abbott, Michael S. Albergo, Aleksandar Botev, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Gurtej Kanwar, Alexander G. D. G. Matthews, Sébastien Racanière, Ali Razavi, Danilo J. Rezende, Fernando Romero-López, Phiala E. Shanahan, Julian M. Urban

    Abstract: Applications of normalizing flows to the sampling of field configurations in lattice gauge theory have so far been explored almost exclusively in two space-time dimensions. We report new algorithmic developments of gauge-equivariant flow architectures facilitating the generalization to higher-dimensional lattice geometries. Specifically, we discuss masked autoregressive transformations with tracta… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

  5. arXiv:2303.04217  [pdf, other

    cs.AI cs.CY

    AI for Science: An Emerging Agenda

    Authors: Philipp Berens, Kyle Cranmer, Neil D. Lawrence, Ulrike von Luxburg, Jessica Montgomery

    Abstract: This report documents the programme and the outcomes of Dagstuhl Seminar 22382 "Machine Learning for Science: Bridging Data-Driven and Mechanistic Modelling". Today's scientific challenges are characterised by complexity. Interconnected natural, technological, and human systems are influenced by forces acting across time- and spatial-scales, resulting in complex interactions and emergent behaviour… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

  6. arXiv:2303.02101  [pdf, other

    hep-ex cs.LG hep-ph physics.ins-det

    Configurable calorimeter simulation for AI applications

    Authors: Francesco Armando Di Bello, Anton Charkin-Gorbulin, Kyle Cranmer, Etienne Dreyer, Sanmay Ganguly, Eilam Gross, Lukas Heinrich, Lorenzo Santi, Marumi Kado, Nilotpal Kakati, Patrick Rieck, Matteo Tusoni

    Abstract: A configurable calorimeter simulation for AI (COCOA) applications is presented, based on the Geant4 toolkit and interfaced with the Pythia event generator. This open-source project is aimed to support the development of machine learning algorithms in high energy physics that rely on realistic particle shower descriptions, such as reconstruction, fast simulation, and low-level analysis. Specificati… ▽ More

    Submitted 8 March, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: 9 pages, 11 figures

  7. arXiv:2211.07541  [pdf, other

    hep-lat cond-mat.stat-mech cs.LG

    Aspects of scaling and scalability for flow-based sampling of lattice QCD

    Authors: Ryan Abbott, Michael S. Albergo, Aleksandar Botev, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Alexander G. D. G. Matthews, Sébastien Racanière, Ali Razavi, Danilo J. Rezende, Fernando Romero-López, Phiala E. Shanahan, Julian M. Urban

    Abstract: Recent applications of machine-learned normalizing flows to sampling in lattice field theory suggest that such methods may be able to mitigate critical slowing down and topological freezing. However, these demonstrations have been at the scale of toy models, and it remains to be determined whether they can be applied to state-of-the-art lattice quantum chromodynamics calculations. Assessing the vi… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: 22 pages, 8 figures

    Report number: MIT-CTP/5496

  8. Gauge-equivariant flow models for sampling in lattice field theories with pseudofermions

    Authors: Ryan Abbott, Michael S. Albergo, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Gurtej Kanwar, Sébastien Racanière, Danilo J. Rezende, Fernando Romero-López, Phiala E. Shanahan, Betsy Tian, Julian M. Urban

    Abstract: This work presents gauge-equivariant architectures for flow-based sampling in fermionic lattice field theories using pseudofermions as stochastic estimators for the fermionic determinant. This is the default approach in state-of-the-art lattice field theory calculations, making this development critical to the practical application of flow models to theories such as QCD. Methods by which flow-base… ▽ More

    Submitted 16 October, 2022; v1 submitted 18 July, 2022; originally announced July 2022.

    Comments: 15 pages, 7 figures. v3: accepted version for publication. New appendix C

    Report number: MIT-CTP/5446, INT-PUB-22-017

    Journal ref: Phys.Rev.D 106 (2022) 7, 074506

  9. arXiv:2202.11712  [pdf, other

    hep-lat cond-mat.stat-mech cs.LG

    Flow-based sampling in the lattice Schwinger model at criticality

    Authors: Michael S. Albergo, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Gurtej Kanwar, Sébastien Racanière, Danilo J. Rezende, Fernando Romero-López, Phiala E. Shanahan, Julian M. Urban

    Abstract: Recent results suggest that flow-based algorithms may provide efficient sampling of field distributions for lattice field theory applications, such as studies of quantum chromodynamics and the Schwinger model. In this work, we provide a numerical demonstration of robust flow-based sampling in the Schwinger model at the critical value of the fermion mass. In contrast, at the same parameters, conven… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

    Comments: 5 pages main text, 3 pages supplementary material. 4 figures

    Report number: MIT-CTP/5409

  10. arXiv:2112.12795  [pdf, other

    hep-ph cs.DS quant-ph

    The Quantum Trellis: A classical algorithm for sampling the parton shower with interference effects

    Authors: Sebastian Macaluso, Kyle Cranmer

    Abstract: Simulations of high-energy particle collisions, such as those used at the Large Hadron Collider, are based on quantum field theory; however, many approximations are made in practice. For example, the simulation of the parton shower, which gives rise to objects called `jets', is based on a semi-classical approximation that neglects various interference effects. While there is a desire to incorporat… ▽ More

    Submitted 23 December, 2021; originally announced December 2021.

    Comments: 9 pages, 4 figures, Machine Learning and the Physical Sciences 2021, https://github.com/SebastianMacaluso/ClusterTrellis

  11. arXiv:2112.03235  [pdf, other

    cs.AI cs.CE cs.LG cs.MS

    Simulation Intelligence: Towards a New Generation of Scientific Methods

    Authors: Alexander Lavin, David Krakauer, Hector Zenil, Justin Gottschlich, Tim Mattson, Johann Brehmer, Anima Anandkumar, Sanjay Choudry, Kamil Rocki, Atılım Güneş Baydin, Carina Prunkl, Brooks Paige, Olexandr Isayev, Erik Peterson, Peter L. McMahon, Jakob Macke, Kyle Cranmer, Jiaxin Zhang, Haruko Wainwright, Adi Hanuka, Manuela Veloso, Samuel Assefa, Stephan Zheng, Avi Pfeffer

    Abstract: The original "Seven Motifs" set forth a roadmap of essential methods for the field of scientific computing, where a motif is an algorithmic method that captures a pattern of computation and data movement. We present the "Nine Motifs of Simulation Intelligence", a roadmap for the development and integration of the essential algorithms necessary for a merger of scientific computing, scientific simul… ▽ More

    Submitted 27 November, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

  12. arXiv:2110.06931  [pdf, other

    astro-ph.HE cs.LG hep-ph

    A neural simulation-based inference approach for characterizing the Galactic Center $γ$-ray excess

    Authors: Siddharth Mishra-Sharma, Kyle Cranmer

    Abstract: The nature of the Fermi gamma-ray Galactic Center Excess (GCE) has remained a persistent mystery for over a decade. Although the excess is broadly compatible with emission expected due to dark matter annihilation, an explanation in terms of a population of unresolved astrophysical point sources e.g., millisecond pulsars, remains viable. The effort to uncover the origin of the GCE is hampered in pa… ▽ More

    Submitted 27 March, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: 21+4 pages, 10+6 figures; v2, version published in PRD with additional tests and minor changes to text, results and conclusions unchanged

    Report number: MIT-CTP/5337

    Journal ref: Phys. Rev. D 105, 063017 (2022)

  13. arXiv:2107.00734  [pdf, other

    hep-lat cond-mat.stat-mech cs.LG

    Flow-based sampling for multimodal distributions in lattice field theory

    Authors: Daniel C. Hackett, Chung-Chun Hsieh, Michael S. Albergo, Denis Boyda, Jiunn-Wei Chen, Kai-Feng Chen, Kyle Cranmer, Gurtej Kanwar, Phiala E. Shanahan

    Abstract: Recent results have demonstrated that samplers constructed with flow-based generative models are a promising new approach for configuration generation in lattice field theory. In this paper, we present a set of methods to construct flow models for targets with multiple separated modes (i.e. theories with multiple vacua). We demonstrate the application of these methods to modeling two-dimensional r… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: 33 pages, 29 figures

    Report number: MIT-CTP/5312

  14. arXiv:2106.05934  [pdf, other

    hep-lat cond-mat.stat-mech cs.LG

    Flow-based sampling for fermionic lattice field theories

    Authors: Michael S. Albergo, Gurtej Kanwar, Sébastien Racanière, Danilo J. Rezende, Julian M. Urban, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Phiala E. Shanahan

    Abstract: Algorithms based on normalizing flows are emerging as promising machine learning approaches to sampling complicated probability distributions in a way that can be made asymptotically exact. In the context of lattice field theory, proof-of-principle studies have demonstrated the effectiveness of this approach for scalar theories, gauge theories, and statistical systems. This work develops approache… ▽ More

    Submitted 28 December, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: 26 pages, 5 figures

    Report number: MIT-CTP/5307

    Journal ref: Phys. Rev. D 104, 114507 (2021)

  15. arXiv:2104.07061  [pdf, other

    cs.LG cs.DS physics.data-an stat.ML

    Exact and Approximate Hierarchical Clustering Using A*

    Authors: Craig S. Greenberg, Sebastian Macaluso, Nicholas Monath, Avinava Dubey, Patrick Flaherty, Manzil Zaheer, Amr Ahmed, Kyle Cranmer, Andrew McCallum

    Abstract: Hierarchical clustering is a critical task in numerous domains. Many approaches are based on heuristics and the properties of the resulting clusterings are studied post hoc. However, in several applications, there is a natural cost function that can be used to characterize the quality of the clustering. In those cases, hierarchical clustering can be seen as a combinatorial optimization problem. To… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: 30 pages, 9 figures

  16. arXiv:2101.08176  [pdf, other

    hep-lat cond-mat.stat-mech cs.LG

    Introduction to Normalizing Flows for Lattice Field Theory

    Authors: Michael S. Albergo, Denis Boyda, Daniel C. Hackett, Gurtej Kanwar, Kyle Cranmer, Sébastien Racanière, Danilo Jimenez Rezende, Phiala E. Shanahan

    Abstract: This notebook tutorial demonstrates a method for sampling Boltzmann distributions of lattice field theories using a class of machine learning models known as normalizing flows. The ideas and approaches proposed in arXiv:1904.12072, arXiv:2002.02428, and arXiv:2003.06413 are reviewed and a concrete implementation of the framework is presented. We apply this framework to a lattice scalar field theor… ▽ More

    Submitted 6 August, 2021; v1 submitted 20 January, 2021; originally announced January 2021.

    Comments: 38 pages, 5 numbered figures, Jupyter notebook included as ancillary file

    Report number: MIT-CTP/5272

  17. arXiv:2011.08191  [pdf, other

    cs.AI cs.LG hep-ph

    Hierarchical clustering in particle physics through reinforcement learning

    Authors: Johann Brehmer, Sebastian Macaluso, Duccio Pappadopulo, Kyle Cranmer

    Abstract: Particle physics experiments often require the reconstruction of decay patterns through a hierarchical clustering of the observed final-state particles. We show that this task can be phrased as a Markov Decision Process and adapt reinforcement learning algorithms to solve it. In particular, we show that Monte-Carlo Tree Search guided by a neural policy can construct high-quality hierarchical clust… ▽ More

    Submitted 18 December, 2020; v1 submitted 16 November, 2020; originally announced November 2020.

    Comments: Accepted at the Machine Learning and the Physical Sciences workshop at NeurIPS 2020

  18. arXiv:2008.05456  [pdf, other

    hep-lat cs.LG stat.ML

    Sampling using $SU(N)$ gauge equivariant flows

    Authors: Denis Boyda, Gurtej Kanwar, Sébastien Racanière, Danilo Jimenez Rezende, Michael S. Albergo, Kyle Cranmer, Daniel C. Hackett, Phiala E. Shanahan

    Abstract: We develop a flow-based sampling algorithm for $SU(N)$ lattice gauge theories that is gauge-invariant by construction. Our key contribution is constructing a class of flows on an $SU(N)$ variable (or on a $U(N)$ variable by a simple alternative) that respect matrix conjugation symmetry. We apply this technique to sample distributions of single $SU(N)$ variables and to construct flow-based samplers… ▽ More

    Submitted 18 September, 2020; v1 submitted 12 August, 2020; originally announced August 2020.

    Comments: 24 pages, 19 figures

    Report number: MIT-CTP/5228

    Journal ref: Phys. Rev. D 103, 074504 (2021)

  19. arXiv:2006.11287  [pdf, other

    cs.LG astro-ph.CO astro-ph.IM physics.comp-ph stat.ML

    Discovering Symbolic Models from Deep Learning with Inductive Biases

    Authors: Miles Cranmer, Alvaro Sanchez-Gonzalez, Peter Battaglia, Rui Xu, Kyle Cranmer, David Spergel, Shirley Ho

    Abstract: We develop a general approach to distill symbolic representations of a learned deep model by introducing strong inductive biases. We focus on Graph Neural Networks (GNNs). The technique works as follows: we first encourage sparse latent representations when we train a GNN in a supervised setting, then we apply symbolic regression to components of the learned model to extract explicit physical rela… ▽ More

    Submitted 17 November, 2020; v1 submitted 19 June, 2020; originally announced June 2020.

    Comments: Accepted to NeurIPS 2020. 9 pages content + 16 pages appendix/references. Supporting code found at https://github.com/MilesCranmer/symbolic_deep_learning

  20. arXiv:2003.13913  [pdf, other

    stat.ML cs.LG

    Flows for simultaneous manifold learning and density estimation

    Authors: Johann Brehmer, Kyle Cranmer

    Abstract: We introduce manifold-learning flows (M-flows), a new class of generative models that simultaneously learn the data manifold as well as a tractable probability density on that manifold. Combining aspects of normalizing flows, GANs, autoencoders, and energy-based models, they have the potential to represent datasets with a manifold structure more faithfully and provide handles on dimensionality red… ▽ More

    Submitted 13 November, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: Code at https://github.com/johannbrehmer/manifold-flow , v2: multiple new experiments, v3: added comparison with probabilistic auto-encoder

  21. arXiv:2003.06413  [pdf, other

    hep-lat cond-mat.stat-mech cs.LG

    Equivariant flow-based sampling for lattice gauge theory

    Authors: Gurtej Kanwar, Michael S. Albergo, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Sébastien Racanière, Danilo Jimenez Rezende, Phiala E. Shanahan

    Abstract: We define a class of machine-learned flow-based sampling algorithms for lattice gauge theories that are gauge-invariant by construction. We demonstrate the application of this framework to U(1) gauge theory in two spacetime dimensions, and find that near critical points in parameter space the approach is orders of magnitude more efficient at sampling topological quantities than more traditional sa… ▽ More

    Submitted 13 March, 2020; originally announced March 2020.

    Comments: 6 pages, 4 figures

    Report number: MIT-CTP/5181

    Journal ref: Phys. Rev. Lett. 125, 121601 (2020)

  22. arXiv:2002.11661  [pdf, other

    cs.DS cs.LG physics.data-an stat.ML

    Data Structures & Algorithms for Exact Inference in Hierarchical Clustering

    Authors: Craig S. Greenberg, Sebastian Macaluso, Nicholas Monath, Ji-Ah Lee, Patrick Flaherty, Kyle Cranmer, Andrew McGregor, Andrew McCallum

    Abstract: Hierarchical clustering is a fundamental task often used to discover meaningful structures in data, such as phylogenetic trees, taxonomies of concepts, subtypes of cancer, and cascades of particle decays in particle physics. Typically approximate algorithms are used for inference due to the combinatorial number of possible hierarchical clusterings. In contrast to existing methods, we present novel… ▽ More

    Submitted 22 October, 2020; v1 submitted 26 February, 2020; originally announced February 2020.

    Comments: 27 pages, 12 figures

  23. arXiv:2002.08772  [pdf, other

    cs.LG stat.ML

    Set2Graph: Learning Graphs From Sets

    Authors: Hadar Serviansky, Nimrod Segol, Jonathan Shlomi, Kyle Cranmer, Eilam Gross, Haggai Maron, Yaron Lipman

    Abstract: Many problems in machine learning can be cast as learning functions from sets to graphs, or more generally to hypergraphs; in short, Set2Graph functions. Examples include clustering, learning vertex and edge features on graphs, and learning features on triplets in a collection. A natural approach for building Set2Graph models is to characterize all linear equivariant set-to-hypergraph layers and s… ▽ More

    Submitted 26 November, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

  24. arXiv:2002.02428  [pdf, other

    stat.ML cs.LG

    Normalizing Flows on Tori and Spheres

    Authors: Danilo Jimenez Rezende, George Papamakarios, Sébastien Racanière, Michael S. Albergo, Gurtej Kanwar, Phiala E. Shanahan, Kyle Cranmer

    Abstract: Normalizing flows are a powerful tool for building expressive distributions in high dimensions. So far, most of the literature has concentrated on learning flows on Euclidean spaces. Some problems however, such as those involving angles, are defined on spaces with more complex geometries, such as tori or spheres. In this paper, we propose and compare expressive and numerically stable flows on such… ▽ More

    Submitted 1 July, 2020; v1 submitted 6 February, 2020; originally announced February 2020.

    Comments: Accepted to the International Conference on Machine Learning (ICML) 2020

  25. arXiv:1911.01429  [pdf, other

    stat.ML cs.LG stat.ME

    The frontier of simulation-based inference

    Authors: Kyle Cranmer, Johann Brehmer, Gilles Louppe

    Abstract: Many domains of science have developed complex simulations to describe phenomena of interest. While these simulations provide high-fidelity models, they are poorly suited for inference and lead to challenging inverse problems. We review the rapidly developing field of simulation-based inference and identify the forces giving new momentum to the field. Finally, we describe how the frontier is expan… ▽ More

    Submitted 2 April, 2020; v1 submitted 4 November, 2019; originally announced November 2019.

    Comments: 10 pages, 3 figures, proceedings for the Sackler Colloquia at the US National Academy of Sciences. v2: fixed typos. v3: clarified text, added references

  26. arXiv:1909.12790  [pdf, other

    cs.LG physics.comp-ph

    Hamiltonian Graph Networks with ODE Integrators

    Authors: Alvaro Sanchez-Gonzalez, Victor Bapst, Kyle Cranmer, Peter Battaglia

    Abstract: We introduce an approach for imposing physically informed inductive biases in learned simulation models. We combine graph networks with a differentiable ordinary differential equation integrator as a mechanism for predicting future states, and a Hamiltonian as an internal representation. We find that our approach outperforms baselines without these biases in terms of predictive accuracy, energy ac… ▽ More

    Submitted 27 September, 2019; originally announced September 2019.

  27. arXiv:1907.03382  [pdf, other

    cs.LG cs.PF stat.ML

    Etalumis: Bringing Probabilistic Programming to Scientific Simulators at Scale

    Authors: Atılım Güneş Baydin, Lei Shao, Wahid Bhimji, Lukas Heinrich, Lawrence Meadows, Jialin Liu, Andreas Munk, Saeid Naderiparizi, Bradley Gram-Hansen, Gilles Louppe, Mingfei Ma, Xiaohui Zhao, Philip Torr, Victor Lee, Kyle Cranmer, Prabhat, Frank Wood

    Abstract: Probabilistic programming languages (PPLs) are receiving widespread attention for performing Bayesian inference in complex generative models. However, applications to science remain limited because of the impracticability of rewriting complex scientific simulators in a PPL, the computational cost of inference, and the lack of scalable implementations. To address these, we present a novel PPL frame… ▽ More

    Submitted 27 August, 2019; v1 submitted 7 July, 2019; originally announced July 2019.

    Comments: 14 pages, 8 figures

    MSC Class: 68T37; 68T05; 62P35 ACM Class: G.3; I.2.6; J.2

    Journal ref: Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis (SC19), November 17--22, 2019

  28. arXiv:1808.00973  [pdf, other

    stat.ML cs.LG hep-ph physics.data-an

    Likelihood-free inference with an improved cross-entropy estimator

    Authors: Markus Stoye, Johann Brehmer, Gilles Louppe, Juan Pavez, Kyle Cranmer

    Abstract: We extend recent work (Brehmer, et. al., 2018) that use neural networks as surrogate models for likelihood-free inference. As in the previous work, we exploit the fact that the joint likelihood ratio and joint score, conditioned on both observed and latent variables, can often be extracted from an implicit generative model or simulator to augment the training data for these surrogate models. We sh… ▽ More

    Submitted 2 August, 2018; originally announced August 2018.

    Comments: 8 pages, 3 figures

  29. arXiv:1807.07706  [pdf, other

    cs.LG hep-ph physics.data-an stat.ML

    Efficient Probabilistic Inference in the Quest for Physics Beyond the Standard Model

    Authors: Atılım Güneş Baydin, Lukas Heinrich, Wahid Bhimji, Lei Shao, Saeid Naderiparizi, Andreas Munk, Jialin Liu, Bradley Gram-Hansen, Gilles Louppe, Lawrence Meadows, Philip Torr, Victor Lee, Prabhat, Kyle Cranmer, Frank Wood

    Abstract: We present a novel probabilistic programming framework that couples directly to existing large-scale simulators through a cross-platform probabilistic execution protocol, which allows general-purpose inference engines to record and control random number draws within simulators in a language-agnostic way. The execution of existing simulators as probabilistic programs enables highly interpretable po… ▽ More

    Submitted 17 February, 2020; v1 submitted 20 July, 2018; originally announced July 2018.

    Comments: 20 pages, 9 figures

    MSC Class: 68T37; 68T05; 62P35 ACM Class: G.3; I.2.6; J.2

    Journal ref: In Advances in Neural Information Processing Systems 33 (NeurIPS), Vancouver, Canada, 2019

  30. arXiv:1807.02876  [pdf, other

    physics.comp-ph cs.LG hep-ex stat.ML

    Machine Learning in High Energy Physics Community White Paper

    Authors: Kim Albertsson, Piero Altoe, Dustin Anderson, John Anderson, Michael Andrews, Juan Pedro Araque Espinosa, Adam Aurisano, Laurent Basara, Adrian Bevan, Wahid Bhimji, Daniele Bonacorsi, Bjorn Burkle, Paolo Calafiura, Mario Campanelli, Louis Capps, Federico Carminati, Stefano Carrazza, Yi-fan Chen, Taylor Childers, Yann Coadou, Elias Coniavitis, Kyle Cranmer, Claire David, Douglas Davis, Andrea De Simone , et al. (103 additional authors not shown)

    Abstract: Machine learning has been applied to several problems in particle physics research, beginning with applications to high-level physics analysis in the 1990s and 2000s, followed by an explosion of applications in particle and event identification and reconstruction in the 2010s. In this document we discuss promising future research and development areas for machine learning in particle physics. We d… ▽ More

    Submitted 16 May, 2019; v1 submitted 8 July, 2018; originally announced July 2018.

    Comments: Editors: Sergei Gleyzer, Paul Seyfert and Steven Schramm

  31. arXiv:1806.01337  [pdf, other

    stat.ML cs.LG

    Backdrop: Stochastic Backpropagation

    Authors: Siavash Golkar, Kyle Cranmer

    Abstract: We introduce backdrop, a flexible and simple-to-implement method, intuitively described as dropout acting only along the backpropagation pipeline. Backdrop is implemented via one or more masking layers which are inserted at specific points along the network. Each backdrop masking layer acts as the identity in the forward pass, but randomly masks parts of the backward gradient propagation. Intuitiv… ▽ More

    Submitted 4 June, 2018; originally announced June 2018.

    Comments: 11 pages, 9 figures, 2 tables. Source code available at https://github.com/dexgen/backdrop

  32. arXiv:1805.12244  [pdf, other

    stat.ML cs.LG hep-ph physics.data-an

    Mining gold from implicit models to improve likelihood-free inference

    Authors: Johann Brehmer, Gilles Louppe, Juan Pavez, Kyle Cranmer

    Abstract: Simulators often provide the best description of real-world phenomena. However, they also lead to challenging inverse problems because the density they implicitly define is often intractable. We present a new suite of simulation-based inference techniques that go beyond the traditional Approximate Bayesian Computation approach, which struggles in a high-dimensional setting, and extend methods that… ▽ More

    Submitted 5 August, 2019; v1 submitted 30 May, 2018; originally announced May 2018.

    Comments: Code available at https://github.com/johannbrehmer/simulator-mining-example . v2: Fixed typos. v3: Expanded discussion, added Lotka-Volterra example. v4: Improved clarity

  33. arXiv:1712.07901  [pdf, other

    cs.AI physics.data-an

    Improvements to Inference Compilation for Probabilistic Programming in Large-Scale Scientific Simulators

    Authors: Mario Lezcano Casado, Atilim Gunes Baydin, David Martinez Rubio, Tuan Anh Le, Frank Wood, Lukas Heinrich, Gilles Louppe, Kyle Cranmer, Karen Ng, Wahid Bhimji, Prabhat

    Abstract: We consider the problem of Bayesian inference in the family of probabilistic models implicitly defined by stochastic generative models of data. In scientific fields ranging from population biology to cosmology, low-level mechanistic components are composed to create complex generative models. These models lead to intractable likelihoods and are typically non-differentiable, which poses challenges… ▽ More

    Submitted 21 December, 2017; originally announced December 2017.

    Comments: 7 pages, 2 figures

    MSC Class: 68T37; 68T05; 62P35 ACM Class: G.3; I.2.6; J.2

  34. arXiv:1707.07113  [pdf, other

    stat.ML cs.LG

    Adversarial Variational Optimization of Non-Differentiable Simulators

    Authors: Gilles Louppe, Joeri Hermans, Kyle Cranmer

    Abstract: Complex computer simulators are increasingly used across fields of science as generative models tying parameters of an underlying theory to experimental observations. Inference in this setup is often difficult, as simulators rarely admit a tractable density or likelihood function. We introduce Adversarial Variational Optimization (AVO), a likelihood-free inference algorithm for fitting a non-diffe… ▽ More

    Submitted 16 April, 2020; v1 submitted 22 July, 2017; originally announced July 2017.

    Comments: v4: Final version published at AISTATS 2019; v5: Fixed typo in Eqn 13

    Journal ref: PMLR 89:1438-1447, 2019

  35. arXiv:1611.01046  [pdf, other

    stat.ML cs.LG cs.NE physics.data-an stat.ME

    Learning to Pivot with Adversarial Networks

    Authors: Gilles Louppe, Michael Kagan, Kyle Cranmer

    Abstract: Several techniques for domain adaptation have been proposed to account for differences in the distribution of the data used for training and testing. The majority of this work focuses on a binary domain label. Similar problems occur in a scientific context where there may be a continuous family of plausible data generation processes associated to the presence of systematic uncertainties. Robust in… ▽ More

    Submitted 1 June, 2017; v1 submitted 3 November, 2016; originally announced November 2016.

    Comments: v1: Original submission. v2: Fixed references. v3: version submitted to NIPS'2017. Code available at https://github.com/glouppe/paper-learning-to-pivot

    Journal ref: Advances in Neural Information Processing Systems 30, pages 981-990, 2017

  36. Parameterized Machine Learning for High-Energy Physics

    Authors: Pierre Baldi, Kyle Cranmer, Taylor Faucett, Peter Sadowski, Daniel Whiteson

    Abstract: We investigate a new structure for machine learning classifiers applied to problems in high-energy physics by expanding the inputs to include not only measured features but also physics parameters. The physics parameters represent a smoothly varying learning task, and the resulting parameterized classifier can smoothly interpolate between them and replace sets of classifiers trained at individual… ▽ More

    Submitted 28 January, 2016; originally announced January 2016.

    Comments: For submission to PRD

  37. arXiv:1401.2134  [pdf, other

    cs.DL astro-ph.IM cs.CY

    10 Simple Rules for the Care and Feeding of Scientific Data

    Authors: Alyssa Goodman, Alberto Pepe, Alexander W. Blocker, Christine L. Borgman, Kyle Cranmer, Mercè Crosas, Rosanne Di Stefano, Yolanda Gil, Paul Groth, Margaret Hedstrom, David W. Hogg, Vinay Kashyap, Ashish Mahabal, Aneta Siemiginowska, Aleksandra Slavkovic

    Abstract: This article offers a short guide to the steps scientists can take to ensure that their data and associated analyses continue to be of value and to be recognized. In just the past few years, hundreds of scholarly papers and reports have been written on questions of data sharing, data provenance, research reproducibility, licensing, attribution, privacy, and more, but our goal here is not to review… ▽ More

    Submitted 9 January, 2014; originally announced January 2014.

    Comments: Accepted in PLOS Computational Biology. This paper was written collaboratively, on the web, in the open, using Authorea. The living version of this article, which includes sources and history, is available at http://www.authorea.com/3410/

  38. arXiv:1205.4667  [pdf

    hep-ex cs.DL

    Status Report of the DPHEP Study Group: Towards a Global Effort for Sustainable Data Preservation in High Energy Physics

    Authors: Z. Akopov, Silvia Amerio, David Asner, Eduard Avetisyan, Olof Barring, James Beacham, Matthew Bellis, Gregorio Bernardi, Siegfried Bethke, Amber Boehnlein, Travis Brooks, Thomas Browder, Rene Brun, Concetta Cartaro, Marco Cattaneo, Gang Chen, David Corney, Kyle Cranmer, Ray Culbertson, Sunje Dallmeier-Tiessen, Dmitri Denisov, Cristinel Diaconu, Vitaliy Dodonov, Tony Doyle, Gregory Dubois-Felsmann , et al. (65 additional authors not shown)

    Abstract: Data from high-energy physics (HEP) experiments are collected with significant financial and human effort and are mostly unique. An inter-experimental study group on HEP data preservation and long-term analysis was convened as a panel of the International Committee for Future Accelerators (ICFA). The group was formed by large collider-based experiments and investigated the technical and organisati… ▽ More

    Submitted 21 May, 2012; originally announced May 2012.

    Report number: DPHEP-2012-001