-
Accelerating template generation in resonant anomaly detection searches with optimal transport
Authors:
Matthew Leigh,
Debajyoti Sengupta,
Benjamin Nachman,
Tobias Golling
Abstract:
We introduce Resonant Anomaly Detection with Optimal Transport (RAD-OT), a method for generating signal templates in resonant anomaly detection searches. RAD-OT leverages the fact that the conditional probability density of the target features varies approximately linearly along the optimal transport path connecting the resonant feature. This does not assume that the conditional density itself is linear with the resonant feature, allowing RAD-OT to efficiently capture multimodal relationships, changes in resolution, etc. By solving the optimal transport problem, RAD-OT can quickly build a template by interpolating between the background distributions in two sideband regions. We demonstrate the performance of RAD-OT using the LHC Olympics R&D dataset, where we find comparable sensitivity and improved stability with respect to deep learning-based approaches.
Submitted 29 July, 2024;
originally announced July 2024.
-
Moment Unfolding
Authors:
Krish Desai,
Benjamin Nachman,
Jesse Thaler
Abstract:
Deconvolving ("unfolding") detector distortions is a critical step in the comparison of cross section measurements with theoretical predictions in particle and nuclear physics. However, most existing approaches require histogram binning while many theoretical predictions are at the level of statistical moments. We develop a new approach to directly unfold distribution moments as a function of another observable without having to first discretize the data. Our Moment Unfolding technique uses machine learning and is inspired by Generative Adversarial Networks (GANs). We demonstrate the performance of this approach using jet substructure measurements in collider physics. With this illustrative example, we find that our Moment Unfolding protocol is more precise than bin-based approaches and is as or more precise than completely unbinned methods.
Submitted 15 July, 2024;
originally announced July 2024.
-
Technical design report for the CODEX-$β$ demonstrator
Authors:
CODEX-b collaboration,
Giulio Aielli,
Juliette Alimena,
James Beacham,
Eli Ben Haim,
Andras Burucs,
Roberto Cardarelli,
Matthew Charles,
Xabier Cid Vidal,
Albert De Roeck,
Biplab Dey,
Silviu Dobrescu,
Ozgur Durmus,
Mohamed Elashri,
Vladimir Gligorov,
Rebeca Gonzalez Suarez,
Thomas Gorordo,
Zarria Gray,
Conor Henderson,
Louis Henry,
Philip Ilten,
Daniel Johnson,
Jacob Kautz,
Simon Knapen,
et al. (28 additional authors not shown)
Abstract:
The CODEX-$β$ apparatus is a demonstrator for the proposed future CODEX-b experiment, a long-lived-particle detector foreseen for operation at IP8 during HL-LHC data-taking. The demonstrator project, intended to collect data in 2025, is described, with a particular focus on the design, construction, and installation of the new apparatus.
Submitted 22 May, 2024;
originally announced June 2024.
-
Design of a SiPM-on-Tile ZDC for the future EIC and its Performance with Graph Neural Networks
Authors:
Ryan Milton,
Sebouh J. Paul,
Barak Schmookler,
Miguel Arratia,
Piyush Karande,
Aaron Angerami,
Fernando Torales Acosta,
Benjamin Nachman
Abstract:
We present a design for a high-granularity zero-degree calorimeter (ZDC) for the upcoming Electron-Ion Collider (EIC). The design uses SiPM-on-tile technology and features a novel staggered-layer arrangement that improves spatial resolution. To fully leverage the design's high granularity and non-trivial geometry, we employ graph neural networks (GNNs) for energy and angle regression as well as signal classification. The GNN-boosted performance metrics meet, and in some cases, significantly surpass the requirements set in the EIC Yellow Report, laying the groundwork for enhanced measurements that will facilitate a wide physics program. Our studies show that GNNs can significantly enhance the performance of high-granularity CALICE-style calorimeters by automating and optimizing the software compensation algorithms required for these systems. This improvement holds true even in the case of complicated geometries that pose challenges for image-based AI/ML methods.
Submitted 11 May, 2024;
originally announced June 2024.
-
Parnassus: An Automated Approach to Accurate, Precise, and Fast Detector Simulation and Reconstruction
Authors:
Etienne Dreyer,
Eilam Gross,
Dmitrii Kobylianskii,
Vinicius Mikuni,
Benjamin Nachman,
Nathalie Soybelman
Abstract:
Detector simulation and reconstruction are a significant computational bottleneck in particle physics. We develop Particle-flow Neural Assisted Simulations (Parnassus) to address this challenge. Our deep learning model takes as input a point cloud (particles impinging on a detector) and produces a point cloud (reconstructed particles). By combining detector simulations and reconstruction into one step, we aim to minimize resource utilization and enable fast surrogate models suitable for application both inside and outside large collaborations. We demonstrate this approach using a publicly available dataset of jets passed through the full simulation and reconstruction pipeline of the CMS experiment. We show that Parnassus accurately mimics the CMS particle flow algorithm on the (statistically) same events it was trained on and can generalize to jet momentum and type outside of the training distribution.
Submitted 31 May, 2024;
originally announced June 2024.
-
Advancing Set-Conditional Set Generation: Diffusion Models for Fast Simulation of Reconstructed Particles
Authors:
Dmitrii Kobylianskii,
Nathalie Soybelman,
Nilotpal Kakati,
Etienne Dreyer,
Benjamin Nachman,
Eilam Gross
Abstract:
The computational intensity of detector simulation and event reconstruction poses a significant difficulty for data analysis in collider experiments. This challenge inspires the continued development of machine learning techniques to serve as efficient surrogate models. We propose a fast emulation approach that combines simulation and reconstruction. In other words, a neural network generates a set of reconstructed objects conditioned on input particle sets. To make this possible, we advance set-conditional set generation with diffusion models. Using a realistic, generic, and public detector simulation and reconstruction package (COCOA), we show how diffusion models can accurately model the complex spectrum of reconstructed particles inside jets.
Submitted 31 May, 2024; v1 submitted 16 May, 2024;
originally announced May 2024.
-
Incorporating Physical Priors into Weakly-Supervised Anomaly Detection
Authors:
Chi Lung Cheng,
Gurpreet Singh,
Benjamin Nachman
Abstract:
We propose a new machine-learning-based anomaly detection strategy for comparing data with a background-only reference (a form of weak supervision). The sensitivity of previous strategies degrades significantly when the signal is too rare or there are many unhelpful features. Our Prior-Assisted Weak Supervision (PAWS) method incorporates information from a class of signal models in order to significantly enhance the search sensitivity of weakly supervised approaches. As long as the true signal is in the pre-specified class, PAWS matches the sensitivity of a dedicated, fully supervised method without specifying the exact parameters ahead of time. On the benchmark LHC Olympics anomaly detection dataset, our mix of semi-supervised and weakly supervised learning is able to extend the sensitivity over previous methods by a factor of 10 in cross section. Furthermore, if we add irrelevant (noise) dimensions to the inputs, classical methods degrade by another factor of 10 in cross section while PAWS remains insensitive to noise. This new approach could be applied in a number of scenarios and pushes the frontier of sensitivity between completely model-agnostic approaches and fully model-specific searches.
Submitted 14 May, 2024;
originally announced May 2024.
-
Unifying Simulation and Inference with Normalizing Flows
Authors:
Haoxing Du,
Claudius Krause,
Vinicius Mikuni,
Benjamin Nachman,
Ian Pang,
David Shih
Abstract:
There have been many applications of deep neural networks to detector calibrations and a growing number of studies that propose deep generative models as automated fast detector simulators. We show that these two tasks can be unified by using maximum likelihood estimation (MLE) from conditional generative models for energy regression. Unlike direct regression techniques, the MLE approach is prior-independent and non-Gaussian resolutions can be determined from the shape of the likelihood near the maximum. Using an ATLAS-like calorimeter simulation, we demonstrate this concept in the context of calorimeter energy calibration.
Submitted 9 May, 2024; v1 submitted 29 April, 2024;
originally announced April 2024.
-
The Landscape of Unfolding with Machine Learning
Authors:
Nathan Huetsch,
Javier Mariño Villadamigo,
Alexander Shmakov,
Sascha Diefenbacher,
Vinicius Mikuni,
Theo Heimel,
Michael Fenton,
Kevin Greif,
Benjamin Nachman,
Daniel Whiteson,
Anja Butter,
Tilman Plehn
Abstract:
Recent innovations from machine learning allow for data unfolding without binning and including correlations across many dimensions. We describe a set of known, upgraded, and new methods for ML-based unfolding. The performance of these approaches is evaluated on the same two datasets. We find that all techniques are capable of accurately reproducing the particle-level spectra across complex observables. Given that these approaches are conceptually diverse, they offer an exciting toolkit for a new class of measurements that can probe the Standard Model with an unprecedented level of detail and may enable sensitivity to new phenomena.
Submitted 17 May, 2024; v1 submitted 29 April, 2024;
originally announced April 2024.
-
OmniLearn: A Method to Simultaneously Facilitate All Jet Physics Tasks
Authors:
Vinicius Mikuni,
Benjamin Nachman
Abstract:
Machine learning has become an essential tool in jet physics. Due to their complex, high-dimensional nature, jets can be explored holistically by neural networks in ways that are not possible manually. However, innovations in all areas of jet physics are proceeding in parallel. We show that specially constructed machine learning models trained for a specific jet classification task can improve the accuracy, precision, or speed of all other jet physics tasks. This is demonstrated by training on a particular multiclass classification task and then using the learned representation for different classification tasks, for datasets with a different (full) detector simulation, for jets from a different collision system ($pp$ versus $ep$), for generative models, for likelihood ratio estimation, and for anomaly detection. Our OmniLearn approach is thus a foundation model and is made publicly available for use in any area where state-of-the-art precision is required for analyses involving jets and their substructure.
Submitted 24 April, 2024;
originally announced April 2024.
-
Measurement of groomed event shape observables in deep-inelastic electron-proton scattering at HERA
Authors:
The H1 collaboration,
V. Andreev,
M. Arratia,
A. Baghdasaryan,
A. Baty,
K. Begzsuren,
A. Bolz,
V. Boudry,
G. Brandt,
D. Britzger,
A. Buniatyan,
L. Bystritskaya,
A. J. Campbell,
K. B. Cantun Avila,
K. Cerny,
V. Chekelian,
Z. Chen,
J. G. Contreras,
J. Cvach,
J. B. Dainton,
K. Daum,
A. Deshpande,
C. Diaconu,
A. Drees,
G. Eckerlin,
et al. (123 additional authors not shown)
Abstract:
The H1 Collaboration at HERA reports the first measurement of groomed event shape observables in deep inelastic electron-proton scattering (DIS) at $\sqrt{s}=319$ GeV, using data recorded between the years 2003 and 2007 with an integrated luminosity of $351$ pb$^{-1}$. Event shapes provide incisive probes of perturbative and non-perturbative QCD. Grooming techniques have been used for jet measurements in hadronic collisions; this paper presents the first application of grooming to DIS data. The analysis is carried out in the Breit frame, utilizing the novel Centauro jet clustering algorithm that is designed for DIS event topologies. Events are required to have squared momentum-transfer $Q^2 > 150$ GeV$^2$ and inelasticity $ 0.2 < y < 0.7$. We report measurements of the production cross section of groomed event 1-jettiness and groomed invariant mass for several choices of grooming parameter. Monte Carlo model calculations and analytic calculations based on Soft Collinear Effective Theory are compared to the measurements.
Submitted 1 August, 2024; v1 submitted 15 March, 2024;
originally announced March 2024.
-
Measurement of the 1-jettiness event shape observable in deep-inelastic electron-proton scattering at HERA
Authors:
The H1 collaboration,
V. Andreev,
M. Arratia,
A. Baghdasaryan,
A. Baty,
K. Begzsuren,
A. Bolz,
V. Boudry,
G. Brandt,
D. Britzger,
A. Buniatyan,
L. Bystritskaya,
A. J. Campbell,
K. B. Cantun Avila,
K. Cerny,
V. Chekelian,
Z. Chen,
J. G. Contreras,
J. Cvach,
J. B. Dainton,
K. Daum,
A. Deshpande,
C. Diaconu,
A. Drees,
G. Eckerlin,
et al. (124 additional authors not shown)
Abstract:
The H1 Collaboration reports the first measurement of the 1-jettiness event shape observable $τ_1^b$ in neutral-current deep-inelastic electron-proton scattering (DIS). The observable $τ_1^b$ is equivalent to a thrust observable defined in the Breit frame. The data sample was collected at the HERA $ep$ collider in the years 2003-2007 with center-of-mass energy of $\sqrt{s}=319\,\text{GeV}$, corresponding to an integrated luminosity of $351.1\,\text{pb}^{-1}$. Triple differential cross sections are provided as a function of $τ_1^b$, event virtuality $Q^2$, and inelasticity $y$, in the kinematic region $Q^2>150\,\text{GeV}^{2}$. Single differential cross sections are provided as a function of $τ_1^b$ in a limited kinematic range. Double differential cross sections are measured, in contrast, integrated over $τ_1^b$ and represent the inclusive neutral-current DIS cross section measured as a function of $Q^2$ and $y$. The data are compared to a variety of predictions, including classical and modern Monte Carlo event generators, predictions in fixed-order perturbative QCD where calculations up to $\mathcal{O}(α_s^3)$ are available for $τ_1^b$ or inclusive DIS, and resummed predictions at next-to-leading logarithmic accuracy matched to fixed-order predictions at $\mathcal{O}(α_s^2)$. These comparisons reveal sensitivity of the 1-jettiness observable to QCD parton shower and resummation effects, as well as the modeling of hadronization and fragmentation. Within their range of validity, the fixed-order predictions provide a good description of the data. Monte Carlo event generators are predictive over the full measured range and hence their underlying models and parameters can be constrained by comparing to the presented data.
Submitted 15 March, 2024;
originally announced March 2024.
-
Observation and differential cross section measurement of neutral current DIS events with an empty hemisphere in the Breit frame
Authors:
The H1 collaboration,
V. Andreev,
M. Arratia,
A. Baghdasaryan,
A. Baty,
K. Begzsuren,
A. Bolz,
V. Boudry,
G. Brandt,
D. Britzger,
A. Buniatyan,
L. Bystritskaya,
A. J. Campbell,
K. B. Cantun Avila,
K. Cerny,
V. Chekelian,
Z. Chen,
J. G. Contreras,
J. Cvach,
J. B. Dainton,
K. Daum,
A. Deshpande,
C. Diaconu,
A. Drees,
G. Eckerlin,
et al. (124 additional authors not shown)
Abstract:
The Breit frame provides a natural frame to analyze lepton-proton scattering events. In this reference frame, the parton model hard interaction between a quark and an exchanged boson defines the coordinate system such that the struck quark is back-scattered along the virtual photon momentum direction. In Quantum Chromodynamics (QCD), higher order perturbative or non-perturbative effects can change this picture drastically. As Bjorken-$x$ decreases below one half, a rather peculiar event signature is predicted with increasing probability, where no radiation is present in one of the two Breit-frame hemispheres and all emissions are to be found in the other hemisphere. At higher orders in $α_s$ or in the presence of soft QCD effects, predictions of the rate of these events are far from trivial, and that motivates measurements with real data. We report on the first observation of the empty current hemisphere events in electron-proton collisions at the HERA collider using data recorded with the H1 detector at a center-of-mass energy of 319 GeV. The fraction of inclusive neutral-current DIS events with an empty hemisphere is found to be $0.0112 \pm 3.9\,\%_\text{stat} \pm 4.5\,\%_\text{syst} \pm 1.6\,\%_\text{mod}$ in the selected kinematic region of $150< Q^2<1500$ GeV$^2$ and inelasticity $0.14< y<0.7$. The data sample corresponds to an integrated luminosity of 351.1 pb$^{-1}$, sufficient to enable differential cross section measurements of these events. The results show an enhanced discriminating power at lower Bjorken-$x$ among different Monte Carlo event generator predictions.
Submitted 1 August, 2024; v1 submitted 13 March, 2024;
originally announced March 2024.
-
Seeing Double: Calibrating Two Jets at Once
Authors:
Rikab Gambhir,
Benjamin Nachman
Abstract:
Jet energy calibration is an important aspect of many measurements and searches at the LHC. Currently, these calibrations are performed on a per-jet basis, i.e. agnostic to the properties of other jets in the same event. In this work, we propose taking advantage of the correlations induced by momentum conservation between jets in order to improve their jet energy calibration. By fitting the $p_T$ asymmetry of dijet events in simulation, while remaining agnostic to the $p_T$ spectra themselves, we are able to obtain correlation-improved maximum likelihood estimates. This approach is demonstrated with simulated jets from the CMS Detector, yielding a $3$-$5\%$ relative improvement in the jet energy resolution, corresponding to a quadrature improvement of approximately 35\%.
Submitted 21 February, 2024;
originally announced February 2024.
-
Anomaly detection with flow-based fast calorimeter simulators
Authors:
Claudius Krause,
Benjamin Nachman,
Ian Pang,
David Shih,
Yunhao Zhu
Abstract:
Recently, several normalizing flow-based deep generative models have been proposed to accelerate the simulation of calorimeter showers. Using CaloFlow as an example, we show that these models can simultaneously perform unsupervised anomaly detection with no additional training cost. As a demonstration, we consider electromagnetic showers initiated by one (background) or multiple (signal) photons. The CaloFlow model is designed to generate single photon showers, but it also provides access to the shower likelihood. We use this likelihood as an anomaly score and study the showers tagged as being unlikely. As expected, the tagger struggles when the signal photons are nearly collinear, but is otherwise effective. This approach is complementary to a supervised classifier trained on only specific signal models using the same low-level calorimeter inputs. While the supervised classifier is also highly effective at unseen signal models, the unsupervised method is more sensitive in certain regions and thus we expect that the ultimate performance will require a combination of these approaches.
Submitted 29 August, 2024; v1 submitted 18 December, 2023;
originally announced December 2023.
-
Integrating Particle Flavor into Deep Learning Models for Hadronization
Authors:
Jay Chan,
Xiangyang Ju,
Adam Kania,
Benjamin Nachman,
Vishnu Sangli,
Andrzej Siodmok
Abstract:
Hadronization models used in event generators are physics-inspired functions with many tunable parameters. Since we do not understand hadronization from first principles, there have been multiple proposals to improve the accuracy of hadronization models by utilizing more flexible parameterizations based on neural networks. These recent proposals have focused on the kinematic properties of hadrons, but a full model must also include particle flavor. In this paper, we show how to build a deep learning-based hadronization model that includes both kinematic (continuous) and flavor (discrete) degrees of freedom. Our approach is based on Generative Adversarial Networks and we show the performance within the context of the cluster hadronization model within the Herwig event generator.
Submitted 13 December, 2023;
originally announced December 2023.
-
Non-resonant Anomaly Detection with Background Extrapolation
Authors:
Kehang Bai,
Radha Mastandrea,
Benjamin Nachman
Abstract:
Complete anomaly detection strategies that are both signal sensitive and compatible with background estimation have largely focused on resonant signals. Non-resonant new physics scenarios are relatively under-explored and may arise from off-shell effects or final states with significant missing energy. In this paper, we extend a class of weakly supervised anomaly detection strategies developed for resonant physics to the non-resonant case. Machine learning models are trained to reweight, generate, or morph the background, extrapolated from a control region. A classifier is then trained in a signal region to distinguish the estimated background from the data. The new methods are demonstrated using a semi-visible jet signature as a benchmark signal model, and are shown to automatically identify the anomalous events without specifying the signal ahead of time.
Submitted 7 May, 2024; v1 submitted 21 November, 2023;
originally announced November 2023.
-
Safe but Incalculable: Energy-weighting is not all you need
Authors:
Samuel Bright-Thonney,
Benjamin Nachman,
Jesse Thaler
Abstract:
Infrared and collinear (IRC) safety has long been used as a proxy for robustness when developing new jet substructure observables. This guiding philosophy has been carried into the deep learning era, where IRC-safe neural networks have been used for many jet studies. For graph-based neural networks, the most straightforward way to achieve IRC safety is to weight particle inputs by their energies. However, energy-weighting by itself does not guarantee that perturbative calculations of machine-learned observables will enjoy small non-perturbative corrections. In this paper, we demonstrate the sensitivity of IRC-safe networks to non-perturbative effects, by training an energy flow network (EFN) to maximize its sensitivity to hadronization. We then show how to construct Lipschitz Energy Flow Networks (L-EFNs), which are both IRC safe and relatively insensitive to non-perturbative corrections. We demonstrate the performance of L-EFNs on generated samples of quark and gluon jets, and showcase fascinating differences between the learned latent representations of EFNs and L-EFNs.
Submitted 13 February, 2024; v1 submitted 13 November, 2023;
originally announced November 2023.
-
Designing Observables for Measurements with Deep Learning
Authors:
Owen Long,
Benjamin Nachman
Abstract:
Many analyses in particle and nuclear physics use simulations to infer fundamental, effective, or phenomenological parameters of the underlying physics models. When the inference is performed with unfolded cross sections, the observables are designed using physics intuition and heuristics. We propose to design optimal observables with machine learning. Unfolded, differential cross sections in a neural network output contain the most information about parameters of interest and can be well-measured by construction. We demonstrate this idea using two physics models for inclusive measurements in deep inelastic scattering.
Submitted 12 October, 2023;
originally announced October 2023.
-
Full Phase Space Resonant Anomaly Detection
Authors:
Erik Buhmann,
Cedric Ewen,
Gregor Kasieczka,
Vinicius Mikuni,
Benjamin Nachman,
David Shih
Abstract:
Physics beyond the Standard Model that is resonant in one or more dimensions has been a longstanding focus of countless searches at colliders and beyond. Recently, many new strategies for resonant anomaly detection have been developed, where sideband information can be used in conjunction with modern machine learning, in order to generate synthetic datasets representing the Standard Model background. Until now, this approach was only able to accommodate a relatively small number of dimensions, limiting the breadth of the search sensitivity. Using recent innovations in point cloud generative models, we show that this strategy can also be applied to the full phase space, using all relevant particles for the anomaly detection. As a proof of principle, we show that the signal from the R&D dataset from the LHC Olympics is findable with this method, opening up the door to future studies that explore the interplay between depth and breadth in the representation of the data for anomaly detection.
Submitted 9 February, 2024; v1 submitted 10 October, 2023;
originally announced October 2023.
-
The Optimal use of Segmentation for Sampling Calorimeters
Authors:
Fernando Torales Acosta,
Bishnu Karki,
Piyush Karande,
Aaron Angerami,
Miguel Arratia,
Kenneth Barish,
Ryan Milton,
Sebastián Morán,
Benjamin Nachman,
Anshuman Sinha
Abstract:
One of the key design choices of any sampling calorimeter is how fine to make the longitudinal and transverse segmentation. To inform this choice, we study the impact of calorimeter segmentation on energy reconstruction. To ensure that the trends are due entirely to hardware and not to a sub-optimal use of segmentation, we deploy deep neural networks to perform the reconstruction. These networks make use of all available information by representing the calorimeter as a point cloud. To demonstrate our approach, we simulate a detector similar to the forward calorimeter system intended for use in the ePIC detector, which will operate at the upcoming Electron Ion Collider. We find that for the energy estimation of isolated charged pion showers, relatively fine longitudinal segmentation is key to achieving an energy resolution that is better than 10% across the full phase space. These results provide a valuable benchmark for ongoing EIC detector optimizations and may also inform future studies involving high-granularity calorimeters in other experiments at various facilities.
Submitted 2 October, 2023;
originally announced October 2023.
-
Flows for Flows: Morphing one Dataset into another with Maximum Likelihood Estimation
Authors:
Tobias Golling,
Samuel Klein,
Radha Mastandrea,
Benjamin Nachman,
John Andrew Raine
Abstract:
Many components of data analysis in high energy physics and beyond require morphing one dataset into another. This is commonly solved via reweighting, but there are many advantages of preserving weights and shifting the data points instead. Normalizing flows are machine learning models with impressive precision on a variety of particle physics tasks. Naively, normalizing flows cannot be used for morphing because they require knowledge of the probability density of the starting dataset. In most cases in particle physics, we can generate more examples, but we do not know densities explicitly. We propose a protocol called flows for flows for training normalizing flows to morph one dataset into another even if the underlying probability density of neither dataset is known explicitly. This enables a morphing strategy trained with maximum likelihood estimation, a setup that has been shown to be highly effective in related tasks. We study variations on this protocol to explore how far the data points are moved to statistically match the two datasets. Furthermore, we show how to condition the learned flows on particular features in order to create a morphing function for every value of the conditioning feature. For illustration, we demonstrate flows for flows for toy examples as well as a collider physics example involving dijet events.
Submitted 12 September, 2023;
originally announced September 2023.
-
Improving Generative Model-based Unfolding with Schrödinger Bridges
Authors:
Sascha Diefenbacher,
Guan-Horng Liu,
Vinicius Mikuni,
Benjamin Nachman,
Weili Nie
Abstract:
Machine learning-based unfolding has enabled unbinned and high-dimensional differential cross section measurements. Two main approaches have emerged in this research area: one based on discriminative models and one based on generative models. The main advantage of discriminative models is that they learn a small correction to a starting simulation while generative models scale better to regions of phase space with little data. We propose to use Schrödinger bridges and diffusion models to create SBUnfold, an unfolding approach that combines the strengths of both discriminative and generative models. The key feature of SBUnfold is that its generative model maps one set of events into another without having to go through a known probability density as is the case for normalizing flows and standard diffusion models. We show that SBUnfold achieves excellent performance compared to state-of-the-art methods on a synthetic Z+jets dataset.
Submitted 22 September, 2023; v1 submitted 23 August, 2023;
originally announced August 2023.
-
Refining Fast Calorimeter Simulations with a Schrödinger Bridge
Authors:
Sascha Diefenbacher,
Vinicius Mikuni,
Benjamin Nachman
Abstract:
Machine learning-based simulations, especially calorimeter simulations, are promising tools for approximating the precision of classical high energy physics simulations with a fraction of the generation time. Nearly all methods proposed so far learn neural networks that map a random variable with a known probability density, like a Gaussian, to realistic-looking events. In many cases, physics events are not close to Gaussian and so these neural networks have to learn a highly complex function. We study an alternative approach: Schrödinger bridge Quality Improvement via Refinement of Existing Lightweight Simulations (SQuIRELS). SQuIRELS leverages the power of diffusion-based neural networks and Schrödinger bridges to map between samples where the probability density is not known explicitly. We apply SQuIRELS to the task of refining a classical fast simulation to approximate a full classical simulation. On simulated calorimeter events, we find that SQuIRELS is able to reproduce highly non-trivial features of the full simulation with a fraction of the generation time.
Submitted 23 August, 2023;
originally announced August 2023.
-
CaloScore v2: Single-shot Calorimeter Shower Simulation with Diffusion Models
Authors:
Vinicius Mikuni,
Benjamin Nachman
Abstract:
Diffusion generative models are promising alternatives for fast surrogate models, producing high-fidelity physics simulations. However, the generation time often requires an expensive denoising process with hundreds of function evaluations, restricting the current applicability of these models in a realistic setting. In this work, we report updates on the CaloScore architecture, detailing the changes in the diffusion process, which produces higher quality samples, and the use of progressive distillation, resulting in a diffusion model capable of generating new samples with a single function evaluation. We demonstrate these improvements using the Calorimeter Simulation Challenge 2022 dataset.
Submitted 7 August, 2023;
originally announced August 2023.
-
The Interplay of Machine Learning--based Resonant Anomaly Detection Methods
Authors:
Tobias Golling,
Gregor Kasieczka,
Claudius Krause,
Radha Mastandrea,
Benjamin Nachman,
John Andrew Raine,
Debajyoti Sengupta,
David Shih,
Manuel Sommerhalder
Abstract:
Machine learning--based anomaly detection (AD) methods are promising tools for extending the coverage of searches for physics beyond the Standard Model (BSM). One class of AD methods that has received significant attention is resonant anomaly detection, where the BSM is assumed to be localized in at least one known variable. While there have been many methods proposed to identify such a BSM signal that make use of simulated or detected data in different ways, there has not yet been a study of the methods' complementarity. To this end, we address two questions. First, in the absence of any signal, do different methods pick the same events as signal-like? If not, then we can significantly reduce the false-positive rate by comparing different methods on the same dataset. Second, if there is a signal, are different methods fully correlated? Even if their maximum performance is the same, since we do not know how much signal is present, it may be beneficial to combine approaches. Using the Large Hadron Collider (LHC) Olympics dataset, we provide quantitative answers to these questions. We find that there are significant gains possible by combining multiple methods, which will strengthen the search program at the LHC and beyond.
Submitted 14 March, 2024; v1 submitted 20 July, 2023;
originally announced July 2023.
-
Artificial Intelligence for the Electron Ion Collider (AI4EIC)
Authors:
C. Allaire,
R. Ammendola,
E. -C. Aschenauer,
M. Balandat,
M. Battaglieri,
J. Bernauer,
M. Bondì,
N. Branson,
T. Britton,
A. Butter,
I. Chahrour,
P. Chatagnon,
E. Cisbani,
E. W. Cline,
S. Dash,
C. Dean,
W. Deconinck,
A. Deshpande,
M. Diefenthaler,
R. Ent,
C. Fanelli,
M. Finger,
M. Finger, Jr.,
E. Fol,
S. Furletov
, et al. (70 additional authors not shown)
Abstract:
The Electron-Ion Collider (EIC), a state-of-the-art facility for studying the strong force, is expected to begin commissioning its first experiments in 2028. This is an opportune time for artificial intelligence (AI) to be included from the start at this facility and in all phases that lead up to the experiments. The second annual workshop organized by the AI4EIC working group, which recently took place, centered on exploring all current and prospective application areas of AI for the EIC. This workshop is not only beneficial for the EIC, but also provides valuable insights for the newly established ePIC collaboration at EIC. This paper summarizes the different activities and R&D projects covered across the sessions of the workshop and provides an overview of the goals, approaches and strategies regarding AI/ML in the EIC community, as well as cutting-edge techniques currently studied in other experiments.
Submitted 17 July, 2023;
originally announced July 2023.
-
Comparison of Point Cloud and Image-based Models for Calorimeter Fast Simulation
Authors:
Fernando Torales Acosta,
Vinicius Mikuni,
Benjamin Nachman,
Miguel Arratia,
Bishnu Karki,
Ryan Milton,
Piyush Karande,
Aaron Angerami
Abstract:
Score based generative models are a new class of generative models that have been shown to accurately generate high dimensional calorimeter datasets. Recent advances in generative models have used images with 3D voxels to represent and model complex calorimeter showers. Point clouds, however, are likely a more natural representation of calorimeter showers, particularly in calorimeters with high granularity. Point clouds preserve all of the information of the original simulation, more naturally deal with sparse datasets, and can be implemented with more compact models and data files. In this work, two state-of-the-art score based models are trained on the same set of calorimeter simulation and directly compared.
Submitted 31 July, 2023; v1 submitted 10 July, 2023;
originally announced July 2023.
-
Learning to Isolate Muons in Data
Authors:
Edmund Witkowski,
Benjamin Nachman,
Daniel Whiteson
Abstract:
We use unlabeled collision data and weakly-supervised learning to train models which can distinguish prompt muons from non-prompt muons using patterns of low-level particle activity in the vicinity of the muon, and interpret the models in the space of energy flow polynomials. Particle activity associated with muons is a valuable tool for identifying prompt muons, those due to heavy boson decay, from muons produced in the decay of heavy flavor jets. The high-dimensional information is typically reduced to a single scalar quantity, isolation, but previous work in simulated samples suggests that valuable discriminating information is lost in this reduction. We extend these studies in LHC collisions recorded by the CMS experiment, where true class labels are not available, requiring the use of the invariant mass spectrum to obtain macroscopic sample information. This allows us to employ Classification Without Labels (CWoLa), a weakly supervised learning technique, to train models. Our results confirm that isolation does not describe events as well as the full low-level calorimeter information, and we are able to identify single energy flow polynomials capable of closing the performance gap. These polynomials are not the same ones derived from simulation, highlighting the importance of training directly on data.
Submitted 27 June, 2023;
originally announced June 2023.
-
High-dimensional and Permutation Invariant Anomaly Detection
Authors:
Vinicius Mikuni,
Benjamin Nachman
Abstract:
Methods for anomaly detection of new physics processes are often limited to low-dimensional spaces due to the difficulty of learning high-dimensional probability densities. Particularly at the constituent level, incorporating desirable properties such as permutation invariance and variable-length inputs becomes difficult within popular density estimation methods. In this work, we introduce a permutation-invariant density estimator for particle physics data based on diffusion models, specifically designed to handle variable-length inputs. We demonstrate the efficacy of our methodology by utilizing the learned density as a permutation-invariant anomaly detection score, effectively identifying jets with low likelihood under the background-only hypothesis. To validate our density estimation method, we investigate the ratio of learned densities and compare to those obtained by a supervised classification algorithm.
Submitted 7 February, 2024; v1 submitted 6 June, 2023;
originally announced June 2023.
-
Fitting a Deep Generative Hadronization Model
Authors:
Jay Chan,
Xiangyang Ju,
Adam Kania,
Benjamin Nachman,
Vishnu Sangli,
Andrzej Siodmok
Abstract:
Hadronization is a critical step in the simulation of high-energy particle and nuclear physics experiments. As there is no first principles understanding of this process, physically-inspired hadronization models have a large number of parameters that are fit to data. Deep generative models are a natural replacement for classical techniques, since they are more flexible and may be able to improve the overall precision. Proof of principle studies have shown how to use neural networks to emulate specific hadronization when trained using the inputs and outputs of classical methods. However, these approaches will not work with data, where we do not have a matching between observed hadrons and partons. In this paper, we develop a protocol for fitting a deep generative hadronization model in a realistic setting, where we only have access to a set of hadrons in data. Our approach uses a variation of a Generative Adversarial Network with a permutation invariant discriminator. We find that this setup is able to match the hadronization model in Herwig with multiple sets of parameters. This work represents a significant step forward in a longer term program to develop, train, and integrate machine learning-based hadronization models into parton shower Monte Carlo programs.
Submitted 24 July, 2023; v1 submitted 26 May, 2023;
originally announced May 2023.
-
ELSA -- Enhanced latent spaces for improved collider simulations
Authors:
Benjamin Nachman,
Ramon Winterhalder
Abstract:
Simulations play a key role for inference in collider physics. We explore various approaches for enhancing the precision of simulations using machine learning, including interventions at the end of the simulation chain (reweighting), at the beginning of the simulation chain (pre-processing), and connections between the end and beginning (latent space refinement). To clearly illustrate our approaches, we use W+jets matrix element surrogate simulations based on normalizing flows as a prototypical example. First, weights in the data space are derived using machine learning classifiers. Then, we pull back the data-space weights to the latent space to produce unweighted examples and employ the Latent Space Refinement (LASER) protocol using Hamiltonian Monte Carlo. An alternative approach is an augmented normalizing flow, which allows for different dimensions in the latent and target spaces. These methods are studied for various pre-processing strategies, including a new and general method for massive particles at hadron colliders that is a tweak on the widely-used RAMBO-on-diet mapping. We find that modified simulations can achieve sub-percent precision across a wide range of phase space.
Submitted 21 October, 2023; v1 submitted 12 May, 2023;
originally announced May 2023.
-
Parton Labeling without Matching: Unveiling Emergent Labelling Capabilities in Regression Models
Authors:
Shikai Qiu,
Shuo Han,
Xiangyang Ju,
Benjamin Nachman,
Haichen Wang
Abstract:
Parton labeling methods are widely used when reconstructing collider events with top quarks or other massive particles. State-of-the-art techniques are based on machine learning and require training data with events that have been matched using simulations with truth information. In nature, there is no unique matching between partons and final state objects due to the properties of the strong force and due to acceptance effects. We propose a new approach to parton labeling that circumvents these challenges by recycling regression models. The final state objects that are most relevant for a regression model to predict the properties of a particular top quark are assigned to said parent particle without having any parton-matched training data. This approach is demonstrated using simulated events with top quarks and outperforms the widely-used $χ^2$ method.
Submitted 7 July, 2024; v1 submitted 18 April, 2023;
originally announced April 2023.
-
Fast Point Cloud Generation with Diffusion Models in High Energy Physics
Authors:
Vinicius Mikuni,
Benjamin Nachman,
Mariel Pettee
Abstract:
Many particle physics datasets like those generated at colliders are described by continuous coordinates (in contrast to grid points like in an image), respect a number of symmetries (like permutation invariance), and have a stochastic dimensionality. For this reason, standard deep generative models that produce images or at least a fixed set of features are limiting. We introduce a new neural network simulation based on a diffusion model that addresses these limitations named Fast Point Cloud Diffusion (FPCD). We show that our approach can reproduce the complex properties of hadronic jets from proton-proton collisions with competitive precision to other recently proposed models. Additionally, we use a procedure called progressive distillation to accelerate the generation time of our method, which is typically a significant challenge for diffusion models despite their state-of-the-art precision.
Submitted 17 July, 2023; v1 submitted 3 April, 2023;
originally announced April 2023.
-
Unbinned Deep Learning Jet Substructure Measurement in High $Q^2$ ep collisions at HERA
Authors:
The H1 collaboration,
V. Andreev,
M. Arratia,
A. Baghdasaryan,
A. Baty,
K. Begzsuren,
A. Bolz,
V. Boudry,
G. Brandt,
D. Britzger,
A. Buniatyan,
L. Bystritskaya,
A. J. Campbell,
K. B. Cantun Avila,
K. Cerny,
V. Chekelian,
Z. Chen,
J. G. Contreras,
J. Cvach,
J. B. Dainton,
K. Daum,
A. Deshpande,
C. Diaconu,
A. Drees,
G. Eckerlin
, et al. (120 additional authors not shown)
Abstract:
The radiation pattern within high energy quark- and gluon-initiated jets (jet substructure) is used extensively as a precision probe of the strong force as well as an environment for optimizing event generators with numerous applications in high energy particle and nuclear physics. Looking at electron-proton collisions is of particular interest as many of the complications present at hadron colliders are absent. A detailed study of modern jet substructure observables, jet angularities, in electron-proton collisions is presented using data recorded using the H1 detector at HERA. The measurement is unbinned and multi-dimensional, using machine learning to correct for detector effects. All of the available reconstructed object information of the respective jets is interpreted by a graph neural network, achieving superior precision on a selected set of jet angularities. Training these networks was enabled by the use of a large number of GPUs in the Perlmutter supercomputer at Berkeley Lab. The particle jets are reconstructed in the laboratory frame, using the $k_{\mathrm{T}}$ jet clustering algorithm. Results are reported at high transverse momentum transfer $Q^2>150$ GeV${}^2$, and inelasticity $0.2 < y < 0.7$. The analysis is also performed in sub-regions of $Q^2$, thus probing scale dependencies of the substructure variables. The data are compared with a variety of predictions and point towards possible improvements of such models.
Submitted 14 September, 2023; v1 submitted 23 March, 2023;
originally announced March 2023.
-
Unbinned Profiled Unfolding
Authors:
Jay Chan,
Benjamin Nachman
Abstract:
Unfolding is an important procedure in particle physics experiments which corrects for detector effects and provides differential cross section measurements that can be used for a number of downstream tasks, such as extracting fundamental physics parameters. Traditionally, unfolding is done by discretizing the target phase space into a finite number of bins and is limited in the number of unfolded variables. Recently, there have been a number of proposals to perform unbinned unfolding with machine learning. However, none of these methods (like most unfolding methods) allow for simultaneously constraining (profiling) nuisance parameters. We propose a new machine learning-based unfolding method that results in an unbinned differential cross section and can profile nuisance parameters. The machine learning loss function is the full likelihood function, based on binned inputs at detector-level. We first demonstrate the method with simple Gaussian examples and then show the impact on a simulated Higgs boson cross section measurement.
Submitted 7 July, 2023; v1 submitted 10 February, 2023;
originally announced February 2023.
-
Report of the 2021 U.S. Community Study on the Future of Particle Physics (Snowmass 2021) Summary Chapter
Authors:
Joel N. Butler,
R. Sekhar Chivukula,
André de Gouvêa,
Tao Han,
Young-Kee Kim,
Priscilla Cushman,
Glennys R. Farrar,
Yury G. Kolomensky,
Sergei Nagaitsev,
Nicolás Yunes,
Stephen Gourlay,
Tor Raubenheimer,
Vladimir Shiltsev,
Kétévi A. Assamagan,
Breese Quinn,
V. Daniel Elvira,
Steven Gottlieb,
Benjamin Nachman,
Aaron S. Chou,
Marcelle Soares-Santos,
Tim M. P. Tait,
Meenakshi Narain,
Laura Reina,
Alessandro Tricoli,
Phillip S. Barbeau
, et al. (18 additional authors not shown)
Abstract:
The 2021-22 High-Energy Physics Community Planning Exercise (a.k.a. "Snowmass 2021") was organized by the Division of Particles and Fields of the American Physical Society. Snowmass 2021 was a scientific study that provided an opportunity for the entire U.S. particle physics community, along with its international partners, to identify the most important scientific questions in High Energy Physics for the following decade, with an eye to the decade after that, and the experiments, facilities, infrastructure, and R&D needed to pursue them. This Snowmass summary report synthesizes the lessons learned and the main conclusions of the Community Planning Exercise as a whole and presents a community-informed synopsis of U.S. particle physics at the beginning of 2023. This document, along with the Snowmass reports from the various subfields, will provide input to the 2023 Particle Physics Project Prioritization Panel (P5) subpanel of the U.S. High-Energy Physics Advisory Panel (HEPAP), and will help to guide and inform the activity of the U.S. particle physics community during the next decade and beyond.
Submitted 3 December, 2023; v1 submitted 16 January, 2023;
originally announced January 2023.
-
FETA: Flow-Enhanced Transportation for Anomaly Detection
Authors:
Tobias Golling,
Samuel Klein,
Radha Mastandrea,
Benjamin Nachman
Abstract:
Resonant anomaly detection is a promising framework for model-independent searches for new particles. Weakly supervised resonant anomaly detection methods compare data with a potential signal against a template of the Standard Model (SM) background inferred from sideband regions. We propose a means to generate this background template that uses a flow-based model to create a mapping between high-fidelity SM simulations and the data. The flow is trained in sideband regions with the signal region blinded, and the flow is conditioned on the resonant feature (mass) such that it can be interpolated into the signal region. To illustrate this approach, we use simulated collisions from the Large Hadron Collider (LHC) Olympics Dataset. We find that our flow-constructed background method has competitive sensitivity with other recent proposals and can therefore provide complementary information to improve future searches.
Submitted 14 June, 2023; v1 submitted 21 December, 2022;
originally announced December 2022.
-
Resonant Anomaly Detection with Multiple Reference Datasets
Authors:
Mayee F. Chen,
Benjamin Nachman,
Frederic Sala
Abstract:
An important class of techniques for resonant anomaly detection in high energy physics builds models that can distinguish between reference and target datasets, where only the latter has appreciable signal. Such techniques, including Classification Without Labels (CWoLa) and Simulation Assisted Likelihood-free Anomaly Detection (SALAD), rely on a single reference dataset. They cannot take advantage of commonly available multiple datasets and thus cannot fully exploit available information. In this work, we propose generalizations of CWoLa and SALAD for settings where multiple reference datasets are available, building on weak supervision techniques. We demonstrate improved performance in a number of settings with realistic and synthetic data. As an added benefit, our generalizations enable us to provide finite-sample guarantees, improving on existing asymptotic analyses.
Submitted 20 December, 2022;
originally announced December 2022.
-
Efficiently Moving Instead of Reweighting Collider Events with Machine Learning
Authors:
Radha Mastandrea,
Benjamin Nachman
Abstract:
There are many cases in collider physics and elsewhere where a calibration dataset is used to predict the known physics and / or noise of a target region of phase space. This calibration dataset usually cannot be used out-of-the-box but must be tweaked, often with conditional importance weights, to be maximally realistic. Using resonant anomaly detection as an example, we compare a number of alternative approaches based on transporting events with normalizing flows instead of reweighting them. We find that the accuracy of the morphed calibration dataset depends on the degree to which the transport task is set up to carry out optimal transport, which motivates future research into this area.
Submitted 12 December, 2022;
originally announced December 2022.
-
Geometry Optimization for Long-lived Particle Detectors
Authors:
Thomas Gorordo,
Simon Knapen,
Benjamin Nachman,
Dean J. Robinson,
Adi Suresh
Abstract:
The proposed designs of many auxiliary long-lived particle (LLP) detectors at the LHC call for the instrumentation of a large surface area inside the detector volume, in order to reliably reconstruct tracks and LLP decay vertices. Taking the CODEX-b detector as an example, we provide a proof-of-concept optimization analysis that demonstrates the required instrumented surface area can be substantially reduced for many LLP models, while only marginally affecting the LLP signal efficiency. This optimization permits a significant reduction in cost and installation time, and may also inform the installation order for modular detector elements. We derive a branch-and-bound based optimization algorithm that permits highly computationally efficient determination of optimal detector configurations, subject to any specified LLP vertex and track reconstruction requirements. We outline the features of a newly-developed generalized simulation framework, for the computation of LLP signal efficiencies across a range of LLP models and detector geometries.
Submitted 15 November, 2022;
originally announced November 2022.
-
Machine-Learning Compression for Particle Physics Discoveries
Authors:
Jack H. Collins,
Yifeng Huang,
Simon Knapen,
Benjamin Nachman,
Daniel Whiteson
Abstract:
In collider-based particle and nuclear physics experiments, data are produced at such extreme rates that only a subset can be recorded for later analysis. Typically, algorithms select individual collision events for preservation and store the complete experimental response. A relatively new alternative strategy is to additionally save a partial record for a larger subset of events, allowing for later specific analysis of a larger fraction of events. We propose a strategy that bridges these paradigms by compressing entire events for generic offline analysis but at a lower fidelity. An optimal-transport-based $β$ Variational Autoencoder (VAE) is used to automate the compression and the hyperparameter $β$ controls the compression fidelity. We introduce a new approach for multi-objective learning functions by simultaneously learning a VAE appropriate for all values of $β$ through parameterization. We present an example use case, a di-muon resonance search at the Large Hadron Collider (LHC), where we show that simulated data compressed by our $β$-VAE has enough fidelity to distinguish distinct signal morphologies.
Submitted 18 December, 2022; v1 submitted 20 October, 2022;
originally announced October 2022.
-
ATHENA Detector Proposal -- A Totally Hermetic Electron Nucleus Apparatus proposed for IP6 at the Electron-Ion Collider
Authors:
ATHENA Collaboration,
J. Adam,
L. Adamczyk,
N. Agrawal,
C. Aidala,
W. Akers,
M. Alekseev,
M. M. Allen,
F. Ameli,
A. Angerami,
P. Antonioli,
N. J. Apadula,
A. Aprahamian,
W. Armstrong,
M. Arratia,
J. R. Arrington,
A. Asaturyan,
E. C. Aschenauer,
K. Augsten,
S. Aune,
K. Bailey,
C. Baldanza,
M. Bansal,
F. Barbosa,
L. Barion
, et al. (415 additional authors not shown)
Abstract:
ATHENA has been designed as a general purpose detector capable of delivering the full scientific scope of the Electron-Ion Collider. Careful technology choices provide fine tracking and momentum resolution, high performance electromagnetic and hadronic calorimetry, hadron identification over a wide kinematic range, and near-complete hermeticity. This article describes the detector design and its expected performance in the most relevant physics channels. It includes an evaluation of detector technology choices, the technical challenges to realizing the detector and the R&D required to meet those challenges.
Submitted 13 October, 2022;
originally announced October 2022.
-
The Future of High Energy Physics Software and Computing
Authors:
V. Daniel Elvira,
Steven Gottlieb,
Oliver Gutsche,
Benjamin Nachman,
S. Bailey,
W. Bhimji,
P. Boyle,
G. Cerati,
M. Carrasco Kind,
K. Cranmer,
G. Davies,
V. D. Elvira,
R. Gardner,
K. Heitmann,
M. Hildreth,
W. Hopkins,
T. Humble,
M. Lin,
P. Onyisi,
J. Qiang,
K. Pedro,
G. Perdue,
A. Roberts,
M. Savage,
P. Shanahan
, et al. (3 additional authors not shown)
Abstract:
Software and Computing (S&C) are essential to all High Energy Physics (HEP) experiments and many theoretical studies. The size and complexity of S&C are now commensurate with that of experimental instruments, playing a critical role in experimental design, data acquisition/instrumental control, reconstruction, and analysis. Furthermore, S&C often plays a leading role in driving the precision of theoretical calculations and simulations. Within this central role in HEP, S&C has been immensely successful over the last decade. This report looks forward to the next decade and beyond, in the context of the 2021 Particle Physics Community Planning Exercise ("Snowmass") organized by the Division of Particles and Fields (DPF) of the American Physical Society.
Submitted 8 November, 2022; v1 submitted 11 October, 2022;
originally announced October 2022.
-
Precision QCD, Hadronic Structure & Forward QCD, Heavy Ions: Report of Energy Frontier Topical Groups 5, 6, 7 submitted to Snowmass 2021
Authors:
M. Begel,
S. Hoeche,
M. Schmitt,
H. -W. Lin,
P. M. Nadolsky,
C. Royon,
Y-J. Lee,
S. Mukherjee,
C. Baldenegro,
J. Campbell,
G. Chachamis,
F. G. Celiberto,
A. M. Cooper-Sarkar,
D. d'Enterria,
M. Diefenthaler,
M. Fucilla,
M. V. Garzelli,
M. Guzzi,
M. Hentschinski,
T. J. Hobbs,
J. Huston,
J. Isaacson,
S. R. Klein,
F. Kling,
P. Kotko
, et al. (25 additional authors not shown)
Abstract:
This report was prepared on behalf of three Energy Frontier Topical Groups of the Snowmass 2021 Community Planning Exercise. It summarizes the status and implications of studies of strong interactions in high-energy experiments and QCD theory. We emphasize the rich landscape and broad impact of these studies in the decade ahead. Hadronic interactions play a central role in the high-luminosity Large Hadron Collider (LHC) physics program, and strong synergies exist between the (HL-)LHC and planned or proposed experiments at the U.S. Electron-Ion Collider, CERN forward physics experiments, high-intensity facilities, and future TeV-range lepton and hadron colliders. Prospects for precision determinations of the strong coupling and a variety of nonperturbative distribution and fragmentation functions are examined. We also review the potential of envisioned tests of new dynamical regimes of QCD in high-energy and high-density scattering processes with nucleon, ion, and photon initial states. The important role of the high-energy heavy-ion program in studies of nuclear structure and the nuclear medium, and its connections with QCD involving nucleons are summarized. We address ongoing and future theoretical advancements in multi-loop QCD computations, lattice QCD, jet substructure, and event generators. Cross-cutting connections between experimental measurements, theoretical predictions, large-scale data analysis, and high-performance computing are emphasized.
Submitted 19 November, 2022; v1 submitted 29 September, 2022;
originally announced September 2022.
-
Anomaly Detection under Coordinate Transformations
Authors:
Gregor Kasieczka,
Radha Mastandrea,
Vinicius Mikuni,
Benjamin Nachman,
Mariel Pettee,
David Shih
Abstract:
There is a growing need for machine learning-based anomaly detection strategies to broaden the search for Beyond-the-Standard-Model (BSM) physics at the Large Hadron Collider (LHC) and elsewhere. The first step of any anomaly detection approach is to specify observables and then use them to decide on a set of anomalous events. One common choice is to select events that have low probability density. It is a well-known fact that probability densities are not invariant under coordinate transformations, so the sensitivity can depend on the initial choice of coordinates. The broader machine learning community has recently connected coordinate sensitivity with anomaly detection and our goal is to bring awareness of this issue to the growing high energy physics literature on anomaly detection. In addition to analytical explanations, we provide numerical examples from simple random variables and from the LHC Olympics Dataset that show how using probability density as an anomaly score can lead to events being classified as anomalous or not depending on the coordinate frame.
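The coordinate dependence described in this abstract follows from the standard change-of-variables formula for probability densities. A minimal worked example is sketched below; the uniform variable $X$ and the map $y = x^2$ are assumptions chosen here for illustration and are not taken from the paper.

```latex
% Change of variables for an invertible, differentiable map y = f(x)
p_Y(y) = p_X\!\left(f^{-1}(y)\right) \left| \frac{d f^{-1}(y)}{dy} \right|

% Illustrative assumption: X ~ Uniform(0,1) and y = f(x) = x^2, so f^{-1}(y) = \sqrt{y}
p_X(x) = 1, \qquad 0 < x < 1
p_Y(y) = \frac{1}{2\sqrt{y}}, \qquad 0 < y < 1
```

In the $x$ coordinate every event has the same density, so a low-density selection singles out no event. In the $y$ coordinate the same events near $y = 1$ have the lowest density ($p_Y \to 1/2$) and would be ranked as the most anomalous.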
Submitted 13 September, 2022;
originally announced September 2022.
-
Solid State Detectors and Tracking for Snowmass
Authors:
A. Affolder,
A. Apresyan,
S. Worm,
M. Albrow,
D. Ally,
D. Ambrose,
E. Anderssen,
N. Apadula,
P. Asenov,
W. Armstrong,
M. Artuso,
A. Barbier,
P. Barletta,
L. Bauerdick,
D. Berry,
M. Bomben,
M. Boscardin,
J. Brau,
W. Brooks,
M. Breidenbach,
J. Buckley,
V. Cairo,
R. Caputo,
L. Carpenter,
M. Centis-Vignali
, et al. (110 additional authors not shown)
Abstract:
Tracking detectors are of vital importance for collider-based high energy physics (HEP) experiments. The primary purpose of tracking detectors is the precise reconstruction of charged particle trajectories and the reconstruction of secondary vertices. The community's performance requirements for future collider experiments call for an evolution of tracking systems, necessitating the development of new techniques, materials and technologies in order to fully exploit their physics potential. In this article we summarize the discussions and conclusions of the 2022 Snowmass Instrumentation Frontier subgroup on Solid State and Tracking Detectors (Snowmass IF03).
Submitted 19 October, 2022; v1 submitted 8 September, 2022;
originally announced September 2022.
-
When, Where, and How to Open Data: A Personal Perspective
Authors:
Benjamin Nachman
Abstract:
This is a personal perspective on data sharing in the context of public data releases suitable for generic analysis. These open data can be a powerful tool for expanding the science of high energy physics, but care must be taken in when, where, and how they are utilized. I argue that data preservation even within collaborations needs additional support in order to maximize our science potential. Additionally, it should also be easier for non-collaboration members to engage with collaborations. Finally, I advocate that we recognize a new type of high energy physicist: the 'data physicist', who would be optimally suited to analyze open data as well as develop and deploy new advanced data science tools so that we can use our precious data to their fullest potential.
This document has been coordinated with a white paper on open data commissioned by the American Physical Society's (APS) Division of Particles and Fields (DPF) Community Planning Exercise ('Snowmass') Theory Frontier [1] and is also relevant to the Computational Frontier.
Submitted 16 August, 2022;
originally announced August 2022.
-
Morphing parton showers with event derivatives
Authors:
Benjamin Nachman,
Stefan Prestel
Abstract:
We develop EventMover, a differentiable parton shower event generator. This tool generates high- and variable-length scattering events that can be moved with simulation derivatives to change the value of the scale $Λ_\mathrm{QCD}$ defining the strong coupling constant, without introducing statistical variations between samples. To demonstrate the potential for EventMover, we compare the output of the simulation with $e^+e^-$ data to show how one could fit $Λ_\mathrm{QCD}$ with only a single event sample. This is a critical step towards a fully differentiable event generator for particle and nuclear physics.
Submitted 3 August, 2022;
originally announced August 2022.
-
Score-based Generative Models for Calorimeter Shower Simulation
Authors:
Vinicius Mikuni,
Benjamin Nachman
Abstract:
Score-based generative models are a new class of generative algorithms that have been shown to produce realistic images even in high dimensional spaces, currently surpassing other state-of-the-art models for different benchmark categories and applications. In this work we introduce CaloScore, a score-based generative model for collider physics applied to calorimeter shower generation. Three different diffusion models are investigated using the Fast Calorimeter Simulation Challenge 2022 dataset. CaloScore is the first application of a score-based generative model in collider physics and is able to produce high-fidelity calorimeter images for all datasets, providing an alternative paradigm for calorimeter shower simulation.
Submitted 19 October, 2022; v1 submitted 17 June, 2022;
originally announced June 2022.