-
SDP bounds on quantum codes
Authors:
Gerard Anglès Munné,
Andrew Nemec,
Felix Huber
Abstract:
This paper provides a semidefinite programming hierarchy based on state polynomial optimization to determine the existence of quantum codes with given parameters. The hierarchy is complete, in the sense that if a $(\!(n,K,δ)\!)_2$ code does not exist then a level of the hierarchy is infeasible. It is not limited to stabilizer codes and thus applicable generally. While it is formally dimension-free…
▽ More
This paper provides a semidefinite programming hierarchy based on state polynomial optimization to determine the existence of quantum codes with given parameters. The hierarchy is complete, in the sense that if a $(\!(n,K,δ)\!)_2$ code does not exist then a level of the hierarchy is infeasible. It is not limited to stabilizer codes and thus applicable generally. While it is formally dimension-free, we restrict it to qubit codes through quasi-Clifford algebras. We derive the quantum analog of a range of classical results: first, from an intermediate level a Lovász bound for self-dual quantum codes is recovered. Second, a symmetrization of a minor variation of this Lovász bound recovers the quantum Delsarte bound. Third, a symmetry reduction using the Terwilliger algebra leads to semidefinite programming bounds of size $O(n^4)$. With this we give an alternative proof that there is no $(\!(7,1,4)\!)_2$ quantum code, and show that $(\!(8,9,3)\!)_2$ and $(\!(10,5,4)\!)_2$ codes do not exist.
△ Less
Submitted 19 August, 2024;
originally announced August 2024.
-
The Contribution of XAI for the Safe Development and Certification of AI: An Expert-Based Analysis
Authors:
Benjamin Fresz,
Vincent Philipp Göbels,
Safa Omri,
Danilo Brajovic,
Andreas Aichele,
Janika Kutz,
Jens Neuhüttler,
Marco F. Huber
Abstract:
Developing and certifying safe - or so-called trustworthy - AI has become an increasingly salient issue, especially in light of upcoming regulation such as the EU AI Act. In this context, the black-box nature of machine learning models limits the use of conventional avenues of approach towards certifying complex technical systems. As a potential solution, methods to give insights into this black-b…
▽ More
Developing and certifying safe - or so-called trustworthy - AI has become an increasingly salient issue, especially in light of upcoming regulation such as the EU AI Act. In this context, the black-box nature of machine learning models limits the use of conventional avenues of approach towards certifying complex technical systems. As a potential solution, methods to give insights into this black-box - devised in the field of eXplainable AI (XAI) - could be used. In this study, the potential and shortcomings of such methods for the purpose of safe AI development and certification are discussed in 15 qualitative interviews with experts out of the areas of (X)AI and certification. We find that XAI methods can be a helpful asset for safe AI development, as they can show biases and failures of ML-models, but since certification relies on comprehensive and correct information about technical systems, their impact is expected to be limited.
△ Less
Submitted 22 July, 2024;
originally announced August 2024.
-
Bayesian modelling of VAR precision matrices using stochastic block networks
Authors:
Florian Huber,
Gary Koop,
Massimiliano Marcellino,
Tobias Scheckel
Abstract:
Commonly used priors for Vector Autoregressions (VARs) induce shrinkage on the autoregressive coefficients. Introducing shrinkage on the error covariance matrix is sometimes done but, in the vast majority of cases, without considering the network structure of the shocks and by placing the prior on the lower Cholesky factor of the precision matrix. In this paper, we propose a prior on the VAR error…
▽ More
Commonly used priors for Vector Autoregressions (VARs) induce shrinkage on the autoregressive coefficients. Introducing shrinkage on the error covariance matrix is sometimes done but, in the vast majority of cases, without considering the network structure of the shocks and by placing the prior on the lower Cholesky factor of the precision matrix. In this paper, we propose a prior on the VAR error precision matrix directly. Our prior, which resembles a standard spike and slab prior, models variable inclusion probabilities through a stochastic block model that clusters shocks into groups. Within groups, the probability of having relations across group members is higher (inducing less sparsity) whereas relations across groups imply a lower probability that members of each group are conditionally related. We show in simulations that our approach recovers the true network structure well. Using a US macroeconomic data set, we illustrate how our approach can be used to cluster shocks together and that this feature leads to improved density forecasts.
△ Less
Submitted 23 July, 2024;
originally announced July 2024.
-
ViPro: Enabling and Controlling Video Prediction for Complex Dynamical Scenarios using Procedural Knowledge
Authors:
Patrick Takenaka,
Johannes Maucher,
Marco F. Huber
Abstract:
We propose a novel architecture design for video prediction in order to utilize procedural domain knowledge directly as part of the computational graph of data-driven models. On the basis of new challenging scenarios we show that state-of-the-art video predictors struggle in complex dynamical settings, and highlight that the introduction of prior process knowledge makes their learning problem feas…
▽ More
We propose a novel architecture design for video prediction in order to utilize procedural domain knowledge directly as part of the computational graph of data-driven models. On the basis of new challenging scenarios we show that state-of-the-art video predictors struggle in complex dynamical settings, and highlight that the introduction of prior process knowledge makes their learning problem feasible. Our approach results in the learning of a symbolically addressable interface between data-driven aspects in the model and our dedicated procedural knowledge module, which we utilize in downstream control tasks.
△ Less
Submitted 26 June, 2024;
originally announced July 2024.
-
Large-scale quantum reservoir learning with an analog quantum computer
Authors:
Milan Kornjača,
Hong-Ye Hu,
Chen Zhao,
Jonathan Wurtz,
Phillip Weinberg,
Majd Hamdan,
Andrii Zhdanov,
Sergio H. Cantu,
Hengyun Zhou,
Rodrigo Araiza Bravo,
Kevin Bagnall,
James I. Basham,
Joseph Campo,
Adam Choukri,
Robert DeAngelo,
Paige Frederick,
David Haines,
Julian Hammett,
Ning Hsu,
Ming-Guang Hu,
Florian Huber,
Paul Niklas Jepsen,
Ningyuan Jia,
Thomas Karolyshyn,
Minho Kwon
, et al. (28 additional authors not shown)
Abstract:
Quantum machine learning has gained considerable attention as quantum technology advances, presenting a promising approach for efficiently learning complex data patterns. Despite this promise, most contemporary quantum methods require significant resources for variational parameter optimization and face issues with vanishing gradients, leading to experiments that are either limited in scale or lac…
▽ More
Quantum machine learning has gained considerable attention as quantum technology advances, presenting a promising approach for efficiently learning complex data patterns. Despite this promise, most contemporary quantum methods require significant resources for variational parameter optimization and face issues with vanishing gradients, leading to experiments that are either limited in scale or lack potential for quantum advantage. To address this, we develop a general-purpose, gradient-free, and scalable quantum reservoir learning algorithm that harnesses the quantum dynamics of neutral-atom analog quantum computers to process data. We experimentally implement the algorithm, achieving competitive performance across various categories of machine learning tasks, including binary and multi-class classification, as well as timeseries prediction. Effective and improving learning is observed with increasing system sizes of up to 108 qubits, demonstrating the largest quantum machine learning experiment to date. We further observe comparative quantum kernel advantage in learning tasks by constructing synthetic datasets based on the geometric differences between generated quantum and classical data kernels. Our findings demonstrate the potential of utilizing classically intractable quantum correlations for effective machine learning. We expect these results to stimulate further extensions to different quantum hardware and machine learning paradigms, including early fault-tolerant hardware and generative machine learning tasks.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
Identifying Ordinary Differential Equations for Data-efficient Model-based Reinforcement Learning
Authors:
Tobias Nagel,
Marco F. Huber
Abstract:
The identification of a mathematical dynamics model is a crucial step in the designing process of a controller. However, it is often very difficult to identify the system's governing equations, especially in complex environments that combine physical laws of different disciplines. In this paper, we present a new approach that allows identifying an ordinary differential equation by means of a physi…
▽ More
The identification of a mathematical dynamics model is a crucial step in the designing process of a controller. However, it is often very difficult to identify the system's governing equations, especially in complex environments that combine physical laws of different disciplines. In this paper, we present a new approach that allows identifying an ordinary differential equation by means of a physics-informed machine learning algorithm. Our method introduces a special neural network that allows exploiting prior human knowledge to a certain degree and extends it autonomously, so that the resulting differential equations describe the system as accurately as possible. We validate the method on a Duffing oscillator with simulation data and, additionally, on a cascaded tank example with real-world data. Subsequently, we use the developed algorithm in a model-based reinforcement learning framework by alternately identifying and controlling a system to a target state. We test the performance by swinging-up an inverted pendulum on a cart.
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
Guiding Video Prediction with Explicit Procedural Knowledge
Authors:
Patrick Takenaka,
Johannes Maucher,
Marco F. Huber
Abstract:
We propose a general way to integrate procedural knowledge of a domain into deep learning models. We apply it to the case of video prediction, building on top of object-centric deep models and show that this leads to a better performance than using data-driven models alone. We develop an architecture that facilitates latent space disentanglement in order to use the integrated procedural knowledge,…
▽ More
We propose a general way to integrate procedural knowledge of a domain into deep learning models. We apply it to the case of video prediction, building on top of object-centric deep models and show that this leads to a better performance than using data-driven models alone. We develop an architecture that facilitates latent space disentanglement in order to use the integrated procedural knowledge, and establish a setup that allows the model to learn the procedural interface in the latent space using the downstream task of video prediction. We contrast the performance to a state-of-the-art data-driven approach and show that problems where purely data-driven approaches struggle can be handled by using knowledge about the domain, providing an alternative to simply collecting more data.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Markovian Lifts of Stochastic Volterra Equations in Sobolev Spaces: Solution theory, an Ito Formula and Invariant Measures
Authors:
Florian Huber
Abstract:
We investigate Markovian lifts of stochastic Volterra equations (SVEs) with completely monotone kernels and general coefficients within a class of weighted Sobolev spaces. Our primary focus is developing a comprehensive solution theory for a class of non-local stochastic evolution equations (SEEs) encompassing these Markovian lifts. This enables us to provide conditions for the existence of invari…
▽ More
We investigate Markovian lifts of stochastic Volterra equations (SVEs) with completely monotone kernels and general coefficients within a class of weighted Sobolev spaces. Our primary focus is developing a comprehensive solution theory for a class of non-local stochastic evolution equations (SEEs) encompassing these Markovian lifts. This enables us to provide conditions for the existence of invariant measures for the lifted processes and the corresponding SVE. Another key contribution is an Ito-type formula for the stochastic Volterra equations under consideration.
△ Less
Submitted 18 June, 2024; v1 submitted 14 June, 2024;
originally announced June 2024.
-
Reinforcement learning-based architecture search for quantum machine learning
Authors:
Frederic Rapp,
David A. Kreplin,
Marco F. Huber,
Marco Roth
Abstract:
Quantum machine learning models use encoding circuits to map data into a quantum Hilbert space. While it is well known that the architecture of these circuits significantly influences core properties of the resulting model, they are often chosen heuristically. In this work, we present a novel approach using reinforcement learning techniques to generate problem-specific encoding circuits to improve…
▽ More
Quantum machine learning models use encoding circuits to map data into a quantum Hilbert space. While it is well known that the architecture of these circuits significantly influences core properties of the resulting model, they are often chosen heuristically. In this work, we present a novel approach using reinforcement learning techniques to generate problem-specific encoding circuits to improve the performance of quantum machine learning models. By specifically using a model-based reinforcement learning algorithm, we reduce the number of necessary circuit evaluations during the search, providing a sample-efficient framework. In contrast to previous search algorithms, our method uses a layered circuit structure that significantly reduces the search space. Additionally, our approach can account for multiple objectives such as solution quality, hardware restrictions and circuit depth. We benchmark our tailored circuits against various reference models, including models with problem-agnostic circuits and classical models. Our results highlight the effectiveness of problem-specific encoding circuits in enhancing QML model performance.
△ Less
Submitted 5 August, 2024; v1 submitted 4 June, 2024;
originally announced June 2024.
-
Quantum quench dynamics as a shortcut to adiabaticity
Authors:
Alexander Lukin,
Benjamin F. Schiffer,
Boris Braverman,
Sergio H. Cantu,
Florian Huber,
Alexei Bylinskii,
Jesse Amato-Grill,
Nishad Maskara,
Madelyn Cain,
Dominik S. Wild,
Rhine Samajdar,
Mikhail D. Lukin
Abstract:
The ability to efficiently prepare ground states of quantum Hamiltonians via adiabatic protocols is typically limited by the smallest energy gap encountered during the quantum evolution. This presents a key obstacle for quantum simulation and realizations of adiabatic quantum algorithms in large systems, particularly when the adiabatic gap vanishes exponentially with system size. Using QuEra's Aqu…
▽ More
The ability to efficiently prepare ground states of quantum Hamiltonians via adiabatic protocols is typically limited by the smallest energy gap encountered during the quantum evolution. This presents a key obstacle for quantum simulation and realizations of adiabatic quantum algorithms in large systems, particularly when the adiabatic gap vanishes exponentially with system size. Using QuEra's Aquila programmable quantum simulator based on Rydberg atom arrays, we experimentally demonstrate a method to circumvent such limitations. Specifically, we develop and test a "sweep-quench-sweep" quantum algorithm in which the incorporation of a quench step serves as a remedy to the diverging adiabatic timescale. These quenches introduce a macroscopic reconfiguration between states separated by an extensively large Hamming distance, akin to quantum many-body scars. Our experiments show that this approach significantly outperforms the adiabatic algorithm, illustrating that such quantum quench algorithms can provide a shortcut to adiabaticity for large-scale many-body quantum systems.
△ Less
Submitted 31 May, 2024;
originally announced May 2024.
-
HIPer: A Human-Inspired Scene Perception Model for Multifunctional Mobile Robots
Authors:
Florenz Graf,
Jochen Lindermayr,
Birgit Graf,
Werner Kraus,
Marco F. Huber
Abstract:
Taking over arbitrary tasks like humans do with a mobile service robot in open-world settings requires a holistic scene perception for decision-making and high-level control. This paper presents a human-inspired scene perception model to minimize the gap between human and robotic capabilities. The approach takes over fundamental neuroscience concepts, such as a triplet perception split into recogn…
▽ More
Taking over arbitrary tasks like humans do with a mobile service robot in open-world settings requires a holistic scene perception for decision-making and high-level control. This paper presents a human-inspired scene perception model to minimize the gap between human and robotic capabilities. The approach takes over fundamental neuroscience concepts, such as a triplet perception split into recognition, knowledge representation, and knowledge interpretation. A recognition system splits the background and foreground to integrate exchangeable image-based object detectors and SLAM, a multi-layer knowledge base represents scene information in a hierarchical structure and offers interfaces for high-level control, and knowledge interpretation methods deploy spatio-temporal scene analysis and perceptual learning for self-adjustment. A single-setting ablation study is used to evaluate the impact of each component on the overall performance for a fetch-and-carry scenario in two simulated and one real-world environment.
△ Less
Submitted 27 April, 2024;
originally announced April 2024.
-
Polynomial interacting particle systems and non-linear SPDEs for market capitalization curves
Authors:
Christa Cuchiero,
Florian Huber
Abstract:
Motivated by the robustness of the capital distribution curves, we study the behavior of a certain polynomial equity market model as the number of companies goes to infinity. More precisely, we extend volatility-stabilized market models introduced by Fernholz et al. by allowing for a common noise term such that the models remain polynomial. As the number of companies approaches infinity, we show t…
▽ More
Motivated by the robustness of the capital distribution curves, we study the behavior of a certain polynomial equity market model as the number of companies goes to infinity. More precisely, we extend volatility-stabilized market models introduced by Fernholz et al. by allowing for a common noise term such that the models remain polynomial. As the number of companies approaches infinity, we show that the limit of the empirical measure of the $N$-company system converges to the unique solution of a degenerate, non-linear SPDE. The obtained limit also has a representation as the conditional probability of the solution to a certain McKean-Vlasov SDE. Together with its conditional, this is again a polynomial process for which we can prove pathwise uniqueness as well as regularity properties for the marginal densities. We also provide conditional propagation of chaos results and numerical implementations of the particle system as well as its limiting equations.
△ Less
Submitted 17 April, 2024; v1 submitted 16 April, 2024;
originally announced April 2024.
-
Overview of Publicly Available Degradation Data Sets for Tasks within Prognostics and Health Management
Authors:
Fabian Mauthe,
Christopher Braun,
Julian Raible,
Peter Zeiler,
Marco F. Huber
Abstract:
Central to the efficacy of prognostics and health management methods is the acquisition and analysis of degradation data, which encapsulates the evolving health condition of engineering systems over time. Degradation data serves as a rich source of information, offering invaluable insights into the underlying degradation processes, failure modes, and performance trends of engineering systems. This…
▽ More
Central to the efficacy of prognostics and health management methods is the acquisition and analysis of degradation data, which encapsulates the evolving health condition of engineering systems over time. Degradation data serves as a rich source of information, offering invaluable insights into the underlying degradation processes, failure modes, and performance trends of engineering systems. This paper provides an overview of publicly available degradation data sets.
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
Measurement of groomed event shape observables in deep-inelastic electron-proton scattering at HERA
Authors:
The H1 collaboration,
V. Andreev,
M. Arratia,
A. Baghdasaryan,
A. Baty,
K. Begzsuren,
A. Bolz,
V. Boudry,
G. Brandt,
D. Britzger,
A. Buniatyan,
L. Bystritskaya,
A. J. Campbell,
K. B. Cantun Avila,
K. Cerny,
V. Chekelian,
Z. Chen,
J. G. Contreras,
J. Cvach,
J. B. Dainton,
K. Daum,
A. Deshpande,
C. Diaconu,
A. Drees,
G. Eckerlin
, et al. (123 additional authors not shown)
Abstract:
The H1 Collaboration at HERA reports the first measurement of groomed event shape observables in deep inelastic electron-proton scattering (DIS) at $\sqrt{s}=319$ GeV, using data recorded between the years 2003 and 2007 with an integrated luminosity of $351$ pb$^{-1}$. Event shapes provide incisive probes of perturbative and non-perturbative QCD. Grooming techniques have been used for jet measurem…
▽ More
The H1 Collaboration at HERA reports the first measurement of groomed event shape observables in deep inelastic electron-proton scattering (DIS) at $\sqrt{s}=319$ GeV, using data recorded between the years 2003 and 2007 with an integrated luminosity of $351$ pb$^{-1}$. Event shapes provide incisive probes of perturbative and non-perturbative QCD. Grooming techniques have been used for jet measurements in hadronic collisions; this paper presents the first application of grooming to DIS data. The analysis is carried out in the Breit frame, utilizing the novel Centauro jet clustering algorithm that is designed for DIS event topologies. Events are required to have squared momentum-transfer $Q^2 > 150$ GeV$^2$ and inelasticity $ 0.2 < y < 0.7$. We report measurements of the production cross section of groomed event 1-jettiness and groomed invariant mass for several choices of grooming parameter. Monte Carlo model calculations and analytic calculations based on Soft Collinear Effective Theory are compared to the measurements.
△ Less
Submitted 1 August, 2024; v1 submitted 15 March, 2024;
originally announced March 2024.
-
Measurement of the 1-jettiness event shape observable in deep-inelastic electron-proton scattering at HERA
Authors:
The H1 collaboration,
V. Andreev,
M. Arratia,
A. Baghdasaryan,
A. Baty,
K. Begzsuren,
A. Bolz,
V. Boudry,
G. Brandt,
D. Britzger,
A. Buniatyan,
L. Bystritskaya,
A. J. Campbell,
K. B. Cantun Avila,
K. Cerny,
V. Chekelian,
Z. Chen,
J. G. Contreras,
J. Cvach,
J. B. Dainton,
K. Daum,
A. Deshpande,
C. Diaconu,
A. Drees,
G. Eckerlin
, et al. (124 additional authors not shown)
Abstract:
The H1 Collaboration reports the first measurement of the 1-jettiness event shape observable $τ_1^b$ in neutral-current deep-inelastic electron-proton scattering (DIS). The observable $τ_1^b$ is equivalent to a thrust observable defined in the Breit frame. The data sample was collected at the HERA $ep$ collider in the years 2003-2007 with center-of-mass energy of $\sqrt{s}=319\,\text{GeV}$, corres…
▽ More
The H1 Collaboration reports the first measurement of the 1-jettiness event shape observable $τ_1^b$ in neutral-current deep-inelastic electron-proton scattering (DIS). The observable $τ_1^b$ is equivalent to a thrust observable defined in the Breit frame. The data sample was collected at the HERA $ep$ collider in the years 2003-2007 with center-of-mass energy of $\sqrt{s}=319\,\text{GeV}$, corresponding to an integrated luminosity of $351.1\,\text{pb}^{-1}$. Triple differential cross sections are provided as a function of $τ_1^b$, event virtuality $Q^2$, and inelasticity $y$, in the kinematic region $Q^2>150\,\text{GeV}^{2}$. Single differential cross section are provided as a function of $τ_1^b$ in a limited kinematic range. Double differential cross sections are measured, in contrast, integrated over $τ_1^b$ and represent the inclusive neutral-current DIS cross section measured as a function of $Q^2$ and $y$. The data are compared to a variety of predictions and include classical and modern Monte Carlo event generators, predictions in fixed-order perturbative QCD where calculations up to $\mathcal{O}(α_s^3)$ are available for $τ_1^b$ or inclusive DIS, and resummed predictions at next-to-leading logarithmic accuracy matched to fixed order predictions at $\mathcal{O}(α_s^2)$. These comparisons reveal sensitivity of the 1-jettiness observable to QCD parton shower and resummation effects, as well as the modeling of hadronization and fragmentation. Within their range of validity, the fixed-order predictions provide a good description of the data. Monte Carlo event generators are predictive over the full measured range and hence their underlying models and parameters can be constrained by comparing to the presented data.
△ Less
Submitted 15 March, 2024;
originally announced March 2024.
-
Observation and differential cross section measurement of neutral current DIS events with an empty hemisphere in the Breit frame
Authors:
The H1 collaboration,
V. Andreev,
M. Arratia,
A. Baghdasaryan,
A. Baty,
K. Begzsuren,
A. Bolz,
V. Boudry,
G. Brandt,
D. Britzger,
A. Buniatyan,
L. Bystritskaya,
A. J. Campbell,
K. B. Cantun Avila,
K. Cerny,
V. Chekelian,
Z. Chen,
J. G. Contreras,
J. Cvach,
J. B. Dainton,
K. Daum,
A. Deshpande,
C. Diaconu,
A. Drees,
G. Eckerlin
, et al. (124 additional authors not shown)
Abstract:
The Breit frame provides a natural frame to analyze lepton-proton scattering events. In this reference frame, the parton model hard interactions between a quark and an exchanged boson defines the coordinate system such that the struck quark is back-scattered along the virtual photon momentum direction. In Quantum Chromodynamics (QCD), higher order perturbative or non-perturbative effects can chang…
▽ More
The Breit frame provides a natural frame to analyze lepton-proton scattering events. In this reference frame, the parton model hard interactions between a quark and an exchanged boson defines the coordinate system such that the struck quark is back-scattered along the virtual photon momentum direction. In Quantum Chromodynamics (QCD), higher order perturbative or non-perturbative effects can change this picture drastically. As Bjorken-$x$ decreases below one half, a rather peculiar event signature is predicted with increasing probability, where no radiation is present in one of the two Breit-frame hemispheres and all emissions are to be found in the other hemisphere. At higher orders in $α_s$ or in the presence of soft QCD effects, predictions of the rate of these events are far from trivial, and that motivates measurements with real data. We report on the first observation of the empty current hemisphere events in electron-proton collisions at the HERA collider using data recorded with the H1 detector at a center-of-mass energy of 319 GeV. The fraction of inclusive neutral-current DIS events with an empty hemisphere is found to be $0.0112 \pm 3.9\,\%_\text{stat} \pm 4.5\,\%_\text{syst} \pm 1.6\,\%_\text{mod}$ in the selected kinematic region of $150< Q^2<1500$ GeV$^2$ and inelasticity $0.14< y<0.7$. The data sample corresponds to an integrated luminosity of 351.1 pb$^{-1}$, sufficient to enable differential cross section measurements of these events. The results show an enhanced discriminating power at lower Bjorken-$x$ among different Monte Carlo event generator predictions.
△ Less
Submitted 1 August, 2024; v1 submitted 13 March, 2024;
originally announced March 2024.
-
RoboGrind: Intuitive and Interactive Surface Treatment with Industrial Robots
Authors:
Benjamin Alt,
Florian Stöckl,
Silvan Müller,
Christopher Braun,
Julian Raible,
Saad Alhasan,
Oliver Rettig,
Lukas Ringle,
Darko Katic,
Rainer Jäkel,
Michael Beetz,
Marcus Strand,
Marco F. Huber
Abstract:
Surface treatment tasks such as grinding, sanding or polishing are a vital step of the value chain in many industries, but are notoriously challenging to automate. We present RoboGrind, an integrated system for the intuitive, interactive automation of surface treatment tasks with industrial robots. It combines a sophisticated 3D perception pipeline for surface scanning and automatic defect identif…
▽ More
Surface treatment tasks such as grinding, sanding or polishing are a vital step of the value chain in many industries, but are notoriously challenging to automate. We present RoboGrind, an integrated system for the intuitive, interactive automation of surface treatment tasks with industrial robots. It combines a sophisticated 3D perception pipeline for surface scanning and automatic defect identification, an interactive voice-controlled wizard system for the AI-assisted bootstrapping and parameterization of robot programs, and an automatic planning and execution pipeline for force-controlled robotic surface treatment. RoboGrind is evaluated both under laboratory and real-world conditions in the context of refabricating fiberglass wind turbine blades.
△ Less
Submitted 27 February, 2024; v1 submitted 26 February, 2024;
originally announced February 2024.
-
On Convolutional Vision Transformers for Yield Prediction
Authors:
Alvin Inderka,
Florian Huber,
Volker Steinhage
Abstract:
While a variety of methods offer good yield prediction on histogrammed remote sensing data, vision Transformers are only sparsely represented in the literature. The Convolution vision Transformer (CvT) is being tested to evaluate vision Transformers that are currently achieving state-of-the-art results in many other vision tasks. CvT combines some of the advantages of convolution with the advantag…
▽ More
While a variety of methods offer good yield prediction on histogrammed remote sensing data, vision Transformers are only sparsely represented in the literature. The Convolution vision Transformer (CvT) is being tested to evaluate vision Transformers that are currently achieving state-of-the-art results in many other vision tasks. CvT combines some of the advantages of convolution with the advantages of dynamic attention and global context fusion of Transformers. It performs worse than widely tested methods such as XGBoost and CNNs, but shows that Transformers have potential to improve yield prediction.
△ Less
Submitted 8 February, 2024;
originally announced February 2024.
-
Nowcasting economic activity in European regions using a mixed-frequency dynamic factor model
Authors:
Luca Barbaglia,
Lorenzo Frattarolo,
Niko Hauzenberger,
Dominik Hirschbuehl,
Florian Huber,
Luca Onorante,
Michael Pfarrhofer,
Luca Tiozzo Pezzoli
Abstract:
Timely information about the state of regional economies can be essential for planning, implementing and evaluating locally targeted economic policies. However, European regional accounts for output are published at an annual frequency and with a two-year delay. To obtain robust and more timely measures in a computationally efficient manner, we propose a mixed-frequency dynamic factor model that a…
▽ More
Timely information about the state of regional economies can be essential for planning, implementing and evaluating locally targeted economic policies. However, European regional accounts for output are published at an annual frequency and with a two-year delay. To obtain robust and more timely measures in a computationally efficient manner, we propose a mixed-frequency dynamic factor model that accounts for national information to produce high-frequency estimates of the regional gross value added (GVA). We show that our model produces reliable nowcasts of GVA in 162 regions across 12 European countries.
△ Less
Submitted 18 January, 2024;
originally announced January 2024.
-
Probing quantum floating phases in Rydberg atom arrays
Authors:
Jin Zhang,
Sergio H. Cantú,
Fangli Liu,
Alexei Bylinskii,
Boris Braverman,
Florian Huber,
Jesse Amato-Grill,
Alexander Lukin,
Nathan Gemelke,
Alexander Keesling,
Sheng-Tao Wang,
Y. Meurice,
S. -W. Tsai
Abstract:
The floating phase, a critical incommensurate phase, has been theoretically predicted as a potential intermediate phase between crystalline ordered and disordered phases. In this study, we investigate the different quantum phases that arise in ladder arrays comprising up to 92 neutral-atom qubits and experimentally observe the emergence of the quantum floating phase. We analyze the site-resolved R…
▽ More
The floating phase, a critical incommensurate phase, has been theoretically predicted as a potential intermediate phase between crystalline ordered and disordered phases. In this study, we investigate the different quantum phases that arise in ladder arrays comprising up to 92 neutral-atom qubits and experimentally observe the emergence of the quantum floating phase. We analyze the site-resolved Rydberg state densities and the distribution of state occurrences. The site-resolved measurement reveals the formation of domain walls within the commensurate ordered phase, which subsequently proliferate and give rise to the floating phase with incommensurate quasi-long-range order. By analyzing the Fourier spectra of the Rydberg density-density correlations, we observe clear signatures of the incommensurate wave order of the floating phase. Furthermore, as the experimental system sizes increase, we show that the wave vectors approach a continuum of values incommensurate with the lattice. Our work motivates future studies to further explore the nature of commensurate-incommensurate phase transitions and their non-equilibrium physics.
△ Less
Submitted 15 January, 2024;
originally announced January 2024.
-
auto-sktime: Automated Time Series Forecasting
Authors:
Marc-André Zöller,
Marius Lindauer,
Marco F. Huber
Abstract:
In today's data-driven landscape, time series forecasting is pivotal in decision-making across various sectors. Yet, the proliferation of more diverse time series data, coupled with the expanding landscape of available forecasting methods, poses significant challenges for forecasters. To meet the growing demand for efficient forecasting, we introduce auto-sktime, a novel framework for automated ti…
▽ More
In today's data-driven landscape, time series forecasting is pivotal in decision-making across various sectors. Yet, the proliferation of more diverse time series data, coupled with the expanding landscape of available forecasting methods, poses significant challenges for forecasters. To meet the growing demand for efficient forecasting, we introduce auto-sktime, a novel framework for automated time series forecasting. The proposed framework uses the power of automated machine learning (AutoML) techniques to automate the creation of the entire forecasting pipeline. The framework employs Bayesian optimization, to automatically construct pipelines from statistical, machine learning (ML) and deep neural network (DNN) models. Furthermore, we propose three essential improvements to adapt AutoML to time series data. First, pipeline templates to account for the different supported forecasting models. Second, a novel warm-starting technique to start the optimization from prior optimization runs. Third, we adapt multi-fidelity optimizations to make them applicable to a search space containing statistical, ML and DNN models. Experimental results on 64 diverse real-world time series datasets demonstrate the effectiveness and efficiency of the framework, outperforming traditional methods while requiring minimal human involvement.
△ Less
Submitted 30 April, 2024; v1 submitted 13 December, 2023;
originally announced December 2023.
-
Bayesian Nonlinear Regression using Sums of Simple Functions
Authors:
Florian Huber
Abstract:
This paper proposes a new Bayesian machine learning model that can be applied to large datasets arising in macroeconomics. Our framework sums over many simple two-component location mixtures. The transition between components is determined by a logistic function that depends on a single threshold variable and two hyperparameters. Each of these individual models only accounts for a minor portion of…
▽ More
This paper proposes a new Bayesian machine learning model that can be applied to large datasets arising in macroeconomics. Our framework sums over many simple two-component location mixtures. The transition between components is determined by a logistic function that depends on a single threshold variable and two hyperparameters. Each of these individual models only accounts for a minor portion of the variation in the endogenous variables. But many of them are capable of capturing arbitrary nonlinear conditional mean relations. Conjugate priors enable fast and efficient inference. In simulations, we show that our approach produces accurate point and density forecasts. In a real-data exercise, we forecast US macroeconomic aggregates and consider the nonlinear effects of financial shocks in a large-scale nonlinear VAR.
△ Less
Submitted 4 December, 2023;
originally announced December 2023.
-
Predictive Density Combination Using a Tree-Based Synthesis Function
Authors:
Tony Chernis,
Niko Hauzenberger,
Florian Huber,
Gary Koop,
James Mitchell
Abstract:
Bayesian predictive synthesis (BPS) provides a method for combining multiple predictive distributions based on agent/expert opinion analysis theory and encompasses a range of existing density forecast pooling methods. The key ingredient in BPS is a ``synthesis'' function. This is typically specified parametrically as a dynamic linear regression. In this paper, we develop a nonparametric treatment…
▽ More
Bayesian predictive synthesis (BPS) provides a method for combining multiple predictive distributions based on agent/expert opinion analysis theory and encompasses a range of existing density forecast pooling methods. The key ingredient in BPS is a ``synthesis'' function. This is typically specified parametrically as a dynamic linear regression. In this paper, we develop a nonparametric treatment of the synthesis function using regression trees. We show the advantages of our tree-based approach in two macroeconomic forecasting applications. The first uses density forecasts for GDP growth from the euro area's Survey of Professional Forecasters. The second combines density forecasts of US inflation produced by many regression models involving different predictors. Both applications demonstrate the benefits -- in terms of improved forecast accuracy and interpretability -- of modeling the synthesis function nonparametrically.
△ Less
Submitted 21 November, 2023;
originally announced November 2023.
-
Improving the Effectiveness of Deep Generative Data
Authors:
Ruyu Wang,
Sabrina Schmedding,
Marco F. Huber
Abstract:
Recent deep generative models (DGMs) such as generative adversarial networks (GANs) and diffusion probabilistic models (DPMs) have shown their impressive ability in generating high-fidelity photorealistic images. Although looking appealing to human eyes, training a model on purely synthetic images for downstream image processing tasks like image classification often results in an undesired perform…
▽ More
Recent deep generative models (DGMs) such as generative adversarial networks (GANs) and diffusion probabilistic models (DPMs) have shown their impressive ability in generating high-fidelity photorealistic images. Although looking appealing to human eyes, training a model on purely synthetic images for downstream image processing tasks like image classification often results in an undesired performance drop compared to training on real data. Previous works have demonstrated that enhancing a real dataset with synthetic images from DGMs can be beneficial. However, the improvements were subjected to certain circumstances and yet were not comparable to adding the same number of real images. In this work, we propose a new taxonomy to describe factors contributing to this commonly observed phenomenon and investigate it on the popular CIFAR-10 dataset. We hypothesize that the Content Gap accounts for a large portion of the performance drop when using synthetic images from DGM and propose strategies to better utilize them in downstream tasks. Extensive experiments on multiple datasets showcase that our method outperforms baselines on downstream classification tasks both in case of training on synthetic only (Synthetic-to-Real) and training on a mix of real and synthetic data (Data Augmentation), particularly in the data-scarce scenario.
△ Less
Submitted 8 November, 2023; v1 submitted 7 November, 2023;
originally announced November 2023.
-
Comment on "Photons can tell 'contradictory' answer about where they have been''
Authors:
Gregory Reznik,
Carlotta Versmold,
Jan Dziewior,
Florian Huber,
Harald Weinfurter,
Justin Dressel,
Lev Vaidman
Abstract:
Yuan and Feng [Eur. Phys. J. Plus 138:70, 2023] recently proposed a modification of the nested Mach-Zehnder interferometer experiment performed by Danan et al. [Phys. Rev. Lett. 111:240402, 2013] and argued that photons give "contradictory" answers about where they have been, when traces are locally imprinted on them in different ways. They concluded that their results are comprehensible from what…
▽ More
Yuan and Feng [Eur. Phys. J. Plus 138:70, 2023] recently proposed a modification of the nested Mach-Zehnder interferometer experiment performed by Danan et al. [Phys. Rev. Lett. 111:240402, 2013] and argued that photons give "contradictory" answers about where they have been, when traces are locally imprinted on them in different ways. They concluded that their results are comprehensible from what they call the "three-path interference viewpoint", but difficult to explain from the "discontinuous trajectory" viewpoint advocated by Danan et al. We argue that the weak trace approach (the basis of the "discontinuous trajectory" viewpoint) provides a consistent explanation of the Yuan-Feng experiment. The contradictory messages of the photons are just another example of photons lying about where they have been when the experimental method of Danan et al. is applied in an inappropriate setup.
△ Less
Submitted 6 November, 2023;
originally announced November 2023.
-
Uncertainty relations from state polynomial optimization
Authors:
Moisés Bermejo Morán,
Felix Huber
Abstract:
Uncertainty relations are a fundamental feature of quantum mechanics. How can these relations be found systematically? Here we develop a semidefinite programming hierarchy for additive uncertainty relations in the variances of non-commuting observables. Our hierarchy is built on the state polynomial optimization framework, also known as scalar extension. The hierarchy is complete, in the sense tha…
▽ More
Uncertainty relations are a fundamental feature of quantum mechanics. How can these relations be found systematically? Here we develop a semidefinite programming hierarchy for additive uncertainty relations in the variances of non-commuting observables. Our hierarchy is built on the state polynomial optimization framework, also known as scalar extension. The hierarchy is complete, in the sense that it converges to tight uncertainty relations. We improve upon upper bounds for all 1292 additive uncertainty relations on up to nine operators for which a tight bound is not known. The bounds are dimension-free and depend entirely on the algebraic relations among the operators. The techniques apply to a range of scenarios, including Pauli, Heisenberg-Weyl, and fermionic operators, and generalize to higher order moments and multiplicative uncertainty relations.
△ Less
Submitted 6 August, 2024; v1 submitted 1 October, 2023;
originally announced October 2023.
-
Model Reporting for Certifiable AI: A Proposal from Merging EU Regulation into AI Development
Authors:
Danilo Brajovic,
Niclas Renner,
Vincent Philipp Goebels,
Philipp Wagner,
Benjamin Fresz,
Martin Biller,
Mara Klaeb,
Janika Kutz,
Jens Neuhuettler,
Marco F. Huber
Abstract:
Despite large progress in Explainable and Safe AI, practitioners suffer from a lack of regulation and standards for AI safety. In this work we merge recent regulation efforts by the European Union and first proposals for AI guidelines with recent trends in research: data and model cards. We propose the use of standardized cards to document AI applications throughout the development process. Our ma…
▽ More
Despite large progress in Explainable and Safe AI, practitioners suffer from a lack of regulation and standards for AI safety. In this work we merge recent regulation efforts by the European Union and first proposals for AI guidelines with recent trends in research: data and model cards. We propose the use of standardized cards to document AI applications throughout the development process. Our main contribution is the introduction of use-case and operation cards, along with updates for data and model cards to cope with regulatory requirements. We reference both recent research as well as the source of the regulation in our cards and provide references to additional support material and toolboxes whenever possible. The goal is to design cards that help practitioners develop safe AI systems throughout the development process, while enabling efficient third-party auditing of AI applications, being easy to understand, and building trust in the system. Our work incorporates insights from interviews with certification experts as well as developers and individuals working with the developed AI applications.
△ Less
Submitted 21 July, 2023;
originally announced July 2023.
-
Self-supervised Optimization of Hand Pose Estimation using Anatomical Features and Iterative Learning
Authors:
Christian Jauch,
Timo Leitritz,
Marco F. Huber
Abstract:
Manual assembly workers face increasing complexity in their work. Human-centered assistance systems could help, but object recognition as an enabling technology hinders sophisticated human-centered design of these systems. At the same time, activity recognition based on hand poses suffers from poor pose estimation in complex usage scenarios, such as wearing gloves. This paper presents a self-super…
▽ More
Manual assembly workers face increasing complexity in their work. Human-centered assistance systems could help, but object recognition as an enabling technology hinders sophisticated human-centered design of these systems. At the same time, activity recognition based on hand poses suffers from poor pose estimation in complex usage scenarios, such as wearing gloves. This paper presents a self-supervised pipeline for adapting hand pose estimation to specific use cases with minimal human interaction. This enables cheap and robust hand posebased activity recognition. The pipeline consists of a general machine learning model for hand pose estimation trained on a generalized dataset, spatial and temporal filtering to account for anatomical constraints of the hand, and a retraining step to improve the model. Different parameter combinations are evaluated on a publicly available and annotated dataset. The best parameter and model combination is then applied to unlabelled videos from a manual assembly scenario. The effectiveness of the pipeline is demonstrated by training an activity recognition as a downstream task in the manual assembly scenario.
△ Less
Submitted 6 July, 2023;
originally announced July 2023.
-
Automated Machine Learning for Remaining Useful Life Predictions
Authors:
Marc-André Zöller,
Fabian Mauthe,
Peter Zeiler,
Marius Lindauer,
Marco F. Huber
Abstract:
Being able to predict the remaining useful life (RUL) of an engineering system is an important task in prognostics and health management. Recently, data-driven approaches to RUL predictions are becoming prevalent over model-based approaches since no underlying physical knowledge of the engineering system is required. Yet, this just replaces required expertise of the underlying physics with machine…
▽ More
Being able to predict the remaining useful life (RUL) of an engineering system is an important task in prognostics and health management. Recently, data-driven approaches to RUL predictions are becoming prevalent over model-based approaches since no underlying physical knowledge of the engineering system is required. Yet, this just replaces required expertise of the underlying physics with machine learning (ML) expertise, which is often also not available. Automated machine learning (AutoML) promises to build end-to-end ML pipelines automatically enabling domain experts without ML expertise to create their own models. This paper introduces AutoRUL, an AutoML-driven end-to-end approach for automatic RUL predictions. AutoRUL combines fine-tuned standard regression methods to an ensemble with high predictive power. By evaluating the proposed method on eight real-world and synthetic datasets against state-of-the-art hand-crafted models, we show that AutoML provides a viable alternative to hand-crafted data-driven RUL predictions. Consequently, creating RUL predictions can be made more accessible for domain experts using AutoML by eliminating ML expertise from data-driven model construction.
△ Less
Submitted 21 June, 2023;
originally announced June 2023.
-
Aquila: QuEra's 256-qubit neutral-atom quantum computer
Authors:
Jonathan Wurtz,
Alexei Bylinskii,
Boris Braverman,
Jesse Amato-Grill,
Sergio H. Cantu,
Florian Huber,
Alexander Lukin,
Fangli Liu,
Phillip Weinberg,
John Long,
Sheng-Tao Wang,
Nathan Gemelke,
Alexander Keesling
Abstract:
The neutral-atom quantum computer "Aquila" is QuEra's latest device available through the Braket cloud service on Amazon Web Services (AWS). Aquila is a "field-programmable qubit array" (FPQA) operated as an analog Hamiltonian simulator on a user-configurable architecture, executing programmable coherent quantum dynamics on up to 256 neutral-atom qubits. This whitepaper serves as an overview of Aq…
▽ More
The neutral-atom quantum computer "Aquila" is QuEra's latest device available through the Braket cloud service on Amazon Web Services (AWS). Aquila is a "field-programmable qubit array" (FPQA) operated as an analog Hamiltonian simulator on a user-configurable architecture, executing programmable coherent quantum dynamics on up to 256 neutral-atom qubits. This whitepaper serves as an overview of Aquila and its capabilities: how it works under the hood, key performance benchmarks, and examples that demonstrate some quintessential applications. This includes an overview of neutral-atom quantum computing, as well as five examples of increasing complexity from single-qubit dynamics to combinatorial optimization, implemented on Aquila. This whitepaper is intended for readers who are interested in learning more about neutral-atom quantum computing, as a guide for those who are ready to start using Aquila, and as a reference point for its performance as an analog quantum computer.
△ Less
Submitted 20 June, 2023;
originally announced June 2023.
-
Fast and Order-invariant Inference in Bayesian VARs with Non-Parametric Shocks
Authors:
Florian Huber,
Gary Koop
Abstract:
The shocks which hit macroeconomic models such as Vector Autoregressions (VARs) have the potential to be non-Gaussian, exhibiting asymmetries and fat tails. This consideration motivates the VAR developed in this paper which uses a Dirichlet process mixture (DPM) to model the shocks. However, we do not follow the obvious strategy of simply modeling the VAR errors with a DPM since this would lead to…
▽ More
The shocks which hit macroeconomic models such as Vector Autoregressions (VARs) have the potential to be non-Gaussian, exhibiting asymmetries and fat tails. This consideration motivates the VAR developed in this paper which uses a Dirichlet process mixture (DPM) to model the shocks. However, we do not follow the obvious strategy of simply modeling the VAR errors with a DPM since this would lead to computationally infeasible Bayesian inference in larger VARs and potentially a sensitivity to the way the variables are ordered in the VAR. Instead we develop a particular additive error structure inspired by Bayesian nonparametric treatments of random effects in panel data models. We show that this leads to a model which allows for computationally fast and order-invariant inference in large VARs with nonparametric shocks. Our empirical results with nonparametric VARs of various dimensions shows that nonparametric treatment of the VAR errors is particularly useful in periods such as the financial crisis and the pandemic.
△ Less
Submitted 26 May, 2023;
originally announced May 2023.
-
Towards Optimal Energy Management Strategy for Hybrid Electric Vehicle with Reinforcement Learning
Authors:
Xinyang Wu,
Elisabeth Wedernikow,
Christof Nitsche,
Marco F. Huber
Abstract:
In recent years, the development of Artificial Intelligence (AI) has shown tremendous potential in diverse areas. Among them, reinforcement learning (RL) has proven to be an effective solution for learning intelligent control strategies. As an inevitable trend for mitigating climate change, hybrid electric vehicles (HEVs) rely on efficient energy management strategies (EMS) to minimize energy cons…
▽ More
In recent years, the development of Artificial Intelligence (AI) has shown tremendous potential in diverse areas. Among them, reinforcement learning (RL) has proven to be an effective solution for learning intelligent control strategies. As an inevitable trend for mitigating climate change, hybrid electric vehicles (HEVs) rely on efficient energy management strategies (EMS) to minimize energy consumption. Many researchers have employed RL to learn optimal EMS for specific vehicle models. However, most of these models tend to be complex and proprietary, making them unsuitable for broad applicability. This paper presents a novel framework, in which we implement and integrate RL-based EMS with the open-source vehicle simulation tool called FASTSim. The learned RL-based EMSs are evaluated on various vehicle models using different test drive cycles and prove to be effective in improving energy efficiency.
△ Less
Submitted 21 May, 2023;
originally announced May 2023.
-
Coarsened Bayesian VARs -- Correcting BVARs for Incorrect Specification
Authors:
Florian Huber,
Massimiliano Marcellino
Abstract:
Model mis-specification in multivariate econometric models can strongly influence quantities of interest such as structural parameters, forecast distributions or responses to structural shocks, even more so if higher-order forecasts or responses are considered, due to parameter convolution. We propose a simple method for addressing these specification issues in the context of Bayesian VARs. Our me…
▽ More
Model mis-specification in multivariate econometric models can strongly influence quantities of interest such as structural parameters, forecast distributions or responses to structural shocks, even more so if higher-order forecasts or responses are considered, due to parameter convolution. We propose a simple method for addressing these specification issues in the context of Bayesian VARs. Our method, called coarsened Bayesian VARs (cBVARs), replaces the exact likelihood with a coarsened likelihood that takes into account that the model might be mis-specified along important but unknown dimensions. Coupled with a conjugate prior, this results in a computationally simple model. As opposed to more flexible specifications, our approach avoids overfitting, is simple to implement and estimation is fast. The resulting cBVAR performs well in simulations for several types of mis-specification. Applied to US data, cBVARs improve point and density forecasts compared to standard BVARs, and lead to milder but more persistent negative effects of uncertainty shocks on output.
△ Less
Submitted 26 May, 2023; v1 submitted 16 April, 2023;
originally announced April 2023.
-
Grouping Shapley Value Feature Importances of Random Forests for explainable Yield Prediction
Authors:
Florian Huber,
Hannes Engler,
Anna Kicherer,
Katja Herzog,
Reinhard Töpfer,
Volker Steinhage
Abstract:
Explainability in yield prediction helps us fully explore the potential of machine learning models that are already able to achieve high accuracy for a variety of yield prediction scenarios. The data included for the prediction of yields are intricate and the models are often difficult to understand. However, understanding the models can be simplified by using natural groupings of the input featur…
▽ More
Explainability in yield prediction helps us fully explore the potential of machine learning models that are already able to achieve high accuracy for a variety of yield prediction scenarios. The data included for the prediction of yields are intricate and the models are often difficult to understand. However, understanding the models can be simplified by using natural groupings of the input features. Grouping can be achieved, for example, by the time the features are captured or by the sensor used to do so. The state-of-the-art for interpreting machine learning models is currently defined by the game-theoretic approach of Shapley values. To handle groups of features, the calculated Shapley values are typically added together, ignoring the theoretical limitations of this approach. We explain the concept of Shapley values directly computed for predefined groups of features and introduce an algorithm to compute them efficiently on tree structures. We provide a blueprint for designing swarm plots that combine many local explanations for global understanding. Extensive evaluation of two different yield prediction problems shows the worth of our approach and demonstrates how we can enable a better understanding of yield prediction models in the future, ultimately leading to mutual enrichment of research and application.
△ Less
Submitted 14 April, 2023;
originally announced April 2023.
-
Unbinned Deep Learning Jet Substructure Measurement in High $Q^2$ ep collisions at HERA
Authors:
The H1 collaboration,
V. Andreev,
M. Arratia,
A. Baghdasaryan,
A. Baty,
K. Begzsuren,
A. Bolz,
V. Boudry,
G. Brandt,
D. Britzger,
A. Buniatyan,
L. Bystritskaya,
A. J. Campbell,
K. B. Cantun Avila,
K. Cerny,
V. Chekelian,
Z. Chen,
J. G. Contreras,
J. Cvach,
J. B. Dainton,
K. Daum,
A. Deshpande,
C. Diaconu,
A. Drees,
G. Eckerlin
, et al. (120 additional authors not shown)
Abstract:
The radiation pattern within high energy quark- and gluon-initiated jets (jet substructure) is used extensively as a precision probe of the strong force as well as an environment for optimizing event generators with numerous applications in high energy particle and nuclear physics. Looking at electron-proton collisions is of particular interest as many of the complications present at hadron collid…
▽ More
The radiation pattern within high energy quark- and gluon-initiated jets (jet substructure) is used extensively as a precision probe of the strong force as well as an environment for optimizing event generators with numerous applications in high energy particle and nuclear physics. Looking at electron-proton collisions is of particular interest as many of the complications present at hadron colliders are absent. A detailed study of modern jet substructure observables, jet angularities, in electron-proton collisions is presented using data recorded using the H1 detector at HERA. The measurement is unbinned and multi-dimensional, using machine learning to correct for detector effects. All of the available reconstructed object information of the respective jets is interpreted by a graph neural network, achieving superior precision on a selected set of jet angularities. Training these networks was enabled by the use of a large number of GPUs in the Perlmutter supercomputer at Berkeley Lab. The particle jets are reconstructed in the laboratory frame, using the $k_{\mathrm{T}}$ jet clustering algorithm. Results are reported at high transverse momentum transfer $Q^2>150$ GeV${}^2$, and inelasticity $0.2 < y < 0.7$. The analysis is also performed in sub-regions of $Q^2$, thus probing scale dependencies of the substructure variables. The data are compared with a variety of predictions and point towards possible improvements of such models.
△ Less
Submitted 14 September, 2023; v1 submitted 23 March, 2023;
originally announced March 2023.
-
Optimizing CAD Models with Latent Space Manipulation
Authors:
Jannes Elstner,
Raoul G. C. Schönhof,
Steffen Tauber,
Marco F Huber
Abstract:
When it comes to the optimization of CAD models in the automation domain, neural networks currently play only a minor role. Optimizing abstract features such as automation capability is challenging, since they can be very difficult to simulate, are too complex for rule-based systems, and also have little to no data available for machine-learning methods. On the other hand, image manipulation metho…
▽ More
When it comes to the optimization of CAD models in the automation domain, neural networks currently play only a minor role. Optimizing abstract features such as automation capability is challenging, since they can be very difficult to simulate, are too complex for rule-based systems, and also have little to no data available for machine-learning methods. On the other hand, image manipulation methods that can manipulate abstract features in images such as StyleCLIP have seen much success. They rely on the latent space of pretrained generative adversarial networks, and could therefore also make use of the vast amount of unlabeled CAD data. In this paper, we show that such an approach is also suitable for optimizing abstract automation-related features of CAD parts. We achieved this by extending StyleCLIP to work with CAD models in the form of voxel models, which includes using a 3D StyleGAN and a custom classifier. Finally, we demonstrate the ability of our system for the optimiziation of automation-related features by optimizing the grabability of various CAD models. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/) Peer review under the responsibility of the scientific committee of the 33rd CIRP Design Conference.
△ Less
Submitted 9 March, 2023;
originally announced March 2023.
-
Entanglement detection with trace polynomials
Authors:
Albert Rico,
Felix Huber
Abstract:
We provide a systematic method for nonlinear entanglement detection based on trace polynomial inequalities. In particular, this allows to employ multi-partite witnesses for the detection of bipartite states, and vice versa. We identify witnesses for which linear detection of an entangled state fails, but for which nonlinear detection succeeds. With the trace polynomial formulation a great variety…
▽ More
We provide a systematic method for nonlinear entanglement detection based on trace polynomial inequalities. In particular, this allows to employ multi-partite witnesses for the detection of bipartite states, and vice versa. We identify witnesses for which linear detection of an entangled state fails, but for which nonlinear detection succeeds. With the trace polynomial formulation a great variety of witnesses arise from immamant inequalities, which can be implemented in the laboratory through randomized measurements.
△ Less
Submitted 15 February, 2024; v1 submitted 14 March, 2023;
originally announced March 2023.
-
Bell inequalities with overlapping measurements
Authors:
Moisés Bermejo Morán,
Alejandro Pozas-Kerstjens,
Felix Huber
Abstract:
Which nonlocal correlations can be obtained, when a party has access to more than one subsystem? While traditionally nonlocality deals with spacelike separated parties, this question becomes important with quantum technologies that connect devices by means of small shared systems. Here we study Bell inequalities where measurements of different parties can have overlap. This allows to accommodate p…
▽ More
Which nonlocal correlations can be obtained, when a party has access to more than one subsystem? While traditionally nonlocality deals with spacelike separated parties, this question becomes important with quantum technologies that connect devices by means of small shared systems. Here we study Bell inequalities where measurements of different parties can have overlap. This allows to accommodate problems in quantum information such as the existence of quantum error correction codes in the framework of non-locality. The scenarios considered show an interesting behaviour with respect to Hilbert space dimension, overlap, and symmetry.
△ Less
Submitted 31 August, 2023; v1 submitted 3 March, 2023;
originally announced March 2023.
-
A tale of two tails: 130 years of growth-at-risk
Authors:
Martin Gächter,
Elias Hasler,
Florian Huber
Abstract:
We extend the existing growth-at-risk (GaR) literature by examining a long time period of 130 years in a time-varying parameter regression model. We identify several important insights for policymakers. First, both the level as well as the determinants of GaR vary significantly over time. Second, the stability of upside risks to GDP growth reported in earlier research is specific to the period kno…
▽ More
We extend the existing growth-at-risk (GaR) literature by examining a long time period of 130 years in a time-varying parameter regression model. We identify several important insights for policymakers. First, both the level as well as the determinants of GaR vary significantly over time. Second, the stability of upside risks to GDP growth reported in earlier research is specific to the period known as the Great Moderation, with the distribution of risks being more balanced before the 1970s. Third, the distribution of GDP growth has significantly narrowed since the end of the Bretton Woods system. Fourth, financial stress is always linked to higher downside risks, but it does not affect upside risks. Finally, other risk indicators, such as credit growth and house prices, not only drive downside risks, but also contribute to increased upside risks during boom periods. In this context, the paper also adds to the financial cycle literature by completing the picture of drivers (and risks) for both booms and recessions over time.
△ Less
Submitted 17 February, 2023;
originally announced February 2023.
-
Defect Transfer GAN: Diverse Defect Synthesis for Data Augmentation
Authors:
Ruyu Wang,
Sabrina Hoppe,
Eduardo Monari,
Marco F. Huber
Abstract:
Data-hunger and data-imbalance are two major pitfalls in many deep learning approaches. For example, on highly optimized production lines, defective samples are hardly acquired while non-defective samples come almost for free. The defects however often seem to resemble each other, e.g., scratches on different products may only differ in a few characteristics. In this work, we introduce a framework…
▽ More
Data-hunger and data-imbalance are two major pitfalls in many deep learning approaches. For example, on highly optimized production lines, defective samples are hardly acquired while non-defective samples come almost for free. The defects however often seem to resemble each other, e.g., scratches on different products may only differ in a few characteristics. In this work, we introduce a framework, Defect Transfer GAN (DT-GAN), which learns to represent defect types independent of and across various background products and yet can apply defect-specific styles to generate realistic defective images. An empirical study on the MVTec AD and two additional datasets showcase DT-GAN outperforms state-of-the-art image synthesis methods w.r.t. sample fidelity and diversity in defect generation. We further demonstrate benefits for a critical downstream task in manufacturing -- defect classification. Results show that the augmented data from DT-GAN provides consistent gains even in the few samples regime and reduces the error rate up to 51% compared to both traditional and advanced data augmentation methods.
△ Less
Submitted 16 February, 2023;
originally announced February 2023.
-
Nonlinearities in Macroeconomic Tail Risk through the Lens of Big Data Quantile Regressions
Authors:
Jan Prüser,
Florian Huber
Abstract:
Modeling and predicting extreme movements in GDP is notoriously difficult and the selection of appropriate covariates and/or possible forms of nonlinearities are key in obtaining precise forecasts. In this paper, our focus is on using large datasets in quantile regression models to forecast the conditional distribution of US GDP growth. To capture possible non-linearities, we include several nonli…
▽ More
Modeling and predicting extreme movements in GDP is notoriously difficult and the selection of appropriate covariates and/or possible forms of nonlinearities are key in obtaining precise forecasts. In this paper, our focus is on using large datasets in quantile regression models to forecast the conditional distribution of US GDP growth. To capture possible non-linearities, we include several nonlinear specifications. The resulting models will be huge dimensional and we thus rely on a set of shrinkage priors. Since Markov Chain Monte Carlo estimation becomes slow in these dimensions, we rely on fast variational Bayes approximations to the posterior distribution of the coefficients and the latent states. We find that our proposed set of models produces precise forecasts. These gains are especially pronounced in the tails. Using Gaussian processes to approximate the nonlinear component of the model further improves the good performance, in particular in the right tail.
△ Less
Submitted 22 September, 2023; v1 submitted 31 January, 2023;
originally announced January 2023.
-
Bayesian Forecasting in Economics and Finance: A Modern Review
Authors:
Gael M. Martin,
David T. Frazier,
Worapree Maneesoonthorn,
Ruben Loaiza-Maya,
Florian Huber,
Gary Koop,
John Maheu,
Didier Nibbering,
Anastasios Panagiotelis
Abstract:
The Bayesian statistical paradigm provides a principled and coherent approach to probabilistic forecasting. Uncertainty about all unknowns that characterize any forecasting problem -- model, parameters, latent states -- is able to be quantified explicitly, and factored into the forecast distribution via the process of integration or averaging. Allied with the elegance of the method, Bayesian forec…
▽ More
The Bayesian statistical paradigm provides a principled and coherent approach to probabilistic forecasting. Uncertainty about all unknowns that characterize any forecasting problem -- model, parameters, latent states -- is able to be quantified explicitly, and factored into the forecast distribution via the process of integration or averaging. Allied with the elegance of the method, Bayesian forecasting is now underpinned by the burgeoning field of Bayesian computation, which enables Bayesian forecasts to be produced for virtually any problem, no matter how large, or complex. The current state of play in Bayesian forecasting in economics and finance is the subject of this review. The aim is to provide the reader with an overview of modern approaches to the field, set in some historical context; and with sufficient computational detail given to assist the reader with implementation.
△ Less
Submitted 28 July, 2023; v1 submitted 7 December, 2022;
originally announced December 2022.
-
Giant Planet Observations in NASA's Planetary Data System
Authors:
Nancy J. Chanover,
James M. Bauer,
John J. Blalock,
Mitchell K. Gordon,
Lyle F. Huber,
Mia J. T. Mace,
Lynn D. V. Neakrase,
Matthew S. Tiscareno,
Raymond J. Walker
Abstract:
While there have been far fewer missions to the outer Solar System than to the inner Solar System, spacecraft destined for the giant planets have conducted a wide range of fundamental investigations, returning data that continues to reshape our understanding of these complex systems, sometimes decades after the data were acquired. These data are preserved and accessible from national and internati…
▽ More
While there have been far fewer missions to the outer Solar System than to the inner Solar System, spacecraft destined for the giant planets have conducted a wide range of fundamental investigations, returning data that continues to reshape our understanding of these complex systems, sometimes decades after the data were acquired. These data are preserved and accessible from national and international planetary science archives. For all NASA planetary missions and instruments the data are available from the science discipline nodes of the NASA Planetary Data System (PDS). Looking ahead, the PDS will be the primary repository for giant planets data from several upcoming missions and derived datasets, as well as supporting research conducted to aid in the interpretation of the remotely sensed giant planets data already archived in the PDS.
△ Less
Submitted 5 December, 2022;
originally announced December 2022.
-
Mixture of Decision Trees for Interpretable Machine Learning
Authors:
Simeon Brüggenjürgen,
Nina Schaaf,
Pascal Kerschke,
Marco F. Huber
Abstract:
This work introduces a novel interpretable machine learning method called Mixture of Decision Trees (MoDT). It constitutes a special case of the Mixture of Experts ensemble architecture, which utilizes a linear model as gating function and decision trees as experts. Our proposed method is ideally suited for problems that cannot be satisfactorily learned by a single decision tree, but which can alt…
▽ More
This work introduces a novel interpretable machine learning method called Mixture of Decision Trees (MoDT). It constitutes a special case of the Mixture of Experts ensemble architecture, which utilizes a linear model as gating function and decision trees as experts. Our proposed method is ideally suited for problems that cannot be satisfactorily learned by a single decision tree, but which can alternatively be divided into subproblems. Each subproblem can then be learned well from a single decision tree. Therefore, MoDT can be considered as a method that improves performance while maintaining interpretability by making each of its decisions understandable and traceable to humans.
Our work is accompanied by a Python implementation, which uses an interpretable gating function, a fast learning algorithm, and a direct interface to fine-tuned interpretable visualization methods. The experiments confirm that the implementation works and, more importantly, show the superiority of our approach compared to single decision trees and random forests of similar complexity.
△ Less
Submitted 26 November, 2022;
originally announced November 2022.
-
Photons are lying about where they have been, again
Authors:
Gregory Reznik,
Carlotta Versmold,
Jan Dziewior,
Florian Huber,
Shrobona Bagchi,
Harald Weinfurter,
Justin Dressel,
Lev Vaidman
Abstract:
Bhati and Arvind [Phys. Lett. A, 127955 (2022)] recently argued that in a specially designed experiment the timing of photon detection events demonstrates photon presence at a location at which they are not present according to the weak value approach. The alleged contradiction is resolved by a subtle interference effect resulting in anomalous sensitivity of the signal imprinted on the postselecte…
▽ More
Bhati and Arvind [Phys. Lett. A, 127955 (2022)] recently argued that in a specially designed experiment the timing of photon detection events demonstrates photon presence at a location at which they are not present according to the weak value approach. The alleged contradiction is resolved by a subtle interference effect resulting in anomalous sensitivity of the signal imprinted on the postselected photons for the interaction at this location, similarly to the case of a nested Mach-Zehnder interferometer with a Dove prism [Quant. Stud.: Mat. Found. 2, 255 (2015)]. We perform an in depth analysis of the characterization of the presence of a pre- and postselected particle at a particular location based on information imprinted on the particle itself. The theoretical results are tested by a computer simulation of the proposed experiment.
△ Less
Submitted 31 March, 2023; v1 submitted 22 November, 2022;
originally announced November 2022.
-
Refuting spectral compatibility of quantum marginals
Authors:
Felix Huber,
Nikolai Wyderka
Abstract:
The spectral variant of the quantum marginal problem asks: Given prescribed spectra for a set of quantum marginals, does there exist a compatible joint state? The main idea of this work is a symmetry-reduced semidefinite programming hierarchy for detecting incompatible spectra. The hierarchy is complete, in the sense that it detects every incompatible set of spectra. The refutations it provides ar…
▽ More
The spectral variant of the quantum marginal problem asks: Given prescribed spectra for a set of quantum marginals, does there exist a compatible joint state? The main idea of this work is a symmetry-reduced semidefinite programming hierarchy for detecting incompatible spectra. The hierarchy is complete, in the sense that it detects every incompatible set of spectra. The refutations it provides are dimension-free, certifying incompatibility in all local dimensions. The hierarchy equally applies to the sums of Hermitian matrices problem, to optimize trace polynomials on the positive cone, to the compatibility of invariants, and to certify vanishing Kronecker coefficients.
△ Less
Submitted 15 March, 2023; v1 submitted 11 November, 2022;
originally announced November 2022.
-
Synthesis and physical properties of uranium thin-film hydrides UH2 and \b{eta}-UH3
Authors:
Evgenia A. Tereshina-Chitrova,
Ladislav Havela,
Mykhaylo Paukov,
Oleksandra Koloskova,
Lukas Horak,
Milan Dopita,
Mayerling Martinez Celis,
Miroslav Cieslar,
Zbynek Soban,
Thomas Gouder,
Frank Huber
Abstract:
Formation of thin uranium hydrides films, UH2 and \b{eta}-UH3, synthesized by a reactive dc sputtering of uranium metal, was explored using variable deposition conditions. Obtained stable oxygen-free hydride films were studied by a variety of methods, both in situ (photoelectron spectroscopy - XPS), and ex-situ (x-ray diffraction - XRD, transmission electron microscopy - TEM), electrical resistivi…
▽ More
Formation of thin uranium hydrides films, UH2 and \b{eta}-UH3, synthesized by a reactive dc sputtering of uranium metal, was explored using variable deposition conditions. Obtained stable oxygen-free hydride films were studied by a variety of methods, both in situ (photoelectron spectroscopy - XPS), and ex-situ (x-ray diffraction - XRD, transmission electron microscopy - TEM), electrical resistivity, and magnetometry). Both types of hydrides are ferromagnetic, the Curie temperatures of UH2 and \b{eta}-UH3 are approx. 120 and 170 K, respectively. Ferromagnetism in the thin films is robust and does not depend on structure details while electrical resistivity data reflect disorder in both types of hydrides.
△ Less
Submitted 11 November, 2022;
originally announced November 2022.
-
Bayesian Neural Networks for Macroeconomic Analysis
Authors:
Niko Hauzenberger,
Florian Huber,
Karin Klieber,
Massimiliano Marcellino
Abstract:
Macroeconomic data is characterized by a limited number of observations (small T), many time series (big K) but also by featuring temporal dependence. Neural networks, by contrast, are designed for datasets with millions of observations and covariates. In this paper, we develop Bayesian neural networks (BNNs) that are well-suited for handling datasets commonly used for macroeconomic analysis in po…
▽ More
Macroeconomic data is characterized by a limited number of observations (small T), many time series (big K) but also by featuring temporal dependence. Neural networks, by contrast, are designed for datasets with millions of observations and covariates. In this paper, we develop Bayesian neural networks (BNNs) that are well-suited for handling datasets commonly used for macroeconomic analysis in policy institutions. Our approach avoids extensive specification searches through a novel mixture specification for the activation function that appropriately selects the form of nonlinearities. Shrinkage priors are used to prune the network and force irrelevant neurons to zero. To cope with heteroskedasticity, the BNN is augmented with a stochastic volatility model for the error term. We illustrate how the model can be used in a policy institution by first showing that our different BNNs produce precise density forecasts, typically better than those from other machine learning methods. Finally, we showcase how our model can be used to recover nonlinearities in the reaction of macroeconomic aggregates to financial shocks.
△ Less
Submitted 2 April, 2024; v1 submitted 9 November, 2022;
originally announced November 2022.
-
Kalman-Bucy-Informed Neural Network for System Identification
Authors:
Tobias Nagel,
Marco F. Huber
Abstract:
Identifying parameters in a system of nonlinear, ordinary differential equations is vital for designing a robust controller. However, if the system is stochastic in its nature or if only noisy measurements are available, standard optimization algorithms for system identification usually fail. We present a new approach that combines the recent advances in physics-informed neural networks and the we…
▽ More
Identifying parameters in a system of nonlinear, ordinary differential equations is vital for designing a robust controller. However, if the system is stochastic in its nature or if only noisy measurements are available, standard optimization algorithms for system identification usually fail. We present a new approach that combines the recent advances in physics-informed neural networks and the well-known achievements of Kalman filters in order to find parameters in a continuous-time system with noisy measurements. In doing so, our approach allows estimating the parameters together with the mean value and covariance matrix of the system's state vector. We show that the method works for complex systems by identifying the parameters of a double pendulum.
△ Less
Submitted 7 October, 2022;
originally announced October 2022.
-
Bayesian Modeling of TVP-VARs Using Regression Trees
Authors:
Niko Hauzenberger,
Florian Huber,
Gary Koop,
James Mitchell
Abstract:
In light of widespread evidence of parameter instability in macroeconomic models, many time-varying parameter (TVP) models have been proposed. This paper proposes a nonparametric TVP-VAR model using Bayesian additive regression trees (BART) that models the TVPs as an unknown function of effect modifiers. The novelty of this model arises from the fact that the law of motion driving the parameters i…
▽ More
In light of widespread evidence of parameter instability in macroeconomic models, many time-varying parameter (TVP) models have been proposed. This paper proposes a nonparametric TVP-VAR model using Bayesian additive regression trees (BART) that models the TVPs as an unknown function of effect modifiers. The novelty of this model arises from the fact that the law of motion driving the parameters is treated nonparametrically. This leads to great flexibility in the nature and extent of parameter change, both in the conditional mean and in the conditional variance. Parsimony is achieved through adopting nonparametric factor structures and use of shrinkage priors. In an application to US macroeconomic data, we illustrate the use of our model in tracking both the evolving nature of the Phillips curve and how the effects of business cycle shocks on inflation measures vary nonlinearly with changes in the effect modifiers.
△ Less
Submitted 5 May, 2023; v1 submitted 24 September, 2022;
originally announced September 2022.