Search | arXiv e-print repository

SDP bounds on quantum codes

Authors: Gerard Anglès Munné, Andrew Nemec, Felix Huber

Abstract: This paper provides a semidefinite programming hierarchy based on state polynomial optimization to determine the existence of quantum codes with given parameters. The hierarchy is complete, in the sense that if a $(\!(n,K,δ)\!)_2$ code does not exist then a level of the hierarchy is infeasible. It is not limited to stabilizer codes and thus applicable generally. While it is formally dimension-free… ▽ More This paper provides a semidefinite programming hierarchy based on state polynomial optimization to determine the existence of quantum codes with given parameters. The hierarchy is complete, in the sense that if a $(\!(n,K,δ)\!)_2$ code does not exist then a level of the hierarchy is infeasible. It is not limited to stabilizer codes and thus applicable generally. While it is formally dimension-free, we restrict it to qubit codes through quasi-Clifford algebras. We derive the quantum analog of a range of classical results: first, from an intermediate level a Lovász bound for self-dual quantum codes is recovered. Second, a symmetrization of a minor variation of this Lovász bound recovers the quantum Delsarte bound. Third, a symmetry reduction using the Terwilliger algebra leads to semidefinite programming bounds of size $O(n^4)$. With this we give an alternative proof that there is no $(\!(7,1,4)\!)_2$ quantum code, and show that $(\!(8,9,3)\!)_2$ and $(\!(10,5,4)\!)_2$ codes do not exist. △ Less

Submitted 19 August, 2024; originally announced August 2024.

Comments: 51 pages

arXiv:2408.02379 [pdf, ps, other]

The Contribution of XAI for the Safe Development and Certification of AI: An Expert-Based Analysis

Authors: Benjamin Fresz, Vincent Philipp Göbels, Safa Omri, Danilo Brajovic, Andreas Aichele, Janika Kutz, Jens Neuhüttler, Marco F. Huber

Abstract: Developing and certifying safe - or so-called trustworthy - AI has become an increasingly salient issue, especially in light of upcoming regulation such as the EU AI Act. In this context, the black-box nature of machine learning models limits the use of conventional avenues of approach towards certifying complex technical systems. As a potential solution, methods to give insights into this black-b… ▽ More Developing and certifying safe - or so-called trustworthy - AI has become an increasingly salient issue, especially in light of upcoming regulation such as the EU AI Act. In this context, the black-box nature of machine learning models limits the use of conventional avenues of approach towards certifying complex technical systems. As a potential solution, methods to give insights into this black-box - devised in the field of eXplainable AI (XAI) - could be used. In this study, the potential and shortcomings of such methods for the purpose of safe AI development and certification are discussed in 15 qualitative interviews with experts out of the areas of (X)AI and certification. We find that XAI methods can be a helpful asset for safe AI development, as they can show biases and failures of ML-models, but since certification relies on comprehensive and correct information about technical systems, their impact is expected to be limited. △ Less

Submitted 22 July, 2024; originally announced August 2024.

arXiv:2407.16349 [pdf, other]

Bayesian modelling of VAR precision matrices using stochastic block networks

Authors: Florian Huber, Gary Koop, Massimiliano Marcellino, Tobias Scheckel

Abstract: Commonly used priors for Vector Autoregressions (VARs) induce shrinkage on the autoregressive coefficients. Introducing shrinkage on the error covariance matrix is sometimes done but, in the vast majority of cases, without considering the network structure of the shocks and by placing the prior on the lower Cholesky factor of the precision matrix. In this paper, we propose a prior on the VAR error… ▽ More Commonly used priors for Vector Autoregressions (VARs) induce shrinkage on the autoregressive coefficients. Introducing shrinkage on the error covariance matrix is sometimes done but, in the vast majority of cases, without considering the network structure of the shocks and by placing the prior on the lower Cholesky factor of the precision matrix. In this paper, we propose a prior on the VAR error precision matrix directly. Our prior, which resembles a standard spike and slab prior, models variable inclusion probabilities through a stochastic block model that clusters shocks into groups. Within groups, the probability of having relations across group members is higher (inducing less sparsity) whereas relations across groups imply a lower probability that members of each group are conditionally related. We show in simulations that our approach recovers the true network structure well. Using a US macroeconomic data set, we illustrate how our approach can be used to cluster shocks together and that this feature leads to improved density forecasts. △ Less

Submitted 23 July, 2024; originally announced July 2024.

arXiv:2407.09537 [pdf, other]

ViPro: Enabling and Controlling Video Prediction for Complex Dynamical Scenarios using Procedural Knowledge

Authors: Patrick Takenaka, Johannes Maucher, Marco F. Huber

Abstract: We propose a novel architecture design for video prediction in order to utilize procedural domain knowledge directly as part of the computational graph of data-driven models. On the basis of new challenging scenarios we show that state-of-the-art video predictors struggle in complex dynamical settings, and highlight that the introduction of prior process knowledge makes their learning problem feas… ▽ More We propose a novel architecture design for video prediction in order to utilize procedural domain knowledge directly as part of the computational graph of data-driven models. On the basis of new challenging scenarios we show that state-of-the-art video predictors struggle in complex dynamical settings, and highlight that the introduction of prior process knowledge makes their learning problem feasible. Our approach results in the learning of a symbolically addressable interface between data-driven aspects in the model and our dedicated procedural knowledge module, which we utilize in downstream control tasks. △ Less

Submitted 26 June, 2024; originally announced July 2024.

Comments: accepted at NeSy2024, to be published in LNCS/LNAI

arXiv:2407.02553 [pdf, other]

Large-scale quantum reservoir learning with an analog quantum computer

Authors: Milan Kornjača, Hong-Ye Hu, Chen Zhao, Jonathan Wurtz, Phillip Weinberg, Majd Hamdan, Andrii Zhdanov, Sergio H. Cantu, Hengyun Zhou, Rodrigo Araiza Bravo, Kevin Bagnall, James I. Basham, Joseph Campo, Adam Choukri, Robert DeAngelo, Paige Frederick, David Haines, Julian Hammett, Ning Hsu, Ming-Guang Hu, Florian Huber, Paul Niklas Jepsen, Ningyuan Jia, Thomas Karolyshyn, Minho Kwon , et al. (28 additional authors not shown)

Abstract: Quantum machine learning has gained considerable attention as quantum technology advances, presenting a promising approach for efficiently learning complex data patterns. Despite this promise, most contemporary quantum methods require significant resources for variational parameter optimization and face issues with vanishing gradients, leading to experiments that are either limited in scale or lac… ▽ More Quantum machine learning has gained considerable attention as quantum technology advances, presenting a promising approach for efficiently learning complex data patterns. Despite this promise, most contemporary quantum methods require significant resources for variational parameter optimization and face issues with vanishing gradients, leading to experiments that are either limited in scale or lack potential for quantum advantage. To address this, we develop a general-purpose, gradient-free, and scalable quantum reservoir learning algorithm that harnesses the quantum dynamics of neutral-atom analog quantum computers to process data. We experimentally implement the algorithm, achieving competitive performance across various categories of machine learning tasks, including binary and multi-class classification, as well as timeseries prediction. Effective and improving learning is observed with increasing system sizes of up to 108 qubits, demonstrating the largest quantum machine learning experiment to date. We further observe comparative quantum kernel advantage in learning tasks by constructing synthetic datasets based on the geometric differences between generated quantum and classical data kernels. Our findings demonstrate the potential of utilizing classically intractable quantum correlations for effective machine learning. We expect these results to stimulate further extensions to different quantum hardware and machine learning paradigms, including early fault-tolerant hardware and generative machine learning tasks. △ Less

Submitted 2 July, 2024; originally announced July 2024.

Comments: 10 + 14 pages, 4 + 7 figures

arXiv:2406.19817 [pdf, ps, other]

Identifying Ordinary Differential Equations for Data-efficient Model-based Reinforcement Learning

Authors: Tobias Nagel, Marco F. Huber

Abstract: The identification of a mathematical dynamics model is a crucial step in the designing process of a controller. However, it is often very difficult to identify the system's governing equations, especially in complex environments that combine physical laws of different disciplines. In this paper, we present a new approach that allows identifying an ordinary differential equation by means of a physi… ▽ More The identification of a mathematical dynamics model is a crucial step in the designing process of a controller. However, it is often very difficult to identify the system's governing equations, especially in complex environments that combine physical laws of different disciplines. In this paper, we present a new approach that allows identifying an ordinary differential equation by means of a physics-informed machine learning algorithm. Our method introduces a special neural network that allows exploiting prior human knowledge to a certain degree and extends it autonomously, so that the resulting differential equations describe the system as accurately as possible. We validate the method on a Duffing oscillator with simulation data and, additionally, on a cascaded tank example with real-world data. Subsequently, we use the developed algorithm in a model-based reinforcement learning framework by alternately identifying and controlling a system to a target state. We test the performance by swinging-up an inverted pendulum on a cart. △ Less

Submitted 28 June, 2024; originally announced June 2024.

Comments: 10 pages, 6 figures, accepted at the IEEE World Congress on Computational Intelligence 2024

arXiv:2406.18220 [pdf, other]

doi 10.1109/ICCVW60793.2023.00116

Guiding Video Prediction with Explicit Procedural Knowledge

Authors: Patrick Takenaka, Johannes Maucher, Marco F. Huber

Abstract: We propose a general way to integrate procedural knowledge of a domain into deep learning models. We apply it to the case of video prediction, building on top of object-centric deep models and show that this leads to a better performance than using data-driven models alone. We develop an architecture that facilitates latent space disentanglement in order to use the integrated procedural knowledge,… ▽ More We propose a general way to integrate procedural knowledge of a domain into deep learning models. We apply it to the case of video prediction, building on top of object-centric deep models and show that this leads to a better performance than using data-driven models alone. We develop an architecture that facilitates latent space disentanglement in order to use the integrated procedural knowledge, and establish a setup that allows the model to learn the procedural interface in the latent space using the downstream task of video prediction. We contrast the performance to a state-of-the-art data-driven approach and show that problems where purely data-driven approaches struggle can be handled by using knowledge about the domain, providing an alternative to simply collecting more data. △ Less

Submitted 26 June, 2024; originally announced June 2024.

Comments: Published in 2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW)

Journal ref: 2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), Paris, France, 2023, pp. 1076-1084

arXiv:2406.10352 [pdf, ps, other]

Markovian Lifts of Stochastic Volterra Equations in Sobolev Spaces: Solution theory, an Ito Formula and Invariant Measures

Authors: Florian Huber

Abstract: We investigate Markovian lifts of stochastic Volterra equations (SVEs) with completely monotone kernels and general coefficients within a class of weighted Sobolev spaces. Our primary focus is developing a comprehensive solution theory for a class of non-local stochastic evolution equations (SEEs) encompassing these Markovian lifts. This enables us to provide conditions for the existence of invari… ▽ More We investigate Markovian lifts of stochastic Volterra equations (SVEs) with completely monotone kernels and general coefficients within a class of weighted Sobolev spaces. Our primary focus is developing a comprehensive solution theory for a class of non-local stochastic evolution equations (SEEs) encompassing these Markovian lifts. This enables us to provide conditions for the existence of invariant measures for the lifted processes and the corresponding SVE. Another key contribution is an Ito-type formula for the stochastic Volterra equations under consideration. △ Less

Submitted 18 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

arXiv:2406.02717 [pdf, other]

Reinforcement learning-based architecture search for quantum machine learning

Authors: Frederic Rapp, David A. Kreplin, Marco F. Huber, Marco Roth

Abstract: Quantum machine learning models use encoding circuits to map data into a quantum Hilbert space. While it is well known that the architecture of these circuits significantly influences core properties of the resulting model, they are often chosen heuristically. In this work, we present a novel approach using reinforcement learning techniques to generate problem-specific encoding circuits to improve… ▽ More Quantum machine learning models use encoding circuits to map data into a quantum Hilbert space. While it is well known that the architecture of these circuits significantly influences core properties of the resulting model, they are often chosen heuristically. In this work, we present a novel approach using reinforcement learning techniques to generate problem-specific encoding circuits to improve the performance of quantum machine learning models. By specifically using a model-based reinforcement learning algorithm, we reduce the number of necessary circuit evaluations during the search, providing a sample-efficient framework. In contrast to previous search algorithms, our method uses a layered circuit structure that significantly reduces the search space. Additionally, our approach can account for multiple objectives such as solution quality, hardware restrictions and circuit depth. We benchmark our tailored circuits against various reference models, including models with problem-agnostic circuits and classical models. Our results highlight the effectiveness of problem-specific encoding circuits in enhancing QML model performance. △ Less

Submitted 5 August, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

Comments: 14 pages, 5 figures, 1 table; Updated authorship, and improved RL section

arXiv:2405.21019 [pdf, other]

Quantum quench dynamics as a shortcut to adiabaticity

Authors: Alexander Lukin, Benjamin F. Schiffer, Boris Braverman, Sergio H. Cantu, Florian Huber, Alexei Bylinskii, Jesse Amato-Grill, Nishad Maskara, Madelyn Cain, Dominik S. Wild, Rhine Samajdar, Mikhail D. Lukin

Abstract: The ability to efficiently prepare ground states of quantum Hamiltonians via adiabatic protocols is typically limited by the smallest energy gap encountered during the quantum evolution. This presents a key obstacle for quantum simulation and realizations of adiabatic quantum algorithms in large systems, particularly when the adiabatic gap vanishes exponentially with system size. Using QuEra's Aqu… ▽ More The ability to efficiently prepare ground states of quantum Hamiltonians via adiabatic protocols is typically limited by the smallest energy gap encountered during the quantum evolution. This presents a key obstacle for quantum simulation and realizations of adiabatic quantum algorithms in large systems, particularly when the adiabatic gap vanishes exponentially with system size. Using QuEra's Aquila programmable quantum simulator based on Rydberg atom arrays, we experimentally demonstrate a method to circumvent such limitations. Specifically, we develop and test a "sweep-quench-sweep" quantum algorithm in which the incorporation of a quench step serves as a remedy to the diverging adiabatic timescale. These quenches introduce a macroscopic reconfiguration between states separated by an extensively large Hamming distance, akin to quantum many-body scars. Our experiments show that this approach significantly outperforms the adiabatic algorithm, illustrating that such quantum quench algorithms can provide a shortcut to adiabaticity for large-scale many-body quantum systems. △ Less

Submitted 31 May, 2024; originally announced May 2024.

arXiv:2404.17791 [pdf, other]

doi 10.1109/TRO.2024.3420799

HIPer: A Human-Inspired Scene Perception Model for Multifunctional Mobile Robots

Authors: Florenz Graf, Jochen Lindermayr, Birgit Graf, Werner Kraus, Marco F. Huber

Abstract: Taking over arbitrary tasks like humans do with a mobile service robot in open-world settings requires a holistic scene perception for decision-making and high-level control. This paper presents a human-inspired scene perception model to minimize the gap between human and robotic capabilities. The approach takes over fundamental neuroscience concepts, such as a triplet perception split into recogn… ▽ More Taking over arbitrary tasks like humans do with a mobile service robot in open-world settings requires a holistic scene perception for decision-making and high-level control. This paper presents a human-inspired scene perception model to minimize the gap between human and robotic capabilities. The approach takes over fundamental neuroscience concepts, such as a triplet perception split into recognition, knowledge representation, and knowledge interpretation. A recognition system splits the background and foreground to integrate exchangeable image-based object detectors and SLAM, a multi-layer knowledge base represents scene information in a hierarchical structure and offers interfaces for high-level control, and knowledge interpretation methods deploy spatio-temporal scene analysis and perceptual learning for self-adjustment. A single-setting ablation study is used to evaluate the impact of each component on the overall performance for a fetch-and-carry scenario in two simulated and one real-world environment. △ Less

Submitted 27 April, 2024; originally announced April 2024.

Report number: IEEE T-RO 24-0146

Journal ref: 2024 IEEE Transactions on Robotics (T-RO)

arXiv:2404.10744 [pdf, other]

Polynomial interacting particle systems and non-linear SPDEs for market capitalization curves

Authors: Christa Cuchiero, Florian Huber

Abstract: Motivated by the robustness of the capital distribution curves, we study the behavior of a certain polynomial equity market model as the number of companies goes to infinity. More precisely, we extend volatility-stabilized market models introduced by Fernholz et al. by allowing for a common noise term such that the models remain polynomial. As the number of companies approaches infinity, we show t… ▽ More Motivated by the robustness of the capital distribution curves, we study the behavior of a certain polynomial equity market model as the number of companies goes to infinity. More precisely, we extend volatility-stabilized market models introduced by Fernholz et al. by allowing for a common noise term such that the models remain polynomial. As the number of companies approaches infinity, we show that the limit of the empirical measure of the $N$-company system converges to the unique solution of a degenerate, non-linear SPDE. The obtained limit also has a representation as the conditional probability of the solution to a certain McKean-Vlasov SDE. Together with its conditional, this is again a polynomial process for which we can prove pathwise uniqueness as well as regularity properties for the marginal densities. We also provide conditional propagation of chaos results and numerical implementations of the particle system as well as its limiting equations. △ Less

Submitted 17 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

arXiv:2403.13694 [pdf, ps, other]

Overview of Publicly Available Degradation Data Sets for Tasks within Prognostics and Health Management

Authors: Fabian Mauthe, Christopher Braun, Julian Raible, Peter Zeiler, Marco F. Huber

Abstract: Central to the efficacy of prognostics and health management methods is the acquisition and analysis of degradation data, which encapsulates the evolving health condition of engineering systems over time. Degradation data serves as a rich source of information, offering invaluable insights into the underlying degradation processes, failure modes, and performance trends of engineering systems. This… ▽ More Central to the efficacy of prognostics and health management methods is the acquisition and analysis of degradation data, which encapsulates the evolving health condition of engineering systems over time. Degradation data serves as a rich source of information, offering invaluable insights into the underlying degradation processes, failure modes, and performance trends of engineering systems. This paper provides an overview of publicly available degradation data sets. △ Less

Submitted 20 March, 2024; originally announced March 2024.

arXiv:2403.10134 [pdf, other]

doi 10.1140/epjc/s10052-024-12987-0

Measurement of groomed event shape observables in deep-inelastic electron-proton scattering at HERA

Authors: The H1 collaboration, V. Andreev, M. Arratia, A. Baghdasaryan, A. Baty, K. Begzsuren, A. Bolz, V. Boudry, G. Brandt, D. Britzger, A. Buniatyan, L. Bystritskaya, A. J. Campbell, K. B. Cantun Avila, K. Cerny, V. Chekelian, Z. Chen, J. G. Contreras, J. Cvach, J. B. Dainton, K. Daum, A. Deshpande, C. Diaconu, A. Drees, G. Eckerlin , et al. (123 additional authors not shown)

Abstract: The H1 Collaboration at HERA reports the first measurement of groomed event shape observables in deep inelastic electron-proton scattering (DIS) at $\sqrt{s}=319$ GeV, using data recorded between the years 2003 and 2007 with an integrated luminosity of $351$ pb$^{-1}$. Event shapes provide incisive probes of perturbative and non-perturbative QCD. Grooming techniques have been used for jet measurem… ▽ More The H1 Collaboration at HERA reports the first measurement of groomed event shape observables in deep inelastic electron-proton scattering (DIS) at $\sqrt{s}=319$ GeV, using data recorded between the years 2003 and 2007 with an integrated luminosity of $351$ pb$^{-1}$. Event shapes provide incisive probes of perturbative and non-perturbative QCD. Grooming techniques have been used for jet measurements in hadronic collisions; this paper presents the first application of grooming to DIS data. The analysis is carried out in the Breit frame, utilizing the novel Centauro jet clustering algorithm that is designed for DIS event topologies. Events are required to have squared momentum-transfer $Q^2 > 150$ GeV$^2$ and inelasticity $ 0.2 < y < 0.7$. We report measurements of the production cross section of groomed event 1-jettiness and groomed invariant mass for several choices of grooming parameter. Monte Carlo model calculations and analytic calculations based on Soft Collinear Effective Theory are compared to the measurements. △ Less

Submitted 1 August, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

Comments: 32 pages, 17 tables, 7 figures, version as accepted by EPJ C

Report number: DESY-24-036

Journal ref: EPJC 84 (2024), 718

arXiv:2403.10109 [pdf, other]

Measurement of the 1-jettiness event shape observable in deep-inelastic electron-proton scattering at HERA

Authors: The H1 collaboration, V. Andreev, M. Arratia, A. Baghdasaryan, A. Baty, K. Begzsuren, A. Bolz, V. Boudry, G. Brandt, D. Britzger, A. Buniatyan, L. Bystritskaya, A. J. Campbell, K. B. Cantun Avila, K. Cerny, V. Chekelian, Z. Chen, J. G. Contreras, J. Cvach, J. B. Dainton, K. Daum, A. Deshpande, C. Diaconu, A. Drees, G. Eckerlin , et al. (124 additional authors not shown)

Abstract: The H1 Collaboration reports the first measurement of the 1-jettiness event shape observable $τ_1^b$ in neutral-current deep-inelastic electron-proton scattering (DIS). The observable $τ_1^b$ is equivalent to a thrust observable defined in the Breit frame. The data sample was collected at the HERA $ep$ collider in the years 2003-2007 with center-of-mass energy of $\sqrt{s}=319\,\text{GeV}$, corres… ▽ More The H1 Collaboration reports the first measurement of the 1-jettiness event shape observable $τ_1^b$ in neutral-current deep-inelastic electron-proton scattering (DIS). The observable $τ_1^b$ is equivalent to a thrust observable defined in the Breit frame. The data sample was collected at the HERA $ep$ collider in the years 2003-2007 with center-of-mass energy of $\sqrt{s}=319\,\text{GeV}$, corresponding to an integrated luminosity of $351.1\,\text{pb}^{-1}$. Triple differential cross sections are provided as a function of $τ_1^b$, event virtuality $Q^2$, and inelasticity $y$, in the kinematic region $Q^2>150\,\text{GeV}^{2}$. Single differential cross section are provided as a function of $τ_1^b$ in a limited kinematic range. Double differential cross sections are measured, in contrast, integrated over $τ_1^b$ and represent the inclusive neutral-current DIS cross section measured as a function of $Q^2$ and $y$. The data are compared to a variety of predictions and include classical and modern Monte Carlo event generators, predictions in fixed-order perturbative QCD where calculations up to $\mathcal{O}(α_s^3)$ are available for $τ_1^b$ or inclusive DIS, and resummed predictions at next-to-leading logarithmic accuracy matched to fixed order predictions at $\mathcal{O}(α_s^2)$. These comparisons reveal sensitivity of the 1-jettiness observable to QCD parton shower and resummation effects, as well as the modeling of hadronization and fragmentation. Within their range of validity, the fixed-order predictions provide a good description of the data. Monte Carlo event generators are predictive over the full measured range and hence their underlying models and parameters can be constrained by comparing to the presented data. △ Less

Submitted 15 March, 2024; originally announced March 2024.

Comments: 45 pages, 38 tables, 13 figures

Report number: DESY-24-035

arXiv:2403.08982 [pdf, other]

doi 10.1140/epjc/s10052-024-13003-1

Observation and differential cross section measurement of neutral current DIS events with an empty hemisphere in the Breit frame

Authors: The H1 collaboration, V. Andreev, M. Arratia, A. Baghdasaryan, A. Baty, K. Begzsuren, A. Bolz, V. Boudry, G. Brandt, D. Britzger, A. Buniatyan, L. Bystritskaya, A. J. Campbell, K. B. Cantun Avila, K. Cerny, V. Chekelian, Z. Chen, J. G. Contreras, J. Cvach, J. B. Dainton, K. Daum, A. Deshpande, C. Diaconu, A. Drees, G. Eckerlin , et al. (124 additional authors not shown)

Abstract: The Breit frame provides a natural frame to analyze lepton-proton scattering events. In this reference frame, the parton model hard interactions between a quark and an exchanged boson defines the coordinate system such that the struck quark is back-scattered along the virtual photon momentum direction. In Quantum Chromodynamics (QCD), higher order perturbative or non-perturbative effects can chang… ▽ More The Breit frame provides a natural frame to analyze lepton-proton scattering events. In this reference frame, the parton model hard interactions between a quark and an exchanged boson defines the coordinate system such that the struck quark is back-scattered along the virtual photon momentum direction. In Quantum Chromodynamics (QCD), higher order perturbative or non-perturbative effects can change this picture drastically. As Bjorken-$x$ decreases below one half, a rather peculiar event signature is predicted with increasing probability, where no radiation is present in one of the two Breit-frame hemispheres and all emissions are to be found in the other hemisphere. At higher orders in $α_s$ or in the presence of soft QCD effects, predictions of the rate of these events are far from trivial, and that motivates measurements with real data. We report on the first observation of the empty current hemisphere events in electron-proton collisions at the HERA collider using data recorded with the H1 detector at a center-of-mass energy of 319 GeV. The fraction of inclusive neutral-current DIS events with an empty hemisphere is found to be $0.0112 \pm 3.9\,\%_\text{stat} \pm 4.5\,\%_\text{syst} \pm 1.6\,\%_\text{mod}$ in the selected kinematic region of $150< Q^2<1500$ GeV$^2$ and inelasticity $0.14< y<0.7$. The data sample corresponds to an integrated luminosity of 351.1 pb$^{-1}$, sufficient to enable differential cross section measurements of these events. The results show an enhanced discriminating power at lower Bjorken-$x$ among different Monte Carlo event generator predictions. △ Less

Submitted 1 August, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

Comments: 13 pages, 5 figures, 2 Tables. This version as accepted for publication

Report number: DESY-24-034

Journal ref: EPJC 84 (2024), 720

arXiv:2402.16542 [pdf, other]

RoboGrind: Intuitive and Interactive Surface Treatment with Industrial Robots

Authors: Benjamin Alt, Florian Stöckl, Silvan Müller, Christopher Braun, Julian Raible, Saad Alhasan, Oliver Rettig, Lukas Ringle, Darko Katic, Rainer Jäkel, Michael Beetz, Marcus Strand, Marco F. Huber

Abstract: Surface treatment tasks such as grinding, sanding or polishing are a vital step of the value chain in many industries, but are notoriously challenging to automate. We present RoboGrind, an integrated system for the intuitive, interactive automation of surface treatment tasks with industrial robots. It combines a sophisticated 3D perception pipeline for surface scanning and automatic defect identif… ▽ More Surface treatment tasks such as grinding, sanding or polishing are a vital step of the value chain in many industries, but are notoriously challenging to automate. We present RoboGrind, an integrated system for the intuitive, interactive automation of surface treatment tasks with industrial robots. It combines a sophisticated 3D perception pipeline for surface scanning and automatic defect identification, an interactive voice-controlled wizard system for the AI-assisted bootstrapping and parameterization of robot programs, and an automatic planning and execution pipeline for force-controlled robotic surface treatment. RoboGrind is evaluated both under laboratory and real-world conditions in the context of refabricating fiberglass wind turbine blades. △ Less

Submitted 27 February, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

Comments: 7 pages, 6 figures, accepted to the 2024 IEEE International Conference on Robotics and Automation (ICRA 2024)

MSC Class: 68T40 ACM Class: I.2.6; I.2.2; I.2.9

arXiv:2402.05557 [pdf, ps, other]

On Convolutional Vision Transformers for Yield Prediction

Authors: Alvin Inderka, Florian Huber, Volker Steinhage

Abstract: While a variety of methods offer good yield prediction on histogrammed remote sensing data, vision Transformers are only sparsely represented in the literature. The Convolution vision Transformer (CvT) is being tested to evaluate vision Transformers that are currently achieving state-of-the-art results in many other vision tasks. CvT combines some of the advantages of convolution with the advantag… ▽ More While a variety of methods offer good yield prediction on histogrammed remote sensing data, vision Transformers are only sparsely represented in the literature. The Convolution vision Transformer (CvT) is being tested to evaluate vision Transformers that are currently achieving state-of-the-art results in many other vision tasks. CvT combines some of the advantages of convolution with the advantages of dynamic attention and global context fusion of Transformers. It performs worse than widely tested methods such as XGBoost and CNNs, but shows that Transformers have potential to improve yield prediction. △ Less

Submitted 8 February, 2024; originally announced February 2024.

arXiv:2401.10054 [pdf, other]

Nowcasting economic activity in European regions using a mixed-frequency dynamic factor model

Authors: Luca Barbaglia, Lorenzo Frattarolo, Niko Hauzenberger, Dominik Hirschbuehl, Florian Huber, Luca Onorante, Michael Pfarrhofer, Luca Tiozzo Pezzoli

Abstract: Timely information about the state of regional economies can be essential for planning, implementing and evaluating locally targeted economic policies. However, European regional accounts for output are published at an annual frequency and with a two-year delay. To obtain robust and more timely measures in a computationally efficient manner, we propose a mixed-frequency dynamic factor model that a… ▽ More Timely information about the state of regional economies can be essential for planning, implementing and evaluating locally targeted economic policies. However, European regional accounts for output are published at an annual frequency and with a two-year delay. To obtain robust and more timely measures in a computationally efficient manner, we propose a mixed-frequency dynamic factor model that accounts for national information to produce high-frequency estimates of the regional gross value added (GVA). We show that our model produces reliable nowcasts of GVA in 162 regions across 12 European countries. △ Less

Submitted 18 January, 2024; originally announced January 2024.

Comments: JEL: C22, C53, R11; keywords: factor models, mixed-frequency, nowcasting, regional data

arXiv:2401.08087 [pdf, other]

Probing quantum floating phases in Rydberg atom arrays

Authors: Jin Zhang, Sergio H. Cantú, Fangli Liu, Alexei Bylinskii, Boris Braverman, Florian Huber, Jesse Amato-Grill, Alexander Lukin, Nathan Gemelke, Alexander Keesling, Sheng-Tao Wang, Y. Meurice, S. -W. Tsai

Abstract: The floating phase, a critical incommensurate phase, has been theoretically predicted as a potential intermediate phase between crystalline ordered and disordered phases. In this study, we investigate the different quantum phases that arise in ladder arrays comprising up to 92 neutral-atom qubits and experimentally observe the emergence of the quantum floating phase. We analyze the site-resolved R… ▽ More The floating phase, a critical incommensurate phase, has been theoretically predicted as a potential intermediate phase between crystalline ordered and disordered phases. In this study, we investigate the different quantum phases that arise in ladder arrays comprising up to 92 neutral-atom qubits and experimentally observe the emergence of the quantum floating phase. We analyze the site-resolved Rydberg state densities and the distribution of state occurrences. The site-resolved measurement reveals the formation of domain walls within the commensurate ordered phase, which subsequently proliferate and give rise to the floating phase with incommensurate quasi-long-range order. By analyzing the Fourier spectra of the Rydberg density-density correlations, we observe clear signatures of the incommensurate wave order of the floating phase. Furthermore, as the experimental system sizes increase, we show that the wave vectors approach a continuum of values incommensurate with the lattice. Our work motivates future studies to further explore the nature of commensurate-incommensurate phase transitions and their non-equilibrium physics. △ Less

Submitted 15 January, 2024; originally announced January 2024.

Comments: 27 pages, 21 figures

arXiv:2312.08528 [pdf, other]

auto-sktime: Automated Time Series Forecasting

Authors: Marc-André Zöller, Marius Lindauer, Marco F. Huber

Abstract: In today's data-driven landscape, time series forecasting is pivotal in decision-making across various sectors. Yet, the proliferation of more diverse time series data, coupled with the expanding landscape of available forecasting methods, poses significant challenges for forecasters. To meet the growing demand for efficient forecasting, we introduce auto-sktime, a novel framework for automated ti… ▽ More In today's data-driven landscape, time series forecasting is pivotal in decision-making across various sectors. Yet, the proliferation of more diverse time series data, coupled with the expanding landscape of available forecasting methods, poses significant challenges for forecasters. To meet the growing demand for efficient forecasting, we introduce auto-sktime, a novel framework for automated time series forecasting. The proposed framework uses the power of automated machine learning (AutoML) techniques to automate the creation of the entire forecasting pipeline. The framework employs Bayesian optimization, to automatically construct pipelines from statistical, machine learning (ML) and deep neural network (DNN) models. Furthermore, we propose three essential improvements to adapt AutoML to time series data. First, pipeline templates to account for the different supported forecasting models. Second, a novel warm-starting technique to start the optimization from prior optimization runs. Third, we adapt multi-fidelity optimizations to make them applicable to a search space containing statistical, ML and DNN models. Experimental results on 64 diverse real-world time series datasets demonstrate the effectiveness and efficiency of the framework, outperforming traditional methods while requiring minimal human involvement. △ Less

Submitted 30 April, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

Comments: Accepted at LION18

arXiv:2312.01881 [pdf, other]

Bayesian Nonlinear Regression using Sums of Simple Functions

Authors: Florian Huber

Abstract: This paper proposes a new Bayesian machine learning model that can be applied to large datasets arising in macroeconomics. Our framework sums over many simple two-component location mixtures. The transition between components is determined by a logistic function that depends on a single threshold variable and two hyperparameters. Each of these individual models only accounts for a minor portion of… ▽ More This paper proposes a new Bayesian machine learning model that can be applied to large datasets arising in macroeconomics. Our framework sums over many simple two-component location mixtures. The transition between components is determined by a logistic function that depends on a single threshold variable and two hyperparameters. Each of these individual models only accounts for a minor portion of the variation in the endogenous variables. But many of them are capable of capturing arbitrary nonlinear conditional mean relations. Conjugate priors enable fast and efficient inference. In simulations, we show that our approach produces accurate point and density forecasts. In a real-data exercise, we forecast US macroeconomic aggregates and consider the nonlinear effects of financial shocks in a large-scale nonlinear VAR. △ Less

Submitted 4 December, 2023; originally announced December 2023.

arXiv:2311.12671 [pdf, other]

Predictive Density Combination Using a Tree-Based Synthesis Function

Authors: Tony Chernis, Niko Hauzenberger, Florian Huber, Gary Koop, James Mitchell

Abstract: Bayesian predictive synthesis (BPS) provides a method for combining multiple predictive distributions based on agent/expert opinion analysis theory and encompasses a range of existing density forecast pooling methods. The key ingredient in BPS is a ``synthesis'' function. This is typically specified parametrically as a dynamic linear regression. In this paper, we develop a nonparametric treatment… ▽ More Bayesian predictive synthesis (BPS) provides a method for combining multiple predictive distributions based on agent/expert opinion analysis theory and encompasses a range of existing density forecast pooling methods. The key ingredient in BPS is a ``synthesis'' function. This is typically specified parametrically as a dynamic linear regression. In this paper, we develop a nonparametric treatment of the synthesis function using regression trees. We show the advantages of our tree-based approach in two macroeconomic forecasting applications. The first uses density forecasts for GDP growth from the euro area's Survey of Professional Forecasters. The second combines density forecasts of US inflation produced by many regression models involving different predictors. Both applications demonstrate the benefits -- in terms of improved forecast accuracy and interpretability -- of modeling the synthesis function nonparametrically. △ Less

Submitted 21 November, 2023; originally announced November 2023.

arXiv:2311.03959 [pdf, other]

Improving the Effectiveness of Deep Generative Data

Authors: Ruyu Wang, Sabrina Schmedding, Marco F. Huber

Abstract: Recent deep generative models (DGMs) such as generative adversarial networks (GANs) and diffusion probabilistic models (DPMs) have shown their impressive ability in generating high-fidelity photorealistic images. Although looking appealing to human eyes, training a model on purely synthetic images for downstream image processing tasks like image classification often results in an undesired perform… ▽ More Recent deep generative models (DGMs) such as generative adversarial networks (GANs) and diffusion probabilistic models (DPMs) have shown their impressive ability in generating high-fidelity photorealistic images. Although looking appealing to human eyes, training a model on purely synthetic images for downstream image processing tasks like image classification often results in an undesired performance drop compared to training on real data. Previous works have demonstrated that enhancing a real dataset with synthetic images from DGMs can be beneficial. However, the improvements were subjected to certain circumstances and yet were not comparable to adding the same number of real images. In this work, we propose a new taxonomy to describe factors contributing to this commonly observed phenomenon and investigate it on the popular CIFAR-10 dataset. We hypothesize that the Content Gap accounts for a large portion of the performance drop when using synthetic images from DGM and propose strategies to better utilize them in downstream tasks. Extensive experiments on multiple datasets showcase that our method outperforms baselines on downstream classification tasks both in case of training on synthetic only (Synthetic-to-Real) and training on a mix of real and synthetic data (Data Augmentation), particularly in the data-scarce scenario. △ Less

Submitted 8 November, 2023; v1 submitted 7 November, 2023; originally announced November 2023.

Comments: Accepted by WACV2024

arXiv:2311.03525 [pdf, other]

Comment on "Photons can tell 'contradictory' answer about where they have been''

Authors: Gregory Reznik, Carlotta Versmold, Jan Dziewior, Florian Huber, Harald Weinfurter, Justin Dressel, Lev Vaidman

Abstract: Yuan and Feng [Eur. Phys. J. Plus 138:70, 2023] recently proposed a modification of the nested Mach-Zehnder interferometer experiment performed by Danan et al. [Phys. Rev. Lett. 111:240402, 2013] and argued that photons give "contradictory" answers about where they have been, when traces are locally imprinted on them in different ways. They concluded that their results are comprehensible from what… ▽ More Yuan and Feng [Eur. Phys. J. Plus 138:70, 2023] recently proposed a modification of the nested Mach-Zehnder interferometer experiment performed by Danan et al. [Phys. Rev. Lett. 111:240402, 2013] and argued that photons give "contradictory" answers about where they have been, when traces are locally imprinted on them in different ways. They concluded that their results are comprehensible from what they call the "three-path interference viewpoint", but difficult to explain from the "discontinuous trajectory" viewpoint advocated by Danan et al. We argue that the weak trace approach (the basis of the "discontinuous trajectory" viewpoint) provides a consistent explanation of the Yuan-Feng experiment. The contradictory messages of the photons are just another example of photons lying about where they have been when the experimental method of Danan et al. is applied in an inappropriate setup. △ Less

Submitted 6 November, 2023; originally announced November 2023.

arXiv:2310.00612 [pdf, other]

doi 10.1103/PhysRevLett.132.200202

Uncertainty relations from state polynomial optimization

Authors: Moisés Bermejo Morán, Felix Huber

Abstract: Uncertainty relations are a fundamental feature of quantum mechanics. How can these relations be found systematically? Here we develop a semidefinite programming hierarchy for additive uncertainty relations in the variances of non-commuting observables. Our hierarchy is built on the state polynomial optimization framework, also known as scalar extension. The hierarchy is complete, in the sense tha… ▽ More Uncertainty relations are a fundamental feature of quantum mechanics. How can these relations be found systematically? Here we develop a semidefinite programming hierarchy for additive uncertainty relations in the variances of non-commuting observables. Our hierarchy is built on the state polynomial optimization framework, also known as scalar extension. The hierarchy is complete, in the sense that it converges to tight uncertainty relations. We improve upon upper bounds for all 1292 additive uncertainty relations on up to nine operators for which a tight bound is not known. The bounds are dimension-free and depend entirely on the algebraic relations among the operators. The techniques apply to a range of scenarios, including Pauli, Heisenberg-Weyl, and fermionic operators, and generalize to higher order moments and multiplicative uncertainty relations. △ Less

Submitted 6 August, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

Comments: 9 pages, 4 tables. Accepted version

Journal ref: Physical Review Letters 132, 200202 (2024)

arXiv:2307.11525 [pdf, other]

Model Reporting for Certifiable AI: A Proposal from Merging EU Regulation into AI Development

Authors: Danilo Brajovic, Niclas Renner, Vincent Philipp Goebels, Philipp Wagner, Benjamin Fresz, Martin Biller, Mara Klaeb, Janika Kutz, Jens Neuhuettler, Marco F. Huber

Abstract: Despite large progress in Explainable and Safe AI, practitioners suffer from a lack of regulation and standards for AI safety. In this work we merge recent regulation efforts by the European Union and first proposals for AI guidelines with recent trends in research: data and model cards. We propose the use of standardized cards to document AI applications throughout the development process. Our ma… ▽ More Despite large progress in Explainable and Safe AI, practitioners suffer from a lack of regulation and standards for AI safety. In this work we merge recent regulation efforts by the European Union and first proposals for AI guidelines with recent trends in research: data and model cards. We propose the use of standardized cards to document AI applications throughout the development process. Our main contribution is the introduction of use-case and operation cards, along with updates for data and model cards to cope with regulatory requirements. We reference both recent research as well as the source of the regulation in our cards and provide references to additional support material and toolboxes whenever possible. The goal is to design cards that help practitioners develop safe AI systems throughout the development process, while enabling efficient third-party auditing of AI applications, being easy to understand, and building trust in the system. Our work incorporates insights from interviews with certification experts as well as developers and individuals working with the developed AI applications. △ Less

Submitted 21 July, 2023; originally announced July 2023.

Comments: 54 pages, 1 figure, to be submitted

arXiv:2307.03007 [pdf, other]

doi 10.1109/SMC53992.2023.10394319

Self-supervised Optimization of Hand Pose Estimation using Anatomical Features and Iterative Learning

Authors: Christian Jauch, Timo Leitritz, Marco F. Huber

Abstract: Manual assembly workers face increasing complexity in their work. Human-centered assistance systems could help, but object recognition as an enabling technology hinders sophisticated human-centered design of these systems. At the same time, activity recognition based on hand poses suffers from poor pose estimation in complex usage scenarios, such as wearing gloves. This paper presents a self-super… ▽ More Manual assembly workers face increasing complexity in their work. Human-centered assistance systems could help, but object recognition as an enabling technology hinders sophisticated human-centered design of these systems. At the same time, activity recognition based on hand poses suffers from poor pose estimation in complex usage scenarios, such as wearing gloves. This paper presents a self-supervised pipeline for adapting hand pose estimation to specific use cases with minimal human interaction. This enables cheap and robust hand posebased activity recognition. The pipeline consists of a general machine learning model for hand pose estimation trained on a generalized dataset, spatial and temporal filtering to account for anatomical constraints of the hand, and a retraining step to improve the model. Different parameter combinations are evaluated on a publicly available and annotated dataset. The best parameter and model combination is then applied to unlabelled videos from a manual assembly scenario. The effectiveness of the pipeline is demonstrated by training an activity recognition as a downstream task in the manual assembly scenario. △ Less

Submitted 6 July, 2023; originally announced July 2023.

Comments: Manuscript accepted at IEEE SMC 2023

Journal ref: 2023 IEEE International Conference on Systems, Man, and Cybernetics (SMC)

arXiv:2306.12215 [pdf, other]

doi 10.1109/SMC53992.2023.10394031

Automated Machine Learning for Remaining Useful Life Predictions

Authors: Marc-André Zöller, Fabian Mauthe, Peter Zeiler, Marius Lindauer, Marco F. Huber

Abstract: Being able to predict the remaining useful life (RUL) of an engineering system is an important task in prognostics and health management. Recently, data-driven approaches to RUL predictions are becoming prevalent over model-based approaches since no underlying physical knowledge of the engineering system is required. Yet, this just replaces required expertise of the underlying physics with machine… ▽ More Being able to predict the remaining useful life (RUL) of an engineering system is an important task in prognostics and health management. Recently, data-driven approaches to RUL predictions are becoming prevalent over model-based approaches since no underlying physical knowledge of the engineering system is required. Yet, this just replaces required expertise of the underlying physics with machine learning (ML) expertise, which is often also not available. Automated machine learning (AutoML) promises to build end-to-end ML pipelines automatically enabling domain experts without ML expertise to create their own models. This paper introduces AutoRUL, an AutoML-driven end-to-end approach for automatic RUL predictions. AutoRUL combines fine-tuned standard regression methods to an ensemble with high predictive power. By evaluating the proposed method on eight real-world and synthetic datasets against state-of-the-art hand-crafted models, we show that AutoML provides a viable alternative to hand-crafted data-driven RUL predictions. Consequently, creating RUL predictions can be made more accessible for domain experts using AutoML by eliminating ML expertise from data-driven model construction. △ Less

Submitted 21 June, 2023; originally announced June 2023.

Comments: Manuscript accepted at IEEE SMC 2023

arXiv:2306.11727 [pdf]

Aquila: QuEra's 256-qubit neutral-atom quantum computer

Authors: Jonathan Wurtz, Alexei Bylinskii, Boris Braverman, Jesse Amato-Grill, Sergio H. Cantu, Florian Huber, Alexander Lukin, Fangli Liu, Phillip Weinberg, John Long, Sheng-Tao Wang, Nathan Gemelke, Alexander Keesling

Abstract: The neutral-atom quantum computer "Aquila" is QuEra's latest device available through the Braket cloud service on Amazon Web Services (AWS). Aquila is a "field-programmable qubit array" (FPQA) operated as an analog Hamiltonian simulator on a user-configurable architecture, executing programmable coherent quantum dynamics on up to 256 neutral-atom qubits. This whitepaper serves as an overview of Aq… ▽ More The neutral-atom quantum computer "Aquila" is QuEra's latest device available through the Braket cloud service on Amazon Web Services (AWS). Aquila is a "field-programmable qubit array" (FPQA) operated as an analog Hamiltonian simulator on a user-configurable architecture, executing programmable coherent quantum dynamics on up to 256 neutral-atom qubits. This whitepaper serves as an overview of Aquila and its capabilities: how it works under the hood, key performance benchmarks, and examples that demonstrate some quintessential applications. This includes an overview of neutral-atom quantum computing, as well as five examples of increasing complexity from single-qubit dynamics to combinatorial optimization, implemented on Aquila. This whitepaper is intended for readers who are interested in learning more about neutral-atom quantum computing, as a guide for those who are ready to start using Aquila, and as a reference point for its performance as an analog quantum computer. △ Less

Submitted 20 June, 2023; originally announced June 2023.

arXiv:2305.16827 [pdf, other]

Fast and Order-invariant Inference in Bayesian VARs with Non-Parametric Shocks

Authors: Florian Huber, Gary Koop

Abstract: The shocks which hit macroeconomic models such as Vector Autoregressions (VARs) have the potential to be non-Gaussian, exhibiting asymmetries and fat tails. This consideration motivates the VAR developed in this paper which uses a Dirichlet process mixture (DPM) to model the shocks. However, we do not follow the obvious strategy of simply modeling the VAR errors with a DPM since this would lead to… ▽ More The shocks which hit macroeconomic models such as Vector Autoregressions (VARs) have the potential to be non-Gaussian, exhibiting asymmetries and fat tails. This consideration motivates the VAR developed in this paper which uses a Dirichlet process mixture (DPM) to model the shocks. However, we do not follow the obvious strategy of simply modeling the VAR errors with a DPM since this would lead to computationally infeasible Bayesian inference in larger VARs and potentially a sensitivity to the way the variables are ordered in the VAR. Instead we develop a particular additive error structure inspired by Bayesian nonparametric treatments of random effects in panel data models. We show that this leads to a model which allows for computationally fast and order-invariant inference in large VARs with nonparametric shocks. Our empirical results with nonparametric VARs of various dimensions shows that nonparametric treatment of the VAR errors is particularly useful in periods such as the financial crisis and the pandemic. △ Less

Submitted 26 May, 2023; originally announced May 2023.

arXiv:2305.12365 [pdf, other]

Towards Optimal Energy Management Strategy for Hybrid Electric Vehicle with Reinforcement Learning

Authors: Xinyang Wu, Elisabeth Wedernikow, Christof Nitsche, Marco F. Huber

Abstract: In recent years, the development of Artificial Intelligence (AI) has shown tremendous potential in diverse areas. Among them, reinforcement learning (RL) has proven to be an effective solution for learning intelligent control strategies. As an inevitable trend for mitigating climate change, hybrid electric vehicles (HEVs) rely on efficient energy management strategies (EMS) to minimize energy cons… ▽ More In recent years, the development of Artificial Intelligence (AI) has shown tremendous potential in diverse areas. Among them, reinforcement learning (RL) has proven to be an effective solution for learning intelligent control strategies. As an inevitable trend for mitigating climate change, hybrid electric vehicles (HEVs) rely on efficient energy management strategies (EMS) to minimize energy consumption. Many researchers have employed RL to learn optimal EMS for specific vehicle models. However, most of these models tend to be complex and proprietary, making them unsuitable for broad applicability. This paper presents a novel framework, in which we implement and integrate RL-based EMS with the open-source vehicle simulation tool called FASTSim. The learned RL-based EMSs are evaluated on various vehicle models using different test drive cycles and prove to be effective in improving energy efficiency. △ Less

Submitted 21 May, 2023; originally announced May 2023.

Comments: Accepted at the 35th IEEE Intelligent Vehicles Symposium (IV 2023)

arXiv:2304.07856 [pdf, other]

Coarsened Bayesian VARs -- Correcting BVARs for Incorrect Specification

Authors: Florian Huber, Massimiliano Marcellino

Abstract: Model mis-specification in multivariate econometric models can strongly influence quantities of interest such as structural parameters, forecast distributions or responses to structural shocks, even more so if higher-order forecasts or responses are considered, due to parameter convolution. We propose a simple method for addressing these specification issues in the context of Bayesian VARs. Our me… ▽ More Model mis-specification in multivariate econometric models can strongly influence quantities of interest such as structural parameters, forecast distributions or responses to structural shocks, even more so if higher-order forecasts or responses are considered, due to parameter convolution. We propose a simple method for addressing these specification issues in the context of Bayesian VARs. Our method, called coarsened Bayesian VARs (cBVARs), replaces the exact likelihood with a coarsened likelihood that takes into account that the model might be mis-specified along important but unknown dimensions. Coupled with a conjugate prior, this results in a computationally simple model. As opposed to more flexible specifications, our approach avoids overfitting, is simple to implement and estimation is fast. The resulting cBVAR performs well in simulations for several types of mis-specification. Applied to US data, cBVARs improve point and density forecasts compared to standard BVARs, and lead to milder but more persistent negative effects of uncertainty shocks on output. △ Less

Submitted 26 May, 2023; v1 submitted 16 April, 2023; originally announced April 2023.

arXiv:2304.07111 [pdf, other]

Grouping Shapley Value Feature Importances of Random Forests for explainable Yield Prediction

Authors: Florian Huber, Hannes Engler, Anna Kicherer, Katja Herzog, Reinhard Töpfer, Volker Steinhage

Abstract: Explainability in yield prediction helps us fully explore the potential of machine learning models that are already able to achieve high accuracy for a variety of yield prediction scenarios. The data included for the prediction of yields are intricate and the models are often difficult to understand. However, understanding the models can be simplified by using natural groupings of the input featur… ▽ More Explainability in yield prediction helps us fully explore the potential of machine learning models that are already able to achieve high accuracy for a variety of yield prediction scenarios. The data included for the prediction of yields are intricate and the models are often difficult to understand. However, understanding the models can be simplified by using natural groupings of the input features. Grouping can be achieved, for example, by the time the features are captured or by the sensor used to do so. The state-of-the-art for interpreting machine learning models is currently defined by the game-theoretic approach of Shapley values. To handle groups of features, the calculated Shapley values are typically added together, ignoring the theoretical limitations of this approach. We explain the concept of Shapley values directly computed for predefined groups of features and introduce an algorithm to compute them efficiently on tree structures. We provide a blueprint for designing swarm plots that combine many local explanations for global understanding. Extensive evaluation of two different yield prediction problems shows the worth of our approach and demonstrates how we can enable a better understanding of yield prediction models in the future, ultimately leading to mutual enrichment of research and application. △ Less

Submitted 14 April, 2023; originally announced April 2023.

Comments: Preprint accepted at IntelliSys 2023

arXiv:2303.13620 [pdf, other]

doi 10.1016/j.physletb.2023.138101

Unbinned Deep Learning Jet Substructure Measurement in High $Q^2$ ep collisions at HERA

Authors: The H1 collaboration, V. Andreev, M. Arratia, A. Baghdasaryan, A. Baty, K. Begzsuren, A. Bolz, V. Boudry, G. Brandt, D. Britzger, A. Buniatyan, L. Bystritskaya, A. J. Campbell, K. B. Cantun Avila, K. Cerny, V. Chekelian, Z. Chen, J. G. Contreras, J. Cvach, J. B. Dainton, K. Daum, A. Deshpande, C. Diaconu, A. Drees, G. Eckerlin , et al. (120 additional authors not shown)

Abstract: The radiation pattern within high energy quark- and gluon-initiated jets (jet substructure) is used extensively as a precision probe of the strong force as well as an environment for optimizing event generators with numerous applications in high energy particle and nuclear physics. Looking at electron-proton collisions is of particular interest as many of the complications present at hadron collid… ▽ More The radiation pattern within high energy quark- and gluon-initiated jets (jet substructure) is used extensively as a precision probe of the strong force as well as an environment for optimizing event generators with numerous applications in high energy particle and nuclear physics. Looking at electron-proton collisions is of particular interest as many of the complications present at hadron colliders are absent. A detailed study of modern jet substructure observables, jet angularities, in electron-proton collisions is presented using data recorded using the H1 detector at HERA. The measurement is unbinned and multi-dimensional, using machine learning to correct for detector effects. All of the available reconstructed object information of the respective jets is interpreted by a graph neural network, achieving superior precision on a selected set of jet angularities. Training these networks was enabled by the use of a large number of GPUs in the Perlmutter supercomputer at Berkeley Lab. The particle jets are reconstructed in the laboratory frame, using the $k_{\mathrm{T}}$ jet clustering algorithm. Results are reported at high transverse momentum transfer $Q^2>150$ GeV${}^2$, and inelasticity $0.2 < y < 0.7$. The analysis is also performed in sub-regions of $Q^2$, thus probing scale dependencies of the substructure variables. The data are compared with a variety of predictions and point towards possible improvements of such models. △ Less

Submitted 14 September, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

Comments: 25 pages, 10 figures, 8 tables, version accepted by Physics Letters B

Report number: DESY-23-034

Journal ref: PLB 844 (2023) 138101

arXiv:2303.12739 [pdf, other]

Optimizing CAD Models with Latent Space Manipulation

Authors: Jannes Elstner, Raoul G. C. Schönhof, Steffen Tauber, Marco F Huber

Abstract: When it comes to the optimization of CAD models in the automation domain, neural networks currently play only a minor role. Optimizing abstract features such as automation capability is challenging, since they can be very difficult to simulate, are too complex for rule-based systems, and also have little to no data available for machine-learning methods. On the other hand, image manipulation metho… ▽ More When it comes to the optimization of CAD models in the automation domain, neural networks currently play only a minor role. Optimizing abstract features such as automation capability is challenging, since they can be very difficult to simulate, are too complex for rule-based systems, and also have little to no data available for machine-learning methods. On the other hand, image manipulation methods that can manipulate abstract features in images such as StyleCLIP have seen much success. They rely on the latent space of pretrained generative adversarial networks, and could therefore also make use of the vast amount of unlabeled CAD data. In this paper, we show that such an approach is also suitable for optimizing abstract automation-related features of CAD parts. We achieved this by extending StyleCLIP to work with CAD models in the form of voxel models, which includes using a 3D StyleGAN and a custom classifier. Finally, we demonstrate the ability of our system for the optimiziation of automation-related features by optimizing the grabability of various CAD models. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/) Peer review under the responsibility of the scientific committee of the 33rd CIRP Design Conference. △ Less

Submitted 9 March, 2023; originally announced March 2023.

arXiv:2303.07761 [pdf, other]

doi 10.1103/PhysRevLett.132.070202

Entanglement detection with trace polynomials

Authors: Albert Rico, Felix Huber

Abstract: We provide a systematic method for nonlinear entanglement detection based on trace polynomial inequalities. In particular, this allows to employ multi-partite witnesses for the detection of bipartite states, and vice versa. We identify witnesses for which linear detection of an entangled state fails, but for which nonlinear detection succeeds. With the trace polynomial formulation a great variety… ▽ More We provide a systematic method for nonlinear entanglement detection based on trace polynomial inequalities. In particular, this allows to employ multi-partite witnesses for the detection of bipartite states, and vice versa. We identify witnesses for which linear detection of an entangled state fails, but for which nonlinear detection succeeds. With the trace polynomial formulation a great variety of witnesses arise from immamant inequalities, which can be implemented in the laboratory through randomized measurements. △ Less

Submitted 15 February, 2024; v1 submitted 14 March, 2023; originally announced March 2023.

Journal ref: Physical Review Letters 132, 070202 (2024)

arXiv:2303.02127 [pdf, other]

doi 10.1103/PhysRevLett.131.080201

Bell inequalities with overlapping measurements

Authors: Moisés Bermejo Morán, Alejandro Pozas-Kerstjens, Felix Huber

Abstract: Which nonlocal correlations can be obtained, when a party has access to more than one subsystem? While traditionally nonlocality deals with spacelike separated parties, this question becomes important with quantum technologies that connect devices by means of small shared systems. Here we study Bell inequalities where measurements of different parties can have overlap. This allows to accommodate p… ▽ More Which nonlocal correlations can be obtained, when a party has access to more than one subsystem? While traditionally nonlocality deals with spacelike separated parties, this question becomes important with quantum technologies that connect devices by means of small shared systems. Here we study Bell inequalities where measurements of different parties can have overlap. This allows to accommodate problems in quantum information such as the existence of quantum error correction codes in the framework of non-locality. The scenarios considered show an interesting behaviour with respect to Hilbert space dimension, overlap, and symmetry. △ Less

Submitted 31 August, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

Comments: 9 pages, 3 figures. Accepted version

Journal ref: Phys. Rev. Lett. 131, 080201 (2023)

arXiv:2302.08920 [pdf, other]

A tale of two tails: 130 years of growth-at-risk

Authors: Martin Gächter, Elias Hasler, Florian Huber

Abstract: We extend the existing growth-at-risk (GaR) literature by examining a long time period of 130 years in a time-varying parameter regression model. We identify several important insights for policymakers. First, both the level as well as the determinants of GaR vary significantly over time. Second, the stability of upside risks to GDP growth reported in earlier research is specific to the period kno… ▽ More We extend the existing growth-at-risk (GaR) literature by examining a long time period of 130 years in a time-varying parameter regression model. We identify several important insights for policymakers. First, both the level as well as the determinants of GaR vary significantly over time. Second, the stability of upside risks to GDP growth reported in earlier research is specific to the period known as the Great Moderation, with the distribution of risks being more balanced before the 1970s. Third, the distribution of GDP growth has significantly narrowed since the end of the Bretton Woods system. Fourth, financial stress is always linked to higher downside risks, but it does not affect upside risks. Finally, other risk indicators, such as credit growth and house prices, not only drive downside risks, but also contribute to increased upside risks during boom periods. In this context, the paper also adds to the financial cycle literature by completing the picture of drivers (and risks) for both booms and recessions over time. △ Less

Submitted 17 February, 2023; originally announced February 2023.

arXiv:2302.08366 [pdf, other]

Defect Transfer GAN: Diverse Defect Synthesis for Data Augmentation

Authors: Ruyu Wang, Sabrina Hoppe, Eduardo Monari, Marco F. Huber

Abstract: Data-hunger and data-imbalance are two major pitfalls in many deep learning approaches. For example, on highly optimized production lines, defective samples are hardly acquired while non-defective samples come almost for free. The defects however often seem to resemble each other, e.g., scratches on different products may only differ in a few characteristics. In this work, we introduce a framework… ▽ More Data-hunger and data-imbalance are two major pitfalls in many deep learning approaches. For example, on highly optimized production lines, defective samples are hardly acquired while non-defective samples come almost for free. The defects however often seem to resemble each other, e.g., scratches on different products may only differ in a few characteristics. In this work, we introduce a framework, Defect Transfer GAN (DT-GAN), which learns to represent defect types independent of and across various background products and yet can apply defect-specific styles to generate realistic defective images. An empirical study on the MVTec AD and two additional datasets showcase DT-GAN outperforms state-of-the-art image synthesis methods w.r.t. sample fidelity and diversity in defect generation. We further demonstrate benefits for a critical downstream task in manufacturing -- defect classification. Results show that the augmented data from DT-GAN provides consistent gains even in the few samples regime and reduces the error rate up to 51% compared to both traditional and advanced data augmentation methods. △ Less

Submitted 16 February, 2023; originally announced February 2023.

Comments: Accepted by BMVC 2022

arXiv:2301.13604 [pdf, other]

Nonlinearities in Macroeconomic Tail Risk through the Lens of Big Data Quantile Regressions

Authors: Jan Prüser, Florian Huber

Abstract: Modeling and predicting extreme movements in GDP is notoriously difficult and the selection of appropriate covariates and/or possible forms of nonlinearities are key in obtaining precise forecasts. In this paper, our focus is on using large datasets in quantile regression models to forecast the conditional distribution of US GDP growth. To capture possible non-linearities, we include several nonli… ▽ More Modeling and predicting extreme movements in GDP is notoriously difficult and the selection of appropriate covariates and/or possible forms of nonlinearities are key in obtaining precise forecasts. In this paper, our focus is on using large datasets in quantile regression models to forecast the conditional distribution of US GDP growth. To capture possible non-linearities, we include several nonlinear specifications. The resulting models will be huge dimensional and we thus rely on a set of shrinkage priors. Since Markov Chain Monte Carlo estimation becomes slow in these dimensions, we rely on fast variational Bayes approximations to the posterior distribution of the coefficients and the latent states. We find that our proposed set of models produces precise forecasts. These gains are especially pronounced in the tails. Using Gaussian processes to approximate the nonlinear component of the model further improves the good performance, in particular in the right tail. △ Less

Submitted 22 September, 2023; v1 submitted 31 January, 2023; originally announced January 2023.

arXiv:2212.03471 [pdf, ps, other]

Bayesian Forecasting in Economics and Finance: A Modern Review

Authors: Gael M. Martin, David T. Frazier, Worapree Maneesoonthorn, Ruben Loaiza-Maya, Florian Huber, Gary Koop, John Maheu, Didier Nibbering, Anastasios Panagiotelis

Abstract: The Bayesian statistical paradigm provides a principled and coherent approach to probabilistic forecasting. Uncertainty about all unknowns that characterize any forecasting problem -- model, parameters, latent states -- is able to be quantified explicitly, and factored into the forecast distribution via the process of integration or averaging. Allied with the elegance of the method, Bayesian forec… ▽ More The Bayesian statistical paradigm provides a principled and coherent approach to probabilistic forecasting. Uncertainty about all unknowns that characterize any forecasting problem -- model, parameters, latent states -- is able to be quantified explicitly, and factored into the forecast distribution via the process of integration or averaging. Allied with the elegance of the method, Bayesian forecasting is now underpinned by the burgeoning field of Bayesian computation, which enables Bayesian forecasts to be produced for virtually any problem, no matter how large, or complex. The current state of play in Bayesian forecasting in economics and finance is the subject of this review. The aim is to provide the reader with an overview of modern approaches to the field, set in some historical context; and with sufficient computational detail given to assist the reader with implementation. △ Less

Submitted 28 July, 2023; v1 submitted 7 December, 2022; originally announced December 2022.

Comments: The paper is now published online at: https://doi.org/10.1016/j.ijforecast.2023.05.002

arXiv:2212.02492 [pdf, other]

doi 10.3390/rs14236112

Giant Planet Observations in NASA's Planetary Data System

Authors: Nancy J. Chanover, James M. Bauer, John J. Blalock, Mitchell K. Gordon, Lyle F. Huber, Mia J. T. Mace, Lynn D. V. Neakrase, Matthew S. Tiscareno, Raymond J. Walker

Abstract: While there have been far fewer missions to the outer Solar System than to the inner Solar System, spacecraft destined for the giant planets have conducted a wide range of fundamental investigations, returning data that continues to reshape our understanding of these complex systems, sometimes decades after the data were acquired. These data are preserved and accessible from national and internati… ▽ More While there have been far fewer missions to the outer Solar System than to the inner Solar System, spacecraft destined for the giant planets have conducted a wide range of fundamental investigations, returning data that continues to reshape our understanding of these complex systems, sometimes decades after the data were acquired. These data are preserved and accessible from national and international planetary science archives. For all NASA planetary missions and instruments the data are available from the science discipline nodes of the NASA Planetary Data System (PDS). Looking ahead, the PDS will be the primary repository for giant planets data from several upcoming missions and derived datasets, as well as supporting research conducted to aid in the interpretation of the remotely sensed giant planets data already archived in the PDS. △ Less

Submitted 5 December, 2022; originally announced December 2022.

Comments: Contributed to the special issue of Remote Sensing entitled "Remote Sensing Observations of the Giant Planets"

Journal ref: https://www.mdpi.com/2072-4292/14/23/6112

arXiv:2211.14617 [pdf, other]

Mixture of Decision Trees for Interpretable Machine Learning

Authors: Simeon Brüggenjürgen, Nina Schaaf, Pascal Kerschke, Marco F. Huber

Abstract: This work introduces a novel interpretable machine learning method called Mixture of Decision Trees (MoDT). It constitutes a special case of the Mixture of Experts ensemble architecture, which utilizes a linear model as gating function and decision trees as experts. Our proposed method is ideally suited for problems that cannot be satisfactorily learned by a single decision tree, but which can alt… ▽ More This work introduces a novel interpretable machine learning method called Mixture of Decision Trees (MoDT). It constitutes a special case of the Mixture of Experts ensemble architecture, which utilizes a linear model as gating function and decision trees as experts. Our proposed method is ideally suited for problems that cannot be satisfactorily learned by a single decision tree, but which can alternatively be divided into subproblems. Each subproblem can then be learned well from a single decision tree. Therefore, MoDT can be considered as a method that improves performance while maintaining interpretability by making each of its decisions understandable and traceable to humans. Our work is accompanied by a Python implementation, which uses an interpretable gating function, a fast learning algorithm, and a direct interface to fine-tuned interpretable visualization methods. The experiments confirm that the implementation works and, more importantly, show the superiority of our approach compared to single decision trees and random forests of similar complexity. △ Less

Submitted 26 November, 2022; originally announced November 2022.

Comments: Accepted for publication at the 21st IEEE International Conference of Machine Learning and Applications (ICMLA)

arXiv:2211.12399 [pdf, other]

doi 10.1016/j.physleta.2023.128782

Photons are lying about where they have been, again

Authors: Gregory Reznik, Carlotta Versmold, Jan Dziewior, Florian Huber, Shrobona Bagchi, Harald Weinfurter, Justin Dressel, Lev Vaidman

Abstract: Bhati and Arvind [Phys. Lett. A, 127955 (2022)] recently argued that in a specially designed experiment the timing of photon detection events demonstrates photon presence at a location at which they are not present according to the weak value approach. The alleged contradiction is resolved by a subtle interference effect resulting in anomalous sensitivity of the signal imprinted on the postselecte… ▽ More Bhati and Arvind [Phys. Lett. A, 127955 (2022)] recently argued that in a specially designed experiment the timing of photon detection events demonstrates photon presence at a location at which they are not present according to the weak value approach. The alleged contradiction is resolved by a subtle interference effect resulting in anomalous sensitivity of the signal imprinted on the postselected photons for the interaction at this location, similarly to the case of a nested Mach-Zehnder interferometer with a Dove prism [Quant. Stud.: Mat. Found. 2, 255 (2015)]. We perform an in depth analysis of the characterization of the presence of a pre- and postselected particle at a particular location based on information imprinted on the particle itself. The theoretical results are tested by a computer simulation of the proposed experiment. △ Less

Submitted 31 March, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

arXiv:2211.06349 [pdf, other]

Refuting spectral compatibility of quantum marginals

Authors: Felix Huber, Nikolai Wyderka

Abstract: The spectral variant of the quantum marginal problem asks: Given prescribed spectra for a set of quantum marginals, does there exist a compatible joint state? The main idea of this work is a symmetry-reduced semidefinite programming hierarchy for detecting incompatible spectra. The hierarchy is complete, in the sense that it detects every incompatible set of spectra. The refutations it provides ar… ▽ More The spectral variant of the quantum marginal problem asks: Given prescribed spectra for a set of quantum marginals, does there exist a compatible joint state? The main idea of this work is a symmetry-reduced semidefinite programming hierarchy for detecting incompatible spectra. The hierarchy is complete, in the sense that it detects every incompatible set of spectra. The refutations it provides are dimension-free, certifying incompatibility in all local dimensions. The hierarchy equally applies to the sums of Hermitian matrices problem, to optimize trace polynomials on the positive cone, to the compatibility of invariants, and to certify vanishing Kronecker coefficients. △ Less

Submitted 15 March, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

Comments: 18 pages, 2 figures. Now includes a proof of completeness and convergence

arXiv:2211.06144 [pdf]

doi 10.1016/j.tsf.2023.139860

Synthesis and physical properties of uranium thin-film hydrides UH2 and \b{eta}-UH3

Authors: Evgenia A. Tereshina-Chitrova, Ladislav Havela, Mykhaylo Paukov, Oleksandra Koloskova, Lukas Horak, Milan Dopita, Mayerling Martinez Celis, Miroslav Cieslar, Zbynek Soban, Thomas Gouder, Frank Huber

Abstract: Formation of thin uranium hydrides films, UH2 and \b{eta}-UH3, synthesized by a reactive dc sputtering of uranium metal, was explored using variable deposition conditions. Obtained stable oxygen-free hydride films were studied by a variety of methods, both in situ (photoelectron spectroscopy - XPS), and ex-situ (x-ray diffraction - XRD, transmission electron microscopy - TEM), electrical resistivi… ▽ More Formation of thin uranium hydrides films, UH2 and \b{eta}-UH3, synthesized by a reactive dc sputtering of uranium metal, was explored using variable deposition conditions. Obtained stable oxygen-free hydride films were studied by a variety of methods, both in situ (photoelectron spectroscopy - XPS), and ex-situ (x-ray diffraction - XRD, transmission electron microscopy - TEM), electrical resistivity, and magnetometry). Both types of hydrides are ferromagnetic, the Curie temperatures of UH2 and \b{eta}-UH3 are approx. 120 and 170 K, respectively. Ferromagnetism in the thin films is robust and does not depend on structure details while electrical resistivity data reflect disorder in both types of hydrides. △ Less

Submitted 11 November, 2022; originally announced November 2022.

arXiv:2211.04752 [pdf, other]

Bayesian Neural Networks for Macroeconomic Analysis

Authors: Niko Hauzenberger, Florian Huber, Karin Klieber, Massimiliano Marcellino

Abstract: Macroeconomic data is characterized by a limited number of observations (small T), many time series (big K) but also by featuring temporal dependence. Neural networks, by contrast, are designed for datasets with millions of observations and covariates. In this paper, we develop Bayesian neural networks (BNNs) that are well-suited for handling datasets commonly used for macroeconomic analysis in po… ▽ More Macroeconomic data is characterized by a limited number of observations (small T), many time series (big K) but also by featuring temporal dependence. Neural networks, by contrast, are designed for datasets with millions of observations and covariates. In this paper, we develop Bayesian neural networks (BNNs) that are well-suited for handling datasets commonly used for macroeconomic analysis in policy institutions. Our approach avoids extensive specification searches through a novel mixture specification for the activation function that appropriately selects the form of nonlinearities. Shrinkage priors are used to prune the network and force irrelevant neurons to zero. To cope with heteroskedasticity, the BNN is augmented with a stochastic volatility model for the error term. We illustrate how the model can be used in a policy institution by first showing that our different BNNs produce precise density forecasts, typically better than those from other machine learning methods. Finally, we showcase how our model can be used to recover nonlinearities in the reaction of macroeconomic aggregates to financial shocks. △ Less

Submitted 2 April, 2024; v1 submitted 9 November, 2022; originally announced November 2022.

Comments: JEL: C11, C30, C45, C53, E3, E44. Keywords: Bayesian neural networks, model selection, shrinkage priors, macro forecasting

arXiv:2210.03424 [pdf, other]

Kalman-Bucy-Informed Neural Network for System Identification

Authors: Tobias Nagel, Marco F. Huber

Abstract: Identifying parameters in a system of nonlinear, ordinary differential equations is vital for designing a robust controller. However, if the system is stochastic in its nature or if only noisy measurements are available, standard optimization algorithms for system identification usually fail. We present a new approach that combines the recent advances in physics-informed neural networks and the we… ▽ More Identifying parameters in a system of nonlinear, ordinary differential equations is vital for designing a robust controller. However, if the system is stochastic in its nature or if only noisy measurements are available, standard optimization algorithms for system identification usually fail. We present a new approach that combines the recent advances in physics-informed neural networks and the well-known achievements of Kalman filters in order to find parameters in a continuous-time system with noisy measurements. In doing so, our approach allows estimating the parameters together with the mean value and covariance matrix of the system's state vector. We show that the method works for complex systems by identifying the parameters of a double pendulum. △ Less

Submitted 7 October, 2022; originally announced October 2022.

Comments: 6 pages, 5 figures, Conference on Decision and Control 2022

arXiv:2209.11970 [pdf, other]

Bayesian Modeling of TVP-VARs Using Regression Trees

Authors: Niko Hauzenberger, Florian Huber, Gary Koop, James Mitchell

Abstract: In light of widespread evidence of parameter instability in macroeconomic models, many time-varying parameter (TVP) models have been proposed. This paper proposes a nonparametric TVP-VAR model using Bayesian additive regression trees (BART) that models the TVPs as an unknown function of effect modifiers. The novelty of this model arises from the fact that the law of motion driving the parameters i… ▽ More In light of widespread evidence of parameter instability in macroeconomic models, many time-varying parameter (TVP) models have been proposed. This paper proposes a nonparametric TVP-VAR model using Bayesian additive regression trees (BART) that models the TVPs as an unknown function of effect modifiers. The novelty of this model arises from the fact that the law of motion driving the parameters is treated nonparametrically. This leads to great flexibility in the nature and extent of parameter change, both in the conditional mean and in the conditional variance. Parsimony is achieved through adopting nonparametric factor structures and use of shrinkage priors. In an application to US macroeconomic data, we illustrate the use of our model in tracking both the evolving nature of the Phillips curve and how the effects of business cycle shocks on inflation measures vary nonlinearly with changes in the effect modifiers. △ Less

Submitted 5 May, 2023; v1 submitted 24 September, 2022; originally announced September 2022.

Comments: JEL: C11, C32, C51, E31, E32; KEYWORDS: Bayesian vector autoregression, time-varying parameters, nonparametric modeling, machine learning, regression trees, Phillips curve, business cycle shocks

Showing 1–50 of 168 results for author: Huber, F