Search | arXiv e-print repository

High-Q nanophotonics over the full visible spectrum enabled by hexagonal boron nitride metasurfaces

Authors: Lucca Kühner, Luca Sortino, Benjamin Tilmann, Thomas Weber, Kenji Watanabe, Takashi Taniguchi, Stefan A. Maier, Andreas Tittl

Abstract: All-dielectric optical metasurfaces with high quality (Q) factors have so far been hampered by the lack of simultaneously lossless and high refractive index (RI) materials over the full visible spectrum. To achieve broad spectral coverage, the use of low-index materials is, in fact, unavoidable due to the inverse correlation between the band-gap energy (and therefore the optical losses) and the RI… ▽ More All-dielectric optical metasurfaces with high quality (Q) factors have so far been hampered by the lack of simultaneously lossless and high refractive index (RI) materials over the full visible spectrum. To achieve broad spectral coverage, the use of low-index materials is, in fact, unavoidable due to the inverse correlation between the band-gap energy (and therefore the optical losses) and the RI. However, for Mie resonant photonics, smaller RIs are associated with reduced Q factors and mode volume confinement. In this work, we leverage symmetry-broken bound states in the continuum (BICs) to efficiently suppress radiation losses from the low-index (n~2) van der Waals material hexagonal boron nitride (hBN), realizing metasurfaces with high-Q resonances over the complete visible spectrum. In particular, we analyze the rational use of low and high RI materials as resonator components and harness our insights to experimentally demonstrate sharp BIC resonances with Q factors above 300, spanning wavelengths between 400 nm and 1000 nm from a single hBN flake. Moreover, we utilize the enhanced electric near-fields to demonstrate second harmonic generation (SHG) with enhancement factors above 102. Our results provide a theoretical and experimental framework for the implementation of low RI materials as photonic media for metaoptics. △ Less

Submitted 20 October, 2022; originally announced October 2022.

Comments: Main text and supporting information, 23 pages, 4 Figures manuscript + 4 Supporting Figures

arXiv:2209.05453 [pdf, ps, other]

Metric compatibility and Levi-Civita Connections on Quantum Groups

Authors: Paolo Aschieri, Thomas Weber

Abstract: Arbitrary connections on a generic Hopf algebra $H$ are studied and shown to extend to connections on tensor fields. On this ground a general definition of metric compatible connection is proposed. This leads to a sufficient criterion for the existence and uniqueness of the Levi-Civita connection, that of invertibility of an $H$-valued matrix. Provided invertibility for one metric, existence and u… ▽ More Arbitrary connections on a generic Hopf algebra $H$ are studied and shown to extend to connections on tensor fields. On this ground a general definition of metric compatible connection is proposed. This leads to a sufficient criterion for the existence and uniqueness of the Levi-Civita connection, that of invertibility of an $H$-valued matrix. Provided invertibility for one metric, existence and uniqueness of the Levi-Civita connection for all metrics conformal to the initial one is proven. This class consists of metrics which are neither central (bimodule maps) nor equivariant, in general. For central and bicoinvariant metrics the invertibility condition is further simplified to a metric independent one. Examples include metrics on $SL_q(2)$. △ Less

Submitted 4 October, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

Comments: 40 pages, fully revised version, improved condition for existence and uniqueness of Levi-Civita connections, clarified relation to [Bhowmick, Mukhopadhyay, J. Algebra 563, 2020] in Remarks 5.6 i) and 5.11. Added subsection for triangular quantum groups. Improved readability. Extended bibliography

arXiv:2209.01944 [pdf]

doi 10.1038/s41563-023-01580-7

Strong light-matter interaction with self-hybridized bound states in the continuum in monolithic van der Waals metasurfaces

Authors: Thomas Weber, Lucca Kühner, Luca Sortino, Amine Ben Mhenni, Nathan P. Wilson, Julius Kühne, Jonathan J. Finley, Stefan A. Maier, Andreas Tittl

Abstract: Photonic bound states in the continuum (BICs) are a standout nanophotonic platform for strong light-matter coupling with transition metal dichalcogenides (TMDCs), but have so far mostly been employed as all-dielectric metasurfaces with adjacent TMDC layers, incurring limitations related to strain, mode overlap, and material integration. In this work, we experimentally demonstrate for the first tim… ▽ More Photonic bound states in the continuum (BICs) are a standout nanophotonic platform for strong light-matter coupling with transition metal dichalcogenides (TMDCs), but have so far mostly been employed as all-dielectric metasurfaces with adjacent TMDC layers, incurring limitations related to strain, mode overlap, and material integration. In this work, we experimentally demonstrate for the first time asymmetry-dependent BIC resonances in 2D arrays of monolithic metasurfaces composed solely of the nanostructured bulk TMDC WS$_2$ with BIC modes exhibiting sharp and tailored linewidths, ideal for selectively enhancing light-matter interactions. Geometrical variation enables the tuning of the BIC resonances across the exciton resonance in bulk WS$_2$, revealing the strong-coupling regime with an anti-crossing pattern and a Rabi splitting of 116 meV. The precise control over the radiative loss channel provided by the BIC concept is harnessed to tailor the Rabi splitting via a geometrical asymmetry parameter of the metasurface. Crucially, the coupling strength itself can be controlled and is shown to be independent of material-intrinsic losses. Our BIC-driven monolithic metasurface platform can readily incorporate other TMDCs or excitonic materials to deliver previously unavailable fundamental insights and practical device concepts for polaritonic applications. △ Less

Submitted 5 September, 2022; originally announced September 2022.

Comments: Main text and supporting information, 31 pages, 4 Figures manuscript + 8 Supporting Figures

arXiv:2208.13802 [pdf, ps, other]

doi 10.1088/1751-8121/acc8a5

Constraining Weil-Petersson volumes by universal random matrix correlations in low-dimensional quantum gravity

Authors: Torsten Weber, Fabian Haneder, Klaus Richter, Juan Diego Urbina

Abstract: Based on the discovery of the duality between Jackiw-Teitelboim quantum gravity and a double-scaled matrix ensemble by Saad, Shenker and Stanford in 2019, we show how consistency between the two theories in the universal Random Matrix Theory (RMT) limit imposes a set of constraints on the volumes of moduli spaces of Riemannian manifolds. These volumes are given in terms of polynomial functions, th… ▽ More Based on the discovery of the duality between Jackiw-Teitelboim quantum gravity and a double-scaled matrix ensemble by Saad, Shenker and Stanford in 2019, we show how consistency between the two theories in the universal Random Matrix Theory (RMT) limit imposes a set of constraints on the volumes of moduli spaces of Riemannian manifolds. These volumes are given in terms of polynomial functions, the Weil-Petersson volumes, solving a celebrated nonlinear recursion formula that is notoriously difficult to analyze. Since our results imply linear relations between the coefficients of the Weil-Petersson volumes, they therefore provide both a stringent test for their symbolic calculation and a possible way of simplifying their construction. In this way, we propose a long-term program to improve the understanding of mathematically hard aspects concerning moduli spaces of hyperbolic manifolds by using universal RMT results as input. △ Less

Submitted 26 April, 2023; v1 submitted 29 August, 2022; originally announced August 2022.

Comments: 25 pages, matches the published version, additional clarifications and comments added as well as appendices improving the level of self-containedness

Journal ref: 2023 J. Phys. A: Math. Theor. 56 205206

arXiv:2207.10768 [pdf]

Plasmonic Bound States in the Continuum to Tailor Light-Matter Coupling

Authors: Andreas Aigner, Andreas Tittl, Juan Wang, Thomas Weber, Yuri Kivshar, Stefan A. Maier, Haoran Ren

Abstract: Plasmon resonances play a pivotal role in enhancing light-matter interactions in nanophotonics, but their low-quality factors have hindered applications demanding high spectral selectivity. Even though symmetry-protected bound states in the continuum with high-quality factors have been realized in dielectric metasurfaces, impinging light is not efficiently coupled to the resonant metasurfaces and… ▽ More Plasmon resonances play a pivotal role in enhancing light-matter interactions in nanophotonics, but their low-quality factors have hindered applications demanding high spectral selectivity. Even though symmetry-protected bound states in the continuum with high-quality factors have been realized in dielectric metasurfaces, impinging light is not efficiently coupled to the resonant metasurfaces and is lost in the form of reflection due to low intrinsic losses. Here, we demonstrate a novel design and 3D laser nanoprinting of plasmonic nanofin metasurfaces, which support symmetry-protected bound states in the continuum up to 4th order. By breaking the nanofins out-of-plane symmetry in parameter space, we achieve high-quality factor (up to 180) modes under normal incidence. We reveal that the out-of-plane symmetry breaking can be fine-tuned by the triangle angle of the 3D nanofin meta-atoms, opening a pathway to precisely control the ratio of radiative to intrinsic losses. This enables access to the under-, critical-, and over-coupled regimes, which we exploit for pixelated molecular sensing. Depending on the coupling regime we observe negative, no, or positive modulation induced by the analyte, unveiling the undeniable importance of tailoring light-matter interaction. Our demonstration provides a novel metasurface platform for enhanced light-matter interaction with a wide range of applications in optical sensing, energy conversion, nonlinear photonics, surface-enhanced spectroscopy, and quantum optics. △ Less

Submitted 21 July, 2022; originally announced July 2022.

Comments: 33 pages, 4 figures, 9 supplementary figures

arXiv:2206.05314 [pdf, other]

Large-Scale Retrieval for Reinforcement Learning

Authors: Peter C. Humphreys, Arthur Guez, Olivier Tieleman, Laurent Sifre, Théophane Weber, Timothy Lillicrap

Abstract: Effective decision making involves flexibly relating past experiences and relevant contextual information to a novel situation. In deep reinforcement learning (RL), the dominant paradigm is for an agent to amortise information that helps decision making into its network weights via gradient descent on training losses. Here, we pursue an alternative approach in which agents can utilise large-scale… ▽ More Effective decision making involves flexibly relating past experiences and relevant contextual information to a novel situation. In deep reinforcement learning (RL), the dominant paradigm is for an agent to amortise information that helps decision making into its network weights via gradient descent on training losses. Here, we pursue an alternative approach in which agents can utilise large-scale context sensitive database lookups to support their parametric computations. This allows agents to directly learn in an end-to-end manner to utilise relevant information to inform their outputs. In addition, new information can be attended to by the agent, without retraining, by simply augmenting the retrieval dataset. We study this approach for offline RL in 9x9 Go, a challenging game for which the vast combinatorial state space privileges generalisation over direct matching to past experiences. We leverage fast, approximate nearest neighbor techniques in order to retrieve relevant data from a set of tens of millions of expert demonstration states. Attending to this information provides a significant boost to prediction accuracy and game-play performance over simply using these demonstrations as training trajectories, providing a compelling demonstration of the value of large-scale retrieval in offline RL agents. △ Less

Submitted 16 December, 2022; v1 submitted 10 June, 2022; originally announced June 2022.

Comments: Thirty-sixth Annual Conference on Neural Information Processing Systems (NeurIPS 2022), 16 pages

arXiv:2206.04590 [pdf, other]

doi 10.24963/ijcai.2021/81

GASP: Gated Attention For Saliency Prediction

Authors: Fares Abawi, Tom Weber, Stefan Wermter

Abstract: Saliency prediction refers to the computational task of modeling overt attention. Social cues greatly influence our attention, consequently altering our eye movements and behavior. To emphasize the efficacy of such features, we present a neural model for integrating social cues and weighting their influences. Our model consists of two stages. During the first stage, we detect two social cues by fo… ▽ More Saliency prediction refers to the computational task of modeling overt attention. Social cues greatly influence our attention, consequently altering our eye movements and behavior. To emphasize the efficacy of such features, we present a neural model for integrating social cues and weighting their influences. Our model consists of two stages. During the first stage, we detect two social cues by following gaze, estimating gaze direction, and recognizing affect. These features are then transformed into spatiotemporal maps through image processing operations. The transformed representations are propagated to the second stage (GASP) where we explore various techniques of late fusion for integrating social cues and introduce two sub-networks for directing attention to relevant stimuli. Our experiments indicate that fusion approaches achieve better results for static integration methods, whereas non-fusion approaches for which the influence of each modality is unknown, result in better outcomes when coupled with recurrent models for dynamic saliency prediction. We show that gaze direction and affective representations contribute a prediction to ground-truth correspondence improvement of at least 5% compared to dynamic saliency models without social cues. Furthermore, affective representations improve GASP, supporting the necessity of considering affect-biased attention in predicting saliency. △ Less

Submitted 9 June, 2022; originally announced June 2022.

Comments: International Joint Conference on Artificial Intelligence (IJCAI-21)

Journal ref: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (2021) 584-591

arXiv:2205.00216 [pdf, ps, other]

doi 10.22323/1.406.0305

Twisted geometry for submanifolds of $\mathbb{R}^n$

Authors: Gaetano Fiore, Thomas Weber

Abstract: This is a friendly introduction to our recent general procedure for constructing noncommutative deformations of an embedded submanifold $M$ of $\mathbb{R}^n$ determined by a set of smooth equations $f^a(x)=0$. We use the framework of Drinfel'd twist deformation of differential geometry pioneered in [Aschieri et al., Class. Quantum Gravity 23 (2006), 1883]; the commutative pointwise product is repl… ▽ More This is a friendly introduction to our recent general procedure for constructing noncommutative deformations of an embedded submanifold $M$ of $\mathbb{R}^n$ determined by a set of smooth equations $f^a(x)=0$. We use the framework of Drinfel'd twist deformation of differential geometry pioneered in [Aschieri et al., Class. Quantum Gravity 23 (2006), 1883]; the commutative pointwise product is replaced by a (generally noncommutative) $\star$-product induced by a Drinfel'd twist. △ Less

Submitted 30 April, 2022; originally announced May 2022.

Comments: 20 pages. Submitted to PoS, as a contribution to the Proceedings of Corfu Summer Institute 2021 "School and Workshops on Elementary Particle Physics and Gravity", more precisely to the "Workshop on Quantum Geometry, Field Theory and Gravity", 20-27 September 2021

MSC Class: 46L87; 83C65; 14A22; 16Txx

arXiv:2204.04875 [pdf, other]

Learning to Induce Causal Structure

Authors: Nan Rosemary Ke, Silvia Chiappa, Jane Wang, Anirudh Goyal, Jorg Bornschein, Melanie Rey, Theophane Weber, Matthew Botvinic, Michael Mozer, Danilo Jimenez Rezende

Abstract: The fundamental challenge in causal induction is to infer the underlying graph structure given observational and/or interventional data. Most existing causal induction algorithms operate by generating candidate graphs and evaluating them using either score-based methods (including continuous optimization) or independence tests. In our work, we instead treat the inference process as a black box and… ▽ More The fundamental challenge in causal induction is to infer the underlying graph structure given observational and/or interventional data. Most existing causal induction algorithms operate by generating candidate graphs and evaluating them using either score-based methods (including continuous optimization) or independence tests. In our work, we instead treat the inference process as a black box and design a neural network architecture that learns the mapping from both observational and interventional data to graph structures via supervised training on synthetic graphs. The learned model generalizes to new synthetic graphs, is robust to train-test distribution shifts, and achieves state-of-the-art performance on naturalistic graphs for low sample complexity. △ Less

Submitted 7 October, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

arXiv:2204.04501 [pdf, other]

Explain yourself! Effects of Explanations in Human-Robot Interaction

Authors: Jakob Ambsdorf, Alina Munir, Yiyao Wei, Klaas Degkwitz, Harm Matthias Harms, Susanne Stannek, Kyra Ahrens, Dennis Becker, Erik Strahl, Tom Weber, Stefan Wermter

Abstract: Recent developments in explainable artificial intelligence promise the potential to transform human-robot interaction: Explanations of robot decisions could affect user perceptions, justify their reliability, and increase trust. However, the effects on human perceptions of robots that explain their decisions have not been studied thoroughly. To analyze the effect of explainable robots, we conduct… ▽ More Recent developments in explainable artificial intelligence promise the potential to transform human-robot interaction: Explanations of robot decisions could affect user perceptions, justify their reliability, and increase trust. However, the effects on human perceptions of robots that explain their decisions have not been studied thoroughly. To analyze the effect of explainable robots, we conduct a study in which two simulated robots play a competitive board game. While one robot explains its moves, the other robot only announces them. Providing explanations for its actions was not sufficient to change the perceived competence, intelligence, likeability or safety ratings of the robot. However, the results show that the robot that explains its moves is perceived as more lively and human-like. This study demonstrates the need for and potential of explainable human-robot interaction and the wider assessment of its effects as a novel research direction. △ Less

Submitted 14 June, 2022; v1 submitted 9 April, 2022; originally announced April 2022.

Comments: Accepted at 2022 31st IEEE International Conference on Robot and Human Interactive Communication (RO-MAN)

arXiv:2203.08159 [pdf, other]

doi 10.1126/science.abe4441

Topological magnon band structure of emergent Landau levels in a skyrmion lattice

Authors: T. Weber, D. M. Fobes, J. Waizner, P. Steffens, G. S. Tucker, M. Böhm, L. Beddrich, C. Franz, H. Gabold, R. Bewley, D. Voneshen, M. Skoulatos, R. Georgii, G. Ehlers, A. Bauer, C. Pfleiderer, P. Böni, M. Janoschek, M. Garst

Abstract: The motion of a spin excitation across topologically non-trivial magnetic order exhibits a deflection that is analogous to the effect of the Lorentz force on an electrically charged particle in an orbital magnetic field. We used polarized inelastic neutron scattering to investigate the propagation of magnons (i.e., bosonic collective spin excitations) in a lattice of skyrmion tubes in manganese si… ▽ More The motion of a spin excitation across topologically non-trivial magnetic order exhibits a deflection that is analogous to the effect of the Lorentz force on an electrically charged particle in an orbital magnetic field. We used polarized inelastic neutron scattering to investigate the propagation of magnons (i.e., bosonic collective spin excitations) in a lattice of skyrmion tubes in manganese silicide. For wave vectors perpendicular to the skyrmion tubes, the magnon spectra are consistent with the formation of finely spaced emergent Landau levels that are characteristic of the fictitious magnetic field used to account for the nontrivial topological winding of the skyrmion lattice. This provides evidence of a topological magnon band structure in reciprocal space, which is borne out of the nontrivial real-space topology of a magnetic order. △ Less

Submitted 15 March, 2022; originally announced March 2022.

Journal ref: Science 375, 1025 (2022)

arXiv:2202.10961 [pdf, other]

doi 10.1088/1748-0221/17/10/P10045

Additive manufacturing of fine-granularity optically-isolated plastic scintillator elements

Authors: S. Berns, E. Boillat, A. Boyarintsev, A. De Roeck, S. Dolan, A. Gendotti, B. Grynyov, S. Hugon, U. Kose, S. Kovalchuk, B. Li, A. Rubbia, T. Sibilieva, D. Sgalaberna, T. Weber, J. Wuthrich, X. Y. Zhao

Abstract: Plastic scintillator detectors are used in high energy physics as well as for diagnostic imaging in medicine, beam monitoring on hadron therapy, muon tomography, dosimetry and many security applications. To combine particle tracking and calorimetry it is necessary to build detectors with three-dimensional granularity, i.e. small voxels of scintillator optically isolated from each other. Recently,… ▽ More Plastic scintillator detectors are used in high energy physics as well as for diagnostic imaging in medicine, beam monitoring on hadron therapy, muon tomography, dosimetry and many security applications. To combine particle tracking and calorimetry it is necessary to build detectors with three-dimensional granularity, i.e. small voxels of scintillator optically isolated from each other. Recently, the 3DET collaboration demonstrated the possibility to 3D print polystyrene-based scintillators with a light output performance close to that obtained with standard production methods. In this article, after providing a further characterization of the developed scintillators, we show the first matrix of plastic scintillator cubes optically separated by a white reflector material entirely 3D printed with fused deposition modeling. This is a major milestone towards the 3D printing of the first real particle detector. A discussion of the results as well as the next steps in the R&D is also provided. △ Less

Submitted 16 October, 2022; v1 submitted 22 February, 2022; originally announced February 2022.

Comments: Accepted for publication in JINST

arXiv:2202.08417 [pdf, other]

Retrieval-Augmented Reinforcement Learning

Authors: Anirudh Goyal, Abram L. Friesen, Andrea Banino, Theophane Weber, Nan Rosemary Ke, Adria Puigdomenech Badia, Arthur Guez, Mehdi Mirza, Peter C. Humphreys, Ksenia Konyushkova, Laurent Sifre, Michal Valko, Simon Osindero, Timothy Lillicrap, Nicolas Heess, Charles Blundell

Abstract: Most deep reinforcement learning (RL) algorithms distill experience into parametric behavior policies or value functions via gradient updates. While effective, this approach has several disadvantages: (1) it is computationally expensive, (2) it can take many updates to integrate experiences into the parametric model, (3) experiences that are not fully integrated do not appropriately influence the… ▽ More Most deep reinforcement learning (RL) algorithms distill experience into parametric behavior policies or value functions via gradient updates. While effective, this approach has several disadvantages: (1) it is computationally expensive, (2) it can take many updates to integrate experiences into the parametric model, (3) experiences that are not fully integrated do not appropriately influence the agent's behavior, and (4) behavior is limited by the capacity of the model. In this paper we explore an alternative paradigm in which we train a network to map a dataset of past experiences to optimal behavior. Specifically, we augment an RL agent with a retrieval process (parameterized as a neural network) that has direct access to a dataset of experiences. This dataset can come from the agent's past experiences, expert demonstrations, or any other relevant source. The retrieval process is trained to retrieve information from the dataset that may be useful in the current context, to help the agent achieve its goal faster and more efficiently. he proposed method facilitates learning agents that at test-time can condition their behavior on the entire dataset and not only the current state, or current trajectory. We integrate our method into two different RL agents: an offline DQN agent and an online R2D2 agent. In offline multi-task problems, we show that the retrieval-augmented DQN agent avoids task interference and learns faster than the baseline DQN agent. On Atari, we show that retrieval-augmented R2D2 learns significantly faster than the baseline R2D2 agent and achieves higher scores. We run extensive ablations to measure the contributions of the components of our proposed method. △ Less

Submitted 24 May, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

arXiv:2112.04780 [pdf, other]

Control of transport phenomena in magnetic heterostructures by wavelength modulation

Authors: Christopher Seibel, Marius Weber, Martin Stiehl, Sebastian T. Weber, Martin Aeschlimann, Hans Christian Schneider, Benjamin Stadtmüller, Baerbel Rethfeld

Abstract: We demonstrate the tuneablity of the ultrafast energy flow in magnetic/non-magnetic bilayer structures by changing the wavelength of the optical excitation. This is achieved by an advanced description of the temperature based $μ$T-model that explicitly considers the wavelength- and layer-dependent absorption profile within multilayer structures. For the exemplary case of a Ni/Au bilayer, our simul… ▽ More We demonstrate the tuneablity of the ultrafast energy flow in magnetic/non-magnetic bilayer structures by changing the wavelength of the optical excitation. This is achieved by an advanced description of the temperature based $μ$T-model that explicitly considers the wavelength- and layer-dependent absorption profile within multilayer structures. For the exemplary case of a Ni/Au bilayer, our simulations predict that the energy flow from Ni to Au is reversed when changing the wavelength of the excitation from the infrared to the ultraviolet spectral range. These predictions are fully supported by characteristic signatures in the magneto-optical Kerr traces of the Ni/Au model system. Our results will open up new avenues to steer and control the energy transport in designed magnetic multilayer for ultrafast spintronic applications. △ Less

Submitted 9 December, 2021; originally announced December 2021.

Comments: 6 pages (+3 pages supplemental), 5 figures (+1 figure in supplemental)

arXiv:2112.01851 [pdf, other]

doi 10.3390/nano12101655

Implementation of the electronic non-equilibrium in the two-temperature model

Authors: Markus Uehlein, Sebastian T. Weber, Baerbel Rethfeld

Abstract: We investigate a temperature-based model, called extended two-temperature model (eTTM), that describes the electronic non-equilibrium and its effect on energy dissipation in metals after ultrashort laser excitation. We derive and discuss improvements in comparison to published versions of this model [E. Carpene, Phys. Rev. B 2006, 74, 024301; G. Tsibidis, Appl. Phys. A 2018, 124, 311]. The compari… ▽ More We investigate a temperature-based model, called extended two-temperature model (eTTM), that describes the electronic non-equilibrium and its effect on energy dissipation in metals after ultrashort laser excitation. We derive and discuss improvements in comparison to published versions of this model [E. Carpene, Phys. Rev. B 2006, 74, 024301; G. Tsibidis, Appl. Phys. A 2018, 124, 311]. The comparison of the results of the eTTM with results of the well-known two-temperature model (TTM) shows a delayed increase of the electronic temperature when being calculated with the eTTM. We find a good agreement in the non-equilibrium energy distribution after absorption of photons with results from a kinetic description using a Boltzmann collision term. The model provides a convenient tool for fast calculation of features of the non-equilibrium electrons. As an example we inspect the dynamics of high-energy electrons observable in photo-electron spectroscopy and demonstrate the advantage of the eTTM over the conventional two-temperature model. △ Less

Submitted 3 December, 2021; originally announced December 2021.

Comments: 14 pages, 8 figures (including 3 double figures)

arXiv:2111.14908 [pdf]

doi 10.1088/2515-7647/ac76f9

Roadmap on Wavefront Shaping and deep imaging in complex media

Authors: Sylvain Gigan, Ori Katz, Hilton B. de Aguiar, Esben Ravn Andresen, Alexandre Aubry, Jacopo Bertolotti, Emmanuel Bossy, Dorian Bouchet, Joshua Brake, Sophie Brasselet, Yaron Bromberg, Hui Cao, Thomas Chaigne, Zhongtao Cheng, Wonshik Choi, Tomáš Čižmár, Meng Cui, Vincent R Curtis, Hugo Defienne, Matthias Hofer, Ryoichi Horisaki, Roarke Horstmeyer, Na Ji, Aaron K. LaViolette, Jerome Mertz , et al. (20 additional authors not shown)

Abstract: The last decade has seen the development of a wide set of tools, such as wavefront shaping, computational or fundamental methods, that allow to understand and control light propagation in a complex medium, such as biological tissues or multimode fibers. A vibrant and diverse community is now working on this field, that has revolutionized the prospect of diffraction-limited imaging at depth in tiss… ▽ More The last decade has seen the development of a wide set of tools, such as wavefront shaping, computational or fundamental methods, that allow to understand and control light propagation in a complex medium, such as biological tissues or multimode fibers. A vibrant and diverse community is now working on this field, that has revolutionized the prospect of diffraction-limited imaging at depth in tissues. This roadmap highlights several key aspects of this fast developing field, and some of the challenges and opportunities ahead. △ Less

Submitted 29 November, 2021; originally announced November 2021.

Comments: submitted to J.Phys Photonics (IOP), 116 pages, 23 sections

arXiv:2111.05149 [pdf, other]

Ethically aligned Deep Learning: Unbiased Facial Aesthetic Prediction

Authors: Michael Danner, Thomas Weber, Leping Peng, Tobias Gerlach, Xueping Su, Matthias Rätsch

Abstract: Facial beauty prediction (FBP) aims to develop a machine that automatically makes facial attractiveness assessment. In the past those results were highly correlated with human ratings, therefore also with their bias in annotating. As artificial intelligence can have racist and discriminatory tendencies, the cause of skews in the data must be identified. Development of training data and AI algorith… ▽ More Facial beauty prediction (FBP) aims to develop a machine that automatically makes facial attractiveness assessment. In the past those results were highly correlated with human ratings, therefore also with their bias in annotating. As artificial intelligence can have racist and discriminatory tendencies, the cause of skews in the data must be identified. Development of training data and AI algorithms that are robust against biased information is a new challenge for scientists. As aesthetic judgement usually is biased, we want to take it one step further and propose an Unbiased Convolutional Neural Network for FBP. While it is possible to create network models that can rate attractiveness of faces on a high level, from an ethical point of view, it is equally important to make sure the model is unbiased. In this work, we introduce AestheticNet, a state-of-the-art attractiveness prediction network, which significantly outperforms competitors with a Pearson Correlation of 0.9601. Additionally, we propose a new approach for generating a bias-free CNN to improve fairness in machine learning. △ Less

Submitted 9 November, 2021; originally announced November 2021.

Comments: Peer reviewed and accepted at CEPE/IACAP 2021 as Extended Abstract

arXiv:2111.05026 [pdf, other]

Investigating the variance increase of readout error mitigation through classical bit-flip correction on IBM and Rigetti quantum computers

Authors: Constantia Alexandrou, Lena Funcke, Tobias Hartung, Karl Jansen, Stefan Kühn, Georgios Polykratis, Paolo Stornati, Xiaoyang Wang, Tom Weber

Abstract: Readout errors are among the most dominant errors on current noisy intermediate-scale quantum devices. Recently, an efficient and scaleable method for mitigating such errors has been developed, based on classical bit-flip correction. In this talk, we compare the performance of this method for IBM's and Rigetti's quantum devices, demonstrating how the method improves the noisy measurements of obser… ▽ More Readout errors are among the most dominant errors on current noisy intermediate-scale quantum devices. Recently, an efficient and scaleable method for mitigating such errors has been developed, based on classical bit-flip correction. In this talk, we compare the performance of this method for IBM's and Rigetti's quantum devices, demonstrating how the method improves the noisy measurements of observables obtained on the quantum hardware. Moreover, we examine the variance amplification to the data after applying of our mitigation procedure, which is common to all mitigation strategies. We derive a new expression for the variance of the mitigated Pauli operators in terms of the corrected expectation values and the noisy variances.Our hardware results show good agreement with the theoretical prediction, and we demonstrate that the increase of the variance due to the mitigation procedure is only moderate. △ Less

Submitted 29 November, 2021; v1 submitted 9 November, 2021; originally announced November 2021.

Comments: 11 pages, 5 figures, Proceedings of the 38th International Symposium on Lattice Field Theory, 26th-30th July 2021, Zoom/Gather@Massachusetts Institute of Technology

Report number: MIT-CTP/5351

arXiv:2111.01587 [pdf, other]

Procedural Generalization by Planning with Self-Supervised World Models

Authors: Ankesh Anand, Jacob Walker, Yazhe Li, Eszter Vértes, Julian Schrittwieser, Sherjil Ozair, Théophane Weber, Jessica B. Hamrick

Abstract: One of the key promises of model-based reinforcement learning is the ability to generalize using an internal model of the world to make predictions in novel environments and tasks. However, the generalization ability of model-based agents is not well understood because existing work has focused on model-free agents when benchmarking generalization. Here, we explicitly measure the generalization ab… ▽ More One of the key promises of model-based reinforcement learning is the ability to generalize using an internal model of the world to make predictions in novel environments and tasks. However, the generalization ability of model-based agents is not well understood because existing work has focused on model-free agents when benchmarking generalization. Here, we explicitly measure the generalization ability of model-based agents in comparison to their model-free counterparts. We focus our analysis on MuZero (Schrittwieser et al., 2020), a powerful model-based agent, and evaluate its performance on both procedural and task generalization. We identify three factors of procedural generalization -- planning, self-supervised representation learning, and procedural data diversity -- and show that by combining these techniques, we achieve state-of-the art generalization performance and data efficiency on Procgen (Cobbe et al., 2019). However, we find that these factors do not always provide the same benefits for the task generalization benchmarks in Meta-World (Yu et al., 2019), indicating that transfer remains a challenge and may require different approaches than procedural generalization. Overall, we suggest that building generalizable agents requires moving beyond the single-task, model-free paradigm and towards self-supervised model-based agents that are trained in rich, procedural, multi-task environments. △ Less

Submitted 2 November, 2021; originally announced November 2021.

arXiv:2110.11312 [pdf, other]

Towards modelling hazard factors in unstructured data spaces using gradient-based latent interpolation

Authors: Tobias Weber, Michael Ingrisch, Bernd Bischl, David Rügamer

Abstract: The application of deep learning in survival analysis (SA) allows utilizing unstructured and high-dimensional data types uncommon in traditional survival methods. This allows to advance methods in fields such as digital health, predictive maintenance, and churn analysis, but often yields less interpretable and intuitively understandable models due to the black-box character of deep learning-based… ▽ More The application of deep learning in survival analysis (SA) allows utilizing unstructured and high-dimensional data types uncommon in traditional survival methods. This allows to advance methods in fields such as digital health, predictive maintenance, and churn analysis, but often yields less interpretable and intuitively understandable models due to the black-box character of deep learning-based approaches. We close this gap by proposing 1) a multi-task variational autoencoder (VAE) with survival objective, yielding survival-oriented embeddings, and 2) a novel method HazardWalk that allows to model hazard factors in the original data space. HazardWalk transforms the latent distribution of our autoencoder into areas of maximized/minimized hazard and then uses the decoder to project changes to the original domain. Our procedure is evaluated on a simulated dataset as well as on a dataset of CT imaging data of patients with liver metastases. △ Less

Submitted 17 November, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

Comments: NeurIPS 2021 Workshop, Deep Generative Models and Downstream Applications

arXiv:2110.11303 [pdf, other]

Survival-oriented embeddings for improving accessibility to complex data structures

Authors: Tobias Weber, Michael Ingrisch, Matthias Fabritius, Bernd Bischl, David Rügamer

Abstract: Deep learning excels in the analysis of unstructured data and recent advancements allow to extend these techniques to survival analysis. In the context of clinical radiology, this enables, e.g., to relate unstructured volumetric images to a risk score or a prognosis of life expectancy and support clinical decision making. Medical applications are, however, associated with high criticality and cons… ▽ More Deep learning excels in the analysis of unstructured data and recent advancements allow to extend these techniques to survival analysis. In the context of clinical radiology, this enables, e.g., to relate unstructured volumetric images to a risk score or a prognosis of life expectancy and support clinical decision making. Medical applications are, however, associated with high criticality and consequently, neither medical personnel nor patients do usually accept black box models as reason or basis for decisions. Apart from averseness to new technologies, this is due to missing interpretability, transparency and accountability of many machine learning methods. We propose a hazard-regularized variational autoencoder that supports straightforward interpretation of deep neural architectures in the context of survival analysis, a field highly relevant in healthcare. We apply the proposed approach to abdominal CT scans of patients with liver tumors and their corresponding survival times. △ Less

Submitted 3 November, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

Comments: NeurIPS 2021 Workshop, Bridging the Gap: From Machine Learning Research to Clinical Practice

arXiv:2110.04510 [pdf, ps, other]

Measurement of the $e^{+}e^{-}\toΣ^{0}\barΣ^{0}$ cross sections at center-of-mass energies from $2.3864$ to $3.0200$ GeV

Authors: M. Ablikim, M. N. Achasov, P. Adlarson, S. Ahmed, M. Albrecht, A. Amoroso, Q. An, Anita, Y. Bai, O. Bakina, R. Baldini Ferroli, I. Balossino, Y. Ban, K. Begzsuren, J. V. Bennett, N. Berger, M. Bertani, D. Bettoni, F. Bianchi, J Biernat, J. Bloms, A. Bortone, I. Boyko, R. A. Briere, H. Cai , et al. (467 additional authors not shown)

Abstract: The Born cross sections of $e^{+}e^{-}\to Σ^{0}\barΣ^{0}$ are measured at center-of-mass energies from $2.3864$ to $3.0200$ GeV using data samples with an integrated luminosity of $328.5$ pb$^{-1}$ collected with the BESIII detector operating at the BEPCII collider. The analysis makes use of a novel reconstruction method for energies near production threshold, while a single-tag method is employed… ▽ More The Born cross sections of $e^{+}e^{-}\to Σ^{0}\barΣ^{0}$ are measured at center-of-mass energies from $2.3864$ to $3.0200$ GeV using data samples with an integrated luminosity of $328.5$ pb$^{-1}$ collected with the BESIII detector operating at the BEPCII collider. The analysis makes use of a novel reconstruction method for energies near production threshold, while a single-tag method is employed at other center-of-mass energies. The measured cross sections are consistent with earlier results from BaBar, with a substantially improved precision. The cross-section lineshape can be well described by a perturbative QCD-driven energy function. In addition, the effective form factors of the $Σ^{0}$ baryon are determined. The results provide precise experimental input for testing various theoretical predictions. △ Less

Submitted 12 October, 2021; v1 submitted 9 October, 2021; originally announced October 2021.

Comments: 12 pages, 10 figures, Journal (Phys. Lett. B)

arXiv:2110.03481 [pdf, ps, other]

Differential Calculi on Quantum Principal Bundles over Projective Bases

Authors: P. Aschieri, R. Fioresi, E. Latini, T. Weber

Abstract: We propose a sheaf-theoretic approach to the theory of differential calculi on quantum principal bundles over non-affine bases. After recalling the affine case we define differential calculi on sheaves of comodule algebras as sheaves of covariant bimodules together with a morphism of sheaves -- the differential -- such that the Leibniz rule and surjectivity hold locally. The main class of examples… ▽ More We propose a sheaf-theoretic approach to the theory of differential calculi on quantum principal bundles over non-affine bases. After recalling the affine case we define differential calculi on sheaves of comodule algebras as sheaves of covariant bimodules together with a morphism of sheaves -- the differential -- such that the Leibniz rule and surjectivity hold locally. The main class of examples is given by covariant calculi over quantum flag manifolds, which we provide via an explicit Ore extension construction. In a second step we introduce principal covariant calculi by requiring a local compatibility of the calculi on the total sheaf, base sheaf and the structure Hopf algebra in terms of exact sequences. In this case Hopf--Galois extensions of algebras lift to Hopf--Galois extensions of exterior algebras with compatible differentials. In particular, the examples of principal (covariant) calculi on the quantum principal bundles $\mathcal{O}_q(\mathrm{SL}_2(\mathbb{C}))$ and $\mathcal{O}_q(\mathrm{GL}_2(\mathbb{C}))$ over the projective space $\mathrm{P}^1(\mathbb{C})$ are discussed in detail. △ Less

Submitted 6 February, 2023; v1 submitted 7 October, 2021; originally announced October 2021.

Comments: Fully revisited version. Proven general theorem characterizing base forms as intersection of coinvariant and horizontal forms (in the affine and projective setting). Added section on exact sequence of calculi characterizing graded Hopf--Galois extensions with compatible differential. New and improved examples

arXiv:2106.09014 [pdf, other]

doi 10.1103/PhysRevResearch.3.033191

Non-Equilibrium Dynamics in Two-Color, Few-Photon Dissociative Excitation and Ionization of D$_2$

Authors: D. S. Slaughter, F. P. Sturm, R. Y. Bello, K. A. Larsen, N. Shivaram, C. W. McCurdy, R. R. Lucchese, L. Martin, C. W. Hogle, M. M. Murnane, H. C. Kapteyn, P. Ranitovic, Th. Weber

Abstract: D$_2$ molecules, excited by linearly cross-polarized femtosecond extreme ultraviolet (XUV) and near-infrared (NIR) light pulses, reveal highly structured D$^+$ ion fragment momenta and angular distributions that originate from two different 4-step dissociative ionization pathways after four photon absorption (1 XUV + 3 NIR). We show that, even for very low dissociation kinetic energy release… ▽ More D$_2$ molecules, excited by linearly cross-polarized femtosecond extreme ultraviolet (XUV) and near-infrared (NIR) light pulses, reveal highly structured D$^+$ ion fragment momenta and angular distributions that originate from two different 4-step dissociative ionization pathways after four photon absorption (1 XUV + 3 NIR). We show that, even for very low dissociation kinetic energy release $\le$~240~meV, specific electronic excitation pathways can be identified and isolated in the final ion momentum distributions. With the aid of {\it ab initio} electronic structure and time-dependent Schrödinger equation calculations, angular momentum, energy, and parity conservation are used to identify the excited neutral molecular states and molecular orientations relative to the polarization vectors in these different photoexcitation and dissociation sequences of the neutral D$_2$ molecule and its D$_2^+$ cation. In one sequential photodissociation pathway, molecules aligned along either of the two light polarization vectors are excluded, while another pathway selects molecules aligned parallel to the light propagation direction. The evolution of the nuclear wave packet on the intermediate \Bstate electronic state of the neutral D$_2$ molecule is also probed in real time. △ Less

Submitted 16 June, 2021; originally announced June 2021.

Comments: 11 pages including 6 figures

Journal ref: Phys. Rev. Research 3, 033191 (2021)

arXiv:2105.03354 [pdf]

The future of human-AI collaboration: a taxonomy of design knowledge for hybrid intelligence systems

Authors: Dominik Dellermann, Adrian Calma, Nikolaus Lipusch, Thorsten Weber, Sascha Weigel, Philipp Ebel

Abstract: Recent technological advances, especially in the field of machine learning, provide astonishing progress on the road towards artificial general intelligence. However, tasks in current real-world business applications cannot yet be solved by machines alone. We, therefore, identify the need for developing socio-technological ensembles of humans and machines. Such systems possess the ability to accom… ▽ More Recent technological advances, especially in the field of machine learning, provide astonishing progress on the road towards artificial general intelligence. However, tasks in current real-world business applications cannot yet be solved by machines alone. We, therefore, identify the need for developing socio-technological ensembles of humans and machines. Such systems possess the ability to accomplish complex goals by combining human and artificial intelligence to collectively achieve superior results and continuously improve by learning from each other. Thus, the need for structured design knowledge for those systems arises. Following a taxonomy development method, this article provides three main contributions: First, we present a structured overview of interdisciplinary research on the role of humans in the machine learning pipeline. Second, we envision hybrid intelligence systems and conceptualize the relevant dimensions for system design for the first time. Finally, we offer useful guidance for system developers during the implementation of such applications. △ Less

Submitted 7 May, 2021; originally announced May 2021.

arXiv:2104.07320 [pdf, other]

doi 10.1109/ICSA-C52384.2021.00026

Modelling for Quantum Error Mitigation

Authors: Tom Weber, Matthias Riebisch, Kerstin Borras, Karl Jansen, Dirk Krücker

Abstract: While we expect quantum computers to surpass their classical counterparts in the future, current devices are prone to high error rates and techniques to minimise the impact of these errors are indispensable. There already exists a variety of error mitigation methods addressing this quantum noise that differ in effectiveness, and scalability. But for a more systematic and comprehensible approach we… ▽ More While we expect quantum computers to surpass their classical counterparts in the future, current devices are prone to high error rates and techniques to minimise the impact of these errors are indispensable. There already exists a variety of error mitigation methods addressing this quantum noise that differ in effectiveness, and scalability. But for a more systematic and comprehensible approach we propose the introduction of modelling, in particular for representing cause-effect relations as well as for evaluating methods or combinations thereof with respect to a selection of relevant criteria. △ Less

Submitted 15 April, 2021; originally announced April 2021.

arXiv:2104.06159 [pdf, other]

Muesli: Combining Improvements in Policy Optimization

Authors: Matteo Hessel, Ivo Danihelka, Fabio Viola, Arthur Guez, Simon Schmitt, Laurent Sifre, Theophane Weber, David Silver, Hado van Hasselt

Abstract: We propose a novel policy update that combines regularized policy optimization with model learning as an auxiliary loss. The update (henceforth Muesli) matches MuZero's state-of-the-art performance on Atari. Notably, Muesli does so without using deep search: it acts directly with a policy network and has computation speed comparable to model-free baselines. The Atari results are complemented by ex… ▽ More We propose a novel policy update that combines regularized policy optimization with model learning as an auxiliary loss. The update (henceforth Muesli) matches MuZero's state-of-the-art performance on Atari. Notably, Muesli does so without using deep search: it acts directly with a policy network and has computation speed comparable to model-free baselines. The Atari results are complemented by extensive ablations, and by additional results on continuous control and 9x9 Go. △ Less

Submitted 31 March, 2022; v1 submitted 13 April, 2021; originally announced April 2021.

arXiv:2103.15098 [pdf, other]

doi 10.1007/JHEP06(2021)181

Amplitude analysis and branching-fraction measurement of \boldmath $D_{s}^{+} \to K^0_{S}π^{+}π^{0}$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, S. Ahmed, M. Albrecht, A. Amoroso, Q. An, X. H. Bai, Y. Bai, O. Bakina, R. Baldini Ferroli, I. Balossino, Y. Ban, K. Begzsuren, J. V. Bennett, N. Berger, M. Bertani, D. Bettoni, F. Bianchi, J Biernat, J. Bloms, A. Bortone, I. Boyko, R. A. Briere , et al. (468 additional authors not shown)

Abstract: Utilizing a data set corresponding to an integrated luminosity of 6.32~$\rm fb^{-1}$, recorded by the BESIII detector at center-of-mass energies between 4.178 and 4.226~GeV, we perform an amplitude analysis of the decay $D_{s}^{+} \to K_{S}^{0}π^{+}π^{0}$ and determine the relative fractions and phase differences of different intermediate processes, which include $K_{S}^{0}ρ(770)^{+}$,… ▽ More Utilizing a data set corresponding to an integrated luminosity of 6.32~$\rm fb^{-1}$, recorded by the BESIII detector at center-of-mass energies between 4.178 and 4.226~GeV, we perform an amplitude analysis of the decay $D_{s}^{+} \to K_{S}^{0}π^{+}π^{0}$ and determine the relative fractions and phase differences of different intermediate processes, which include $K_{S}^{0}ρ(770)^{+}$, $K_{S}^{0}ρ(1450)^{+}$, $K^{*}(892)^{0}π^{+}$, $K^{*}(892)^{+}π^{0}$, and $K^{*}(1410)^{0}π^{+}$. Using a double-tag technique, and making an efficiency correction that relies on our knowledge of the phase-space distribution of the decays coming from the amplitude analysis, the absolute branching fraction is measured to be $\mathcal{B}(D_{s}^{+} \to K_{S}^{0}π^{+}π^{0})=(5.43\pm0.30_{\text{stat}}\pm 0.15_{\text{syst}})\times 10^{-3}$. △ Less

Submitted 28 March, 2021; originally announced March 2021.

arXiv:2103.10384 [pdf, other]

doi 10.1029/2020JA028130

The Influence of Magnetic Field Topology and Orientation on the Distribution of Thermal Electrons in the Martian Magnetotail

Authors: Murti Nauth, Christopher M. Fowler, Laila Andersson, Gina A. DiBraccio, Shaosui Xu, Tristan Weber, David Mitchell

Abstract: Thermal (<1 eV) electron density measurements, derived from the Mars Atmosphere and Volatile Evolution's (MAVEN) Langmuir Probe and Waves (LPW) instrument, are analyzed to produce the first statistical study of the thermal electron population in the Martian magnetotail. Coincident measurements of the local magnetic field are used to demonstrate that close to Mars, the thermal electron population i… ▽ More Thermal (<1 eV) electron density measurements, derived from the Mars Atmosphere and Volatile Evolution's (MAVEN) Langmuir Probe and Waves (LPW) instrument, are analyzed to produce the first statistical study of the thermal electron population in the Martian magnetotail. Coincident measurements of the local magnetic field are used to demonstrate that close to Mars, the thermal electron population is most likely to be observed at a cylindrical distance of ~1.1 Mars radii (RM) from the central tail region during times when the magnetic field flares inward toward the central tail, compared to ~1.3 RM during times when the magnetic field flares outward away from the central tail. Similar patterns are observed further down the magnetotail with greater variability. Thermal electron densities are highly variable throughout the magnetotail; average densities are typically ~20-50 /cc within the optical shadow of Mars and can peak at ~100 /cc just outside of the optical shadow. Standard deviations of 100% are observed for average densities measured throughout the tail. Analysis of the local magnetic field topology suggests that thermal electrons observed within the optical shadow of Mars are likely sourced from the nightside ionosphere, whereas electrons observed just outside of the optical shadow are likely sourced from the dayside ionosphere. Finally, thermal electrons within the optical shadow of Mars are up to 20% more likely to be observed when the strongest crustal magnetic fields point sunward than when they point tailward. △ Less

Submitted 18 March, 2021; originally announced March 2021.

Comments: 10 pages, 7 figures

Journal ref: Journal of Geophysical Research: Space Physics, 126, e2020JA028130 (2021)

arXiv:2103.07949 [pdf, other]

doi 10.1063/5.0048071

Ultrasound differential phase contrast using backscattering and the memory effect

Authors: Timothy D. Weber, Nikunj Khetan, Ruohui Yang, Jerome Mertz

Abstract: We describe a simple and fast technique to perform ultrasound differential phase contrast (DPC) imaging in arbitrarily thick scattering media. Though configured in a reflection geometry, DPC is based on transmission imaging and is a direct analogue of optical differential interference contrast (DIC). DPC exploits the memory effect and works in combination with standard pulse-echo imaging, with no… ▽ More We describe a simple and fast technique to perform ultrasound differential phase contrast (DPC) imaging in arbitrarily thick scattering media. Though configured in a reflection geometry, DPC is based on transmission imaging and is a direct analogue of optical differential interference contrast (DIC). DPC exploits the memory effect and works in combination with standard pulse-echo imaging, with no additional hardware or data requirements, enabling complementary phase contrast (in the transverse direction) without any need for intensive numerical computation. We experimentally demonstrate the principle of DPC using tissue phantoms with calibrated speed-of-sound inclusions. △ Less

Submitted 14 March, 2021; originally announced March 2021.

Comments: 5 pages, 5 figures. Accepted for publication in Applied Physics Letters

arXiv:2102.12425 [pdf, other]

Synthetic Returns for Long-Term Credit Assignment

Authors: David Raposo, Sam Ritter, Adam Santoro, Greg Wayne, Theophane Weber, Matt Botvinick, Hado van Hasselt, Francis Song

Abstract: Since the earliest days of reinforcement learning, the workhorse method for assigning credit to actions over time has been temporal-difference (TD) learning, which propagates credit backward timestep-by-timestep. This approach suffers when delays between actions and rewards are long and when intervening unrelated events contribute variance to long-term returns. We propose state-associative (SA) le… ▽ More Since the earliest days of reinforcement learning, the workhorse method for assigning credit to actions over time has been temporal-difference (TD) learning, which propagates credit backward timestep-by-timestep. This approach suffers when delays between actions and rewards are long and when intervening unrelated events contribute variance to long-term returns. We propose state-associative (SA) learning, where the agent learns associations between states and arbitrarily distant future rewards, then propagates credit directly between the two. In this work, we use SA-learning to model the contribution of past states to the current reward. With this model we can predict each state's contribution to the far future, a quantity we call "synthetic returns". TD-learning can then be applied to select actions that maximize these synthetic returns (SRs). We demonstrate the effectiveness of augmenting agents with SRs across a range of tasks on which TD-learning alone fails. We show that the learned SRs are interpretable: they spike for states that occur after critical actions are taken. Finally, we show that our IMPALA-based SR agent solves Atari Skiing -- a game with a lengthy reward delay that posed a major hurdle to deep-RL agents -- 25 times faster than the published state-of-the-art. △ Less

Submitted 24 February, 2021; originally announced February 2021.

arXiv:2102.09312 [pdf, other]

doi 10.1016/j.jvcir.2020.102823

Hierarchical Learning Using Deep Optimum-Path Forest

Authors: Luis C. S. Afonso, Clayton R. Pereira, Silke A. T. Weber, Christian Hook, Alexandre X. Falcão, João P. Papa

Abstract: Bag-of-Visual Words (BoVW) and deep learning techniques have been widely used in several domains, which include computer-assisted medical diagnoses. In this work, we are interested in developing tools for the automatic identification of Parkinson's disease using machine learning and the concept of BoVW. The proposed approach concerns a hierarchical-based learning technique to design visual diction… ▽ More Bag-of-Visual Words (BoVW) and deep learning techniques have been widely used in several domains, which include computer-assisted medical diagnoses. In this work, we are interested in developing tools for the automatic identification of Parkinson's disease using machine learning and the concept of BoVW. The proposed approach concerns a hierarchical-based learning technique to design visual dictionaries through the Deep Optimum-Path Forest classifier. The proposed method was evaluated in six datasets derived from data collected from individuals when performing handwriting exams. Experimental results showed the potential of the technique, with robust achievements. △ Less

Submitted 18 February, 2021; originally announced February 2021.

arXiv:2102.02274 [pdf, other]

Neural Recursive Belief States in Multi-Agent Reinforcement Learning

Authors: Pol Moreno, Edward Hughes, Kevin R. McKee, Bernardo Avila Pires, Théophane Weber

Abstract: In multi-agent reinforcement learning, the problem of learning to act is particularly difficult because the policies of co-players may be heavily conditioned on information only observed by them. On the other hand, humans readily form beliefs about the knowledge possessed by their peers and leverage beliefs to inform decision-making. Such abilities underlie individual success in a wide range of Ma… ▽ More In multi-agent reinforcement learning, the problem of learning to act is particularly difficult because the policies of co-players may be heavily conditioned on information only observed by them. On the other hand, humans readily form beliefs about the knowledge possessed by their peers and leverage beliefs to inform decision-making. Such abilities underlie individual success in a wide range of Markov games, from bluffing in Poker to conditional cooperation in the Prisoner's Dilemma, to convention-building in Bridge. Classical methods are usually not applicable to complex domains due to the intractable nature of hierarchical beliefs (i.e. beliefs of other agents' beliefs). We propose a scalable method to approximate these belief structures using recursive deep generative models, and to use the belief models to obtain representations useful to acting in complex tasks. Our agents trained with belief models outperform model-free baselines with equivalent representational capacity using common training paradigms. We also show that higher-order belief models outperform agents with lower-order models. △ Less

Submitted 3 February, 2021; originally announced February 2021.

arXiv:2101.05093 [pdf]

doi 10.1177/00333549211026817

Protecting Privacy and Transforming COVID-19 Case Surveillance Datasets for Public Use

Authors: Brian Lee, Brandi Dupervil, Nicholas P. Deputy, Wil Duck, Stephen Soroka, Lyndsay Bottichio, Benjamin Silk, Jason Price, Patricia Sweeney, Jennifer Fuld, Todd Weber, Dan Pollock

Abstract: Objectives: Federal open data initiatives that promote increased sharing of federally collected data are important for transparency, data quality, trust, and relationships with the public and state, tribal, local, and territorial (STLT) partners. These initiatives advance understanding of health conditions and diseases by providing data to more researchers, scientists, and policymakers for analysi… ▽ More Objectives: Federal open data initiatives that promote increased sharing of federally collected data are important for transparency, data quality, trust, and relationships with the public and state, tribal, local, and territorial (STLT) partners. These initiatives advance understanding of health conditions and diseases by providing data to more researchers, scientists, and policymakers for analysis, collaboration, and valuable use outside CDC responders. This is particularly true for emerging conditions such as COVID-19 where we have much to learn and have evolving data needs. Since the beginning of the outbreak, CDC has collected person-level, de-identified data from jurisdictions and currently has over 8 million records, increasing each day. This paper describes how CDC designed and produces two de-identified public datasets from these collected data. Materials and Methods: Data elements were included based on the usefulness, public request, and privacy implications; specific field values were suppressed to reduce risk of reidentification and exposure of confidential information. Datasets were created and verified for privacy and confidentiality using data management platform analytic tools as well as R scripts. Results: Unrestricted data are available to the public through Data.CDC.gov and restricted data, with additional fields, are available with a data use agreement through a private repository on GitHub.com. Practice Implications: Enriched understanding of the available public data, the methods used to create these data, and the algorithms used to protect privacy of de-identified individuals allow for improved data use. Automating data generation procedures allows greater and more timely sharing of data. △ Less

Submitted 13 January, 2021; originally announced January 2021.

Comments: 19 pages, 4 figures, 1 table, 5 supplements

arXiv:2101.03807 [pdf, ps, other]

doi 10.4204/EPTCS.332.1

Mechanisation of Model-theoretic Conservative Extension for HOL with Ad-hoc Overloading

Authors: Arve Gengelbach, Johannes Åman Pohjola, Tjark Weber

Abstract: Definitions of new symbols merely abbreviate expressions in logical frameworks, and no new facts (regarding previously defined symbols) should hold because of a new definition. In Isabelle/HOL, definable symbols are types and constants. The latter may be ad-hoc overloaded, i.e. have different definitions for non-overlapping types. We prove that symbols that are independent of a new definition may… ▽ More Definitions of new symbols merely abbreviate expressions in logical frameworks, and no new facts (regarding previously defined symbols) should hold because of a new definition. In Isabelle/HOL, definable symbols are types and constants. The latter may be ad-hoc overloaded, i.e. have different definitions for non-overlapping types. We prove that symbols that are independent of a new definition may keep their interpretation in a model extension. This work revises our earlier notion of model-theoretic conservative extension and generalises an earlier model construction. We obtain consistency of theories of definitions in higher-order logic (HOL) with ad-hoc overloading as a corollary. Our results are mechanised in the HOL4 theorem prover. △ Less

Submitted 11 January, 2021; originally announced January 2021.

Comments: In Proceedings LFMTP 2020, arXiv:2101.02835

ACM Class: F.3.1; F.3.2

Journal ref: EPTCS 332, 2021, pp. 1-17

arXiv:2012.09518 [pdf, other]

doi 10.1088/1748-0221/16/03/p03022

The upgrade of the ALICE TPC with GEMs and continuous readout

Authors: J. Adolfsson, M. Ahmed, S. Aiola, J. Alme, T. Alt, W. Amend, F. Anastasopoulos, C. Andrei, M. Angelsmark, V. Anguelov, A. Anjam, H. Appelshäuser, V. Aprodu, O. Arnold, M. Arslandok, D. Baitinger, M. Ball, G. G. Barnaföldi, E. Bartsch, P. Becht, R. Bellwied, A. Berdnikova, M. Berger, N. Bialas, P. Bialas , et al. (210 additional authors not shown)

Abstract: The upgrade of the ALICE TPC will allow the experiment to cope with the high interaction rates foreseen for the forthcoming Run 3 and Run 4 at the CERN LHC. In this article, we describe the design of new readout chambers and front-end electronics, which are driven by the goals of the experiment. Gas Electron Multiplier (GEM) detectors arranged in stacks containing four GEMs each, and continuous re… ▽ More The upgrade of the ALICE TPC will allow the experiment to cope with the high interaction rates foreseen for the forthcoming Run 3 and Run 4 at the CERN LHC. In this article, we describe the design of new readout chambers and front-end electronics, which are driven by the goals of the experiment. Gas Electron Multiplier (GEM) detectors arranged in stacks containing four GEMs each, and continuous readout electronics based on the SAMPA chip, an ALICE development, are replacing the previous elements. The construction of these new elements, together with their associated quality control procedures, is explained in detail. Finally, the readout chamber and front-end electronics cards replacement, together with the commissioning of the detector prior to installation in the experimental cavern, are presented. After a nine-year period of R&D, construction, and assembly, the upgrade of the TPC was completed in 2020. △ Less

Submitted 25 March, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

Comments: 88 pages, 60 figures

Journal ref: JINST 16 (2021) P03022

arXiv:2012.07969 [pdf, other]

A case for new neural network smoothness constraints

Authors: Mihaela Rosca, Theophane Weber, Arthur Gretton, Shakir Mohamed

Abstract: How sensitive should machine learning models be to input changes? We tackle the question of model smoothness and show that it is a useful inductive bias which aids generalization, adversarial robustness, generative modeling and reinforcement learning. We explore current methods of imposing smoothness constraints and observe they lack the flexibility to adapt to new tasks, they don't account for da… ▽ More How sensitive should machine learning models be to input changes? We tackle the question of model smoothness and show that it is a useful inductive bias which aids generalization, adversarial robustness, generative modeling and reinforcement learning. We explore current methods of imposing smoothness constraints and observe they lack the flexibility to adapt to new tasks, they don't account for data modalities, they interact with losses, architectures and optimization in ways not yet fully understood. We conclude that new advances in the field are hinging on finding ways to incorporate data, tasks and learning into our definitions of smoothness. △ Less

Submitted 7 July, 2021; v1 submitted 14 December, 2020; originally announced December 2020.

arXiv:2012.04545 [pdf, other]

doi 10.3389/fcomp.2021.672867

Discovering key topics from short, real-world medical inquiries via natural language processing and unsupervised learning

Authors: Angelo Ziletti, Christoph Berns, Oliver Treichel, Thomas Weber, Jennifer Liang, Stephanie Kammerath, Marion Schwaerzler, Jagatheswari Virayah, David Ruau, Xin Ma, Andreas Mattern

Abstract: Millions of unsolicited medical inquiries are received by pharmaceutical companies every year. It has been hypothesized that these inquiries represent a treasure trove of information, potentially giving insight into matters regarding medicinal products and the associated medical treatments. However, due to the large volume and specialized nature of the inquiries, it is difficult to perform timely,… ▽ More Millions of unsolicited medical inquiries are received by pharmaceutical companies every year. It has been hypothesized that these inquiries represent a treasure trove of information, potentially giving insight into matters regarding medicinal products and the associated medical treatments. However, due to the large volume and specialized nature of the inquiries, it is difficult to perform timely, recurrent, and comprehensive analyses. Here, we propose a machine learning approach based on natural language processing and unsupervised learning to automatically discover key topics in real-world medical inquiries from customers. This approach does not require ontologies nor annotations. The discovered topics are meaningful and medically relevant, as judged by medical information specialists, thus demonstrating that unsolicited medical inquiries are a source of valuable customer insights. Our work paves the way for the machine-learning-driven analysis of medical inquiries in the pharmaceutical industry, which ultimately aims at improving patient care. △ Less

Submitted 8 December, 2020; originally announced December 2020.

Journal ref: Front. Comput. Sci 88 (3) (2021)

arXiv:2012.04186 [pdf, ps, other]

doi 10.1103/PhysRevLett.127.082002

Observation of di-structures in $e^+e^-\rightarrow{J}/ψ{\rm X}$ at center-of-mass energies around 3.773 GeV

Authors: M. Ablikim, M. N. Achasov, P. Adlarson, S. Ahmed, M. Albrecht, A. Amoroso, Q. An, Anita, Y. Bai, O. Bakina, R. Baldini Ferroli, I. Balossino, Y. Ban, K. Begzsuren, J. V. Bennett, N. Berger, M. Bertani, D. Bettoni, F. Bianchi, J Biernat, J. Bloms, A. Bortone, I. Boyko, R. A. Briere, H. Cai , et al. (460 additional authors not shown)

Abstract: We report a measurement of the observed cross sections of the inclusive $J/ψ$ production in $e^+e^-\rightarrow {J}/ψ{\rm X}$ based on 3.21 fb$^{-1}$ of data accumulated at energies from 3.645 to 3.891 GeV with the BESIII detector operated at the BEPCII collider. The energy-dependent lineshape obtained from the measured cross sections cannot be well described by two Breit-Wigner (BW) amplitudes of… ▽ More We report a measurement of the observed cross sections of the inclusive $J/ψ$ production in $e^+e^-\rightarrow {J}/ψ{\rm X}$ based on 3.21 fb$^{-1}$ of data accumulated at energies from 3.645 to 3.891 GeV with the BESIII detector operated at the BEPCII collider. The energy-dependent lineshape obtained from the measured cross sections cannot be well described by two Breit-Wigner (BW) amplitudes of the expected decays $ψ(3686)\rightarrow {J}/ψ{\rm X}$ and $ψ(3770)\rightarrow {J}/ψ{\rm X}$. Instead it can be better described with three BW amplitudes of the decays $ψ(3686)\rightarrow {J}/ψ{\rm X}$, $R(3760)\rightarrow {J}/ψ{\rm X}$ and $R(3790)\rightarrow {J}/ψ{\rm X}$ with two distinct structures referred to as $R(3760)$ and $R(3790)$. Under this assumption, we extracted their masses, total widths, and the product of the leptonic width and decay branching fractions to be $M_{R(3760)}= {3761.7\pm 2.2 \pm 1.2}$ MeV/$c^2$, $Γ^{\rm tot}_{R(3760)}= {6.7\pm 11.1 \pm 1.1}$ MeV, $Γ^{ee}_{R(3760)}\mathcal B[R(3760)\rightarrow {J}/ψ{\rm X}]=(4.0\pm 4.3\pm 1.2)$ eV, $M_{R(3790)} = {3784.7\pm 5.7 \pm 1.6}$ MeV/$c^2$, $Γ^{\rm tot}_{R(3790)} = {31.6 \pm 11.9 \pm 3.2}$ MeV, $Γ^{ee}_{R(3790)}\mathcal B[R(3790)\rightarrow {J}/ψ{\rm X}]=(18.1\pm 10.3\pm 4.7)$ eV, where the first uncertainties are statistical and second systematic. △ Less

Submitted 7 December, 2020; originally announced December 2020.

Comments: 8 pages, 3 figures

Journal ref: Phys. Rev. Lett. 127, 082002 (2021)

arXiv:2011.13850 [pdf, other]

doi 10.1103/PhysRevD.103.032004

Search for the reaction channel $e^+e^- \rightarrow η_cηπ^+π^-$ at center-of-mass energies from 4.23 to 4.60 GeV

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, S. Ahmed, M. Albrecht, M. Alekseev, A. Amoroso, Q. An, Y. Bai, O. Bakina, R. Baldini Ferroli, I. Balossino, Y. Ban, K. Begzsuren, J. V. Bennett, N. Berger, M. Bertani, D. Bettoni, F. Bianchi, J Biernat, J. Bloms, I. Boyko, R. A. Briere, H. Cai , et al. (451 additional authors not shown)

Abstract: Using data collected with the BESIII detector operating at the Beijing Electron Positron Collider, we search for the process $e^+e^-\rightarrow η_cηπ^+π^-$. The search is performed using five large data sets recorded at center-of-mass energies of 4.23, 4.26, 4.36, 4.42, and 4.60 GeV. The $η_c$ meson is reconstructed in 16 exclusive decay modes. No signal is observed in the $η_c$ mass region at any… ▽ More Using data collected with the BESIII detector operating at the Beijing Electron Positron Collider, we search for the process $e^+e^-\rightarrow η_cηπ^+π^-$. The search is performed using five large data sets recorded at center-of-mass energies of 4.23, 4.26, 4.36, 4.42, and 4.60 GeV. The $η_c$ meson is reconstructed in 16 exclusive decay modes. No signal is observed in the $η_c$ mass region at any center-of-mass energy. The upper limits on the reaction cross sections are determined to be 6.2, 10.8, 27.6, 22.6 and 23.7 pb at the 90% confidence level at the center-of-mass energies listed above. △ Less

Submitted 15 March, 2021; v1 submitted 27 November, 2020; originally announced November 2020.

Journal ref: Phys. Rev. D 103, 032004 (2021)

arXiv:2011.13438 [pdf, other]

doi 10.1103/PhysRevResearch.3.013082

Investigating resonant low-energy electron attachment to formamide: dynamics of model peptide bond dissociation and other fragmentation channels

Authors: Guglielmo Panelli, Ali Moradmand, Brandon Griffin, Kyle Swanson, Thorsten Weber, Thomas N. Rescigno, C. William McCurdy, Daniel S. Slaughter, Joshua B. Williams

Abstract: We report experimental results on three-dimensional momentum imaging measurements of anions generated via dissociative electron attachment to gaseous formamide. From the momentum images, we analyze the angular and kinetic energy distributions for NH$_2^{-}$, O$^{-}$, and H$^{-}$ fragments and discuss the possible electron attachment and dissociation mechanisms for multiple resonances for two range… ▽ More We report experimental results on three-dimensional momentum imaging measurements of anions generated via dissociative electron attachment to gaseous formamide. From the momentum images, we analyze the angular and kinetic energy distributions for NH$_2^{-}$, O$^{-}$, and H$^{-}$ fragments and discuss the possible electron attachment and dissociation mechanisms for multiple resonances for two ranges of incident electron energies, from 5.3~eV to 6.8~eV, and from 10.0~eV to 11.5~eV. {\it Ab initio} theoretical results for the angular distributions of the NH$_2^{-}$ anion for $\sim$6~eV incident electrons, when compared with the experimental results, strongly suggest that one of the two resonances producing this fragment is a $^2$A$''$ Feshbach resonance. △ Less

Submitted 26 November, 2020; originally announced November 2020.

Comments: 10 pages, 6 figures

Journal ref: Phys. Rev. Research 3, 013082 (2021)

arXiv:2011.09464 [pdf, other]

Counterfactual Credit Assignment in Model-Free Reinforcement Learning

Authors: Thomas Mesnard, Théophane Weber, Fabio Viola, Shantanu Thakoor, Alaa Saade, Anna Harutyunyan, Will Dabney, Tom Stepleton, Nicolas Heess, Arthur Guez, Éric Moulines, Marcus Hutter, Lars Buesing, Rémi Munos

Abstract: Credit assignment in reinforcement learning is the problem of measuring an action's influence on future rewards. In particular, this requires separating skill from luck, i.e. disentangling the effect of an action on rewards from that of external factors and subsequent actions. To achieve this, we adapt the notion of counterfactuals from causality theory to a model-free RL setup. The key idea is to… ▽ More Credit assignment in reinforcement learning is the problem of measuring an action's influence on future rewards. In particular, this requires separating skill from luck, i.e. disentangling the effect of an action on rewards from that of external factors and subsequent actions. To achieve this, we adapt the notion of counterfactuals from causality theory to a model-free RL setup. The key idea is to condition value functions on future events, by learning to extract relevant information from a trajectory. We formulate a family of policy gradient algorithms that use these future-conditional value functions as baselines or critics, and show that they are provably low variance. To avoid the potential bias from conditioning on future information, we constrain the hindsight information to not contain information about the agent's actions. We demonstrate the efficacy and validity of our algorithm on a number of illustrative and challenging problems. △ Less

Submitted 14 December, 2021; v1 submitted 18 November, 2020; originally announced November 2020.

arXiv:2011.04021 [pdf, other]

On the role of planning in model-based deep reinforcement learning

Authors: Jessica B. Hamrick, Abram L. Friesen, Feryal Behbahani, Arthur Guez, Fabio Viola, Sims Witherspoon, Thomas Anthony, Lars Buesing, Petar Veličković, Théophane Weber

Abstract: Model-based planning is often thought to be necessary for deep, careful reasoning and generalization in artificial agents. While recent successes of model-based reinforcement learning (MBRL) with deep function approximation have strengthened this hypothesis, the resulting diversity of model-based methods has also made it difficult to track which components drive success and why. In this paper, we… ▽ More Model-based planning is often thought to be necessary for deep, careful reasoning and generalization in artificial agents. While recent successes of model-based reinforcement learning (MBRL) with deep function approximation have strengthened this hypothesis, the resulting diversity of model-based methods has also made it difficult to track which components drive success and why. In this paper, we seek to disentangle the contributions of recent methods by focusing on three questions: (1) How does planning benefit MBRL agents? (2) Within planning, what choices drive performance? (3) To what extent does planning improve generalization? To answer these questions, we study the performance of MuZero (Schrittwieser et al., 2019), a state-of-the-art MBRL algorithm with strong connections and overlapping components with many other MBRL algorithms. We perform a number of interventions and ablations of MuZero across a wide range of environments, including control tasks, Atari, and 9x9 Go. Our results suggest the following: (1) Planning is most useful in the learning process, both for policy updates and for providing a more useful data distribution. (2) Using shallow trees with simple Monte-Carlo rollouts is as performant as more complex methods, except in the most difficult reasoning tasks. (3) Planning alone is insufficient to drive strong generalization. These results indicate where and how to utilize planning in reinforcement learning settings, and highlight a number of open questions for future MBRL research. △ Less

Submitted 17 March, 2021; v1 submitted 8 November, 2020; originally announced November 2020.

Comments: Published at ICLR 2021

arXiv:2010.11793 [pdf, other]

Metapath- and Entity-aware Graph Neural Network for Recommendation

Authors: Muhammad Umer Anwaar, Zhiwei Han, Shyam Arumugaswamy, Rayyan Ahmad Khan, Thomas Weber, Tianming Qiu, Hao Shen, Yuanting Liu, Martin Kleinsteuber

Abstract: In graph neural networks (GNNs), message passing iteratively aggregates nodes' information from their direct neighbors while neglecting the sequential nature of multi-hop node connections. Such sequential node connections e.g., metapaths, capture critical insights for downstream tasks. Concretely, in recommender systems (RSs), disregarding these insights leads to inadequate distillation of collabo… ▽ More In graph neural networks (GNNs), message passing iteratively aggregates nodes' information from their direct neighbors while neglecting the sequential nature of multi-hop node connections. Such sequential node connections e.g., metapaths, capture critical insights for downstream tasks. Concretely, in recommender systems (RSs), disregarding these insights leads to inadequate distillation of collaborative signals. In this paper, we employ collaborative subgraphs (CSGs) and metapaths to form metapath-aware subgraphs, which explicitly capture sequential semantics in graph structures. We propose meta\textbf{P}ath and \textbf{E}ntity-\textbf{A}ware \textbf{G}raph \textbf{N}eural \textbf{N}etwork (PEAGNN), which trains multilayer GNNs to perform metapath-aware information aggregation on such subgraphs. This aggregated information from different metapaths is then fused using attention mechanism. Finally, PEAGNN gives us the representations for node and subgraph, which can be used to train MLP for predicting score for target user-item pairs. To leverage the local structure of CSGs, we present entity-awareness that acts as a contrastive regularizer on node embedding. Moreover, PEAGNN can be combined with prominent layers such as GAT, GCN and GraphSage. Our empirical evaluation shows that our proposed technique outperforms competitive baselines on several datasets for recommendation tasks. Further analysis demonstrates that PEAGNN also learns meaningful metapath combinations from a given set of metapaths. △ Less

Submitted 1 April, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

arXiv:2010.07556 [pdf, other]

Encoder-decoder semantic segmentation models for electroluminescence images of thin-film photovoltaic modules

Authors: Evgenii Sovetkin, Elbert Jan Achterberg, Thomas Weber, Bart E. Pieters

Abstract: We consider a series of image segmentation methods based on the deep neural networks in order to perform semantic segmentation of electroluminescence (EL) images of thin-film modules. We utilize the encoder-decoder deep neural network architecture. The framework is general such that it can easily be extended to other types of images (e.g. thermography) or solar cell technologies (e.g. crystalline… ▽ More We consider a series of image segmentation methods based on the deep neural networks in order to perform semantic segmentation of electroluminescence (EL) images of thin-film modules. We utilize the encoder-decoder deep neural network architecture. The framework is general such that it can easily be extended to other types of images (e.g. thermography) or solar cell technologies (e.g. crystalline silicon modules). The networks are trained and tested on a sample of images from a database with 6000 EL images of Copper Indium Gallium Diselenide (CIGS) thin film modules. We selected two types of features to extract, shunts and so called "droplets". The latter feature is often observed in the set of images. Several models are tested using various combinations of encoder-decoder layers, and a procedure is proposed to select the best model. We show exemplary results with the best selected model. Furthermore, we applied the best model to the full set of 6000 images and demonstrate that the automated segmentation of EL images can reveal many subtle features which cannot be inferred from studying a small sample of images. We believe these features can contribute to process optimization and quality control. △ Less

Submitted 15 October, 2020; originally announced October 2020.

arXiv:2010.04602 [pdf, other]

Integrating Intrinsic and Extrinsic Explainability: The Relevance of Understanding Neural Networks for Human-Robot Interaction

Authors: Tom Weber, Stefan Wermter

Abstract: Explainable artificial intelligence (XAI) can help foster trust in and acceptance of intelligent and autonomous systems. Moreover, understanding the motivation for an agent's behavior results in better and more successful collaborations between robots and humans. However, not only can humans benefit from a robot's explanation but the robot itself can also benefit from explanations given to him. Cu… ▽ More Explainable artificial intelligence (XAI) can help foster trust in and acceptance of intelligent and autonomous systems. Moreover, understanding the motivation for an agent's behavior results in better and more successful collaborations between robots and humans. However, not only can humans benefit from a robot's explanation but the robot itself can also benefit from explanations given to him. Currently, most attention is paid to explaining deep neural networks and black-box models. However, a lot of these approaches are not applicable to humanoid robots. Therefore, in this position paper, current problems with adapting XAI methods to explainable neurorobotics are described. Furthermore, NICO, an open-source humanoid robot platform, is introduced and how the interaction of intrinsic explanations by the robot itself and extrinsic explanations provided by the environment enable efficient robotic behavior. △ Less

Submitted 9 October, 2020; originally announced October 2020.

Comments: Fall Symposium AAAI 2020

arXiv:2010.01298 [pdf, other]

Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban

Authors: Peter Karkus, Mehdi Mirza, Arthur Guez, Andrew Jaegle, Timothy Lillicrap, Lars Buesing, Nicolas Heess, Theophane Weber

Abstract: Intelligent robots need to achieve abstract objectives using concrete, spatiotemporally complex sensory information and motor control. Tabula rasa deep reinforcement learning (RL) has tackled demanding tasks in terms of either visual, abstract, or physical reasoning, but solving these jointly remains a formidable challenge. One recent, unsolved benchmark task that integrates these challenges is Mu… ▽ More Intelligent robots need to achieve abstract objectives using concrete, spatiotemporally complex sensory information and motor control. Tabula rasa deep reinforcement learning (RL) has tackled demanding tasks in terms of either visual, abstract, or physical reasoning, but solving these jointly remains a formidable challenge. One recent, unsolved benchmark task that integrates these challenges is Mujoban, where a robot needs to arrange 3D warehouses generated from 2D Sokoban puzzles. We explore whether integrated tasks like Mujoban can be solved by composing RL modules together in a sense-plan-act hierarchy, where modules have well-defined roles similarly to classic robot architectures. Unlike classic architectures that are typically model-based, we use only model-free modules trained with RL or supervised learning. We find that our modular RL approach dramatically outperforms the state-of-the-art monolithic RL agent on Mujoban. Further, learned modules can be reused when, e.g., using a different robot platform to solve the same task. Together our results give strong evidence for the importance of research into modular RL designs. Project website: https://sites.google.com/view/modular-rl/ △ Less

Submitted 3 October, 2020; originally announced October 2020.

arXiv:2009.08669 [pdf, other]

doi 10.1103/PhysRevA.102.063118

The role of dipole-forbidden autoionizing resonances in non-resonant one-color two-photon single ionization of N$_2$

Authors: Kirk A. Larsen, Roger Y. Bello, Robert R. Lucchese, Thomas N. Rescigno, C. William McCurdy, Daniel S. Slaughter, Thorsten Weber

Abstract: We present an experimental and theoretical energy- and angle-resolved study on the photoionization dynamics of non-resonant one-color two-photon single valence ionization of neutral N$_2$ molecules. Using 9.3 eV photons produced via high harmonic generation and a 3-D momentum imaging spectrometer, we detect the photoelectrons and ions produced from one-color two-photon ionization in coincidence. P… ▽ More We present an experimental and theoretical energy- and angle-resolved study on the photoionization dynamics of non-resonant one-color two-photon single valence ionization of neutral N$_2$ molecules. Using 9.3 eV photons produced via high harmonic generation and a 3-D momentum imaging spectrometer, we detect the photoelectrons and ions produced from one-color two-photon ionization in coincidence. Photoionization of N$_2$ populates the X $^2Σ^+_g$, A $^2Π_u$, and B $^2Σ^+_u$ ionic states of N$_2^+$, where the photoelectron angular distributions associated with the X $^2Σ^+_g$ and A $^2Π_u$ states both vary with changes in photoelectron kinetic energy of only a few hundred meV. We attribute the rapid evolution in the photoelectron angular distributions to the excitation and decay of dipole-forbidden autoionizing resonances that belong to series of different symmetries, all of which are members of the Hopfield series, and compete with the direct two-photon single ionization. △ Less

Submitted 29 December, 2020; v1 submitted 18 September, 2020; originally announced September 2020.

Comments: 12 pages, 9 figures, 2 tables

Journal ref: Phys. Rev. A 102, 063118 (2020)

arXiv:2009.08099 [pdf, ps, other]

doi 10.1016/j.physletb.2020.136059

Observation of a resonant structure in $e^{+}e^{-} \to ωη$ and another in $e^{+}e^{-} \to ωπ^{0}$ at center-of-mass energies between 2.00 and 3.08 GeV

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, S. Ahmed, M. Albrecht, A. Amoroso, Q. An, X. H. Bai, Y. Bai, O. Bakina, R. Baldini Ferroli, I. Balossino, Y. Ban, K. Begzsuren, J. V. Bennett, N. Berger, M. Bertani, D. Bettoni, F. Bianchi, J Biernat, J. Bloms, A. Bortone, I. Boyko, R. A. Briere , et al. (469 additional authors not shown)

Abstract: Born cross sections for the processes $e^+e^- \to ωη$ and $e^+e^- \to ωπ^{0}$ have been determined for center-of-mass energies between 2.00 and 3.08 GeV with the BESIII detector at the BEPCII collider. The results obtained in this work are consistent with previous measurements but with improved precision. Two resonant structures are observed. In the $e^{+}e^{-} \to ωη$ cross sections, a resonance… ▽ More Born cross sections for the processes $e^+e^- \to ωη$ and $e^+e^- \to ωπ^{0}$ have been determined for center-of-mass energies between 2.00 and 3.08 GeV with the BESIII detector at the BEPCII collider. The results obtained in this work are consistent with previous measurements but with improved precision. Two resonant structures are observed. In the $e^{+}e^{-} \to ωη$ cross sections, a resonance with a mass of $(2179 \pm 21 \pm 3)\text{MeV}/c^2$ and a width of $(89 \pm 28 \pm 5)\text{MeV}$ is observed with a significance of 6.1$σ$. Its properties are consistent with the $φ(2170)$. In the $e^{+}e^{-} \toωπ^{0}$ cross sections, a resonance denoted $Y(2040)$ is observed with a significance of more than 10$σ$. Its mass and width are determined to be $(2034 \pm 13 \pm 9)\text{MeV}/c^2$ and $(234 \pm 30 \pm 25)\text{MeV}$, respectively, where the first uncertainties are statistical and the second ones are systematic. △ Less

Submitted 30 October, 2020; v1 submitted 17 September, 2020; originally announced September 2020.

Comments: 14 pages, 5 figures

Journal ref: Physics Letters B Volume 813, 10 February 2021, 136059

arXiv:2009.05524 [pdf, other]

Physically Embedded Planning Problems: New Challenges for Reinforcement Learning

Authors: Mehdi Mirza, Andrew Jaegle, Jonathan J. Hunt, Arthur Guez, Saran Tunyasuvunakool, Alistair Muldal, Théophane Weber, Peter Karkus, Sébastien Racanière, Lars Buesing, Timothy Lillicrap, Nicolas Heess

Abstract: Recent work in deep reinforcement learning (RL) has produced algorithms capable of mastering challenging games such as Go, chess, or shogi. In these works the RL agent directly observes the natural state of the game and controls that state directly with its actions. However, when humans play such games, they do not just reason about the moves but also interact with their physical environment. They… ▽ More Recent work in deep reinforcement learning (RL) has produced algorithms capable of mastering challenging games such as Go, chess, or shogi. In these works the RL agent directly observes the natural state of the game and controls that state directly with its actions. However, when humans play such games, they do not just reason about the moves but also interact with their physical environment. They understand the state of the game by looking at the physical board in front of them and modify it by manipulating pieces using touch and fine-grained motor control. Mastering complicated physical systems with abstract goals is a central challenge for artificial intelligence, but it remains out of reach for existing RL algorithms. To encourage progress towards this goal we introduce a set of physically embedded planning problems and make them publicly available. We embed challenging symbolic tasks (Sokoban, tic-tac-toe, and Go) in a physics engine to produce a set of tasks that require perception, reasoning, and motor control over long time horizons. Although existing RL algorithms can tackle the symbolic versions of these tasks, we find that they struggle to master even the simplest of their physically embedded counterparts. As a first step towards characterizing the space of solution to these tasks, we introduce a strong baseline that uses a pre-trained expert game player to provide hints in the abstract space to an RL agent's policy while training it on the full sensorimotor control task. The resulting agent solves many of the tasks, underlining the need for methods that bridge the gap between abstract planning and embodied control. See illustrating video at https://youtu.be/RwHiHlym_1k. △ Less

Submitted 29 October, 2020; v1 submitted 11 September, 2020; originally announced September 2020.

Comments: 17 pages + appendix. Updated text and references

Showing 51–100 of 436 results for author: Weber, T