Skip to main content

Showing 1–50 of 80 results for author: Weber, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.13760  [pdf, other

    eess.SY cs.AI

    Neural Network Tire Force Modeling for Automated Drifting

    Authors: Nicholas Drake Broadbent, Trey Weber, Daiki Mori, J. Christian Gerdes

    Abstract: Automated drifting presents a challenge problem for vehicle control, requiring models and control algorithms that can precisely handle nonlinear, coupled tire forces at the friction limits. We present a neural network architecture for predicting front tire lateral force as a drop-in replacement for physics-based approaches. With a full-scale automated vehicle purpose-built for the drifting applica… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 16th International Symposium on Advanced Vehicle Control (AVEC). September 2nd-6th, 2024. Milan, Italy

  2. arXiv:2406.10636  [pdf, other

    cs.HC

    From Computational to Conversational Notebooks

    Authors: Thomas Weber, Sven Mayer

    Abstract: Today, we see a drastic increase in LLM-based user interfaces to support users in various tasks. Also, in programming, we witness a productivity boost with features like LLM-supported code completion and conversational agents to generate code. In this work, we look at the future of computational notebooks by enriching them with LLM support. We propose a spectrum of support, from simple inline code… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 1st ACM CHI Workshop on Human-Notebook Interactions

  3. arXiv:2406.05072  [pdf, other

    cs.LG stat.ML

    Linearization Turns Neural Operators into Function-Valued Gaussian Processes

    Authors: Emilia Magnani, Marvin Pförtner, Tobias Weber, Philipp Hennig

    Abstract: Modeling dynamical systems, e.g. in climate and engineering sciences, often necessitates solving partial differential equations. Neural operators are deep neural networks designed to learn nontrivial solution operators of such differential equations from data. As for all statistical models, the predictions of these models are imperfect and exhibit errors. Such errors are particularly difficult to… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    MSC Class: G.1.0; I.2.6; G.3; G.1.8

  4. arXiv:2405.02475  [pdf, other

    cs.LG cs.AI stat.CO stat.ME

    Generalizing Orthogonalization for Models with Non-Linearities

    Authors: David Rügamer, Chris Kolb, Tobias Weber, Lucas Kook, Thomas Nagler

    Abstract: The complexity of black-box algorithms can lead to various challenges, including the introduction of biases. These biases present immediate risks in the algorithms' application. It was, for instance, shown that neural networks can deduce racial information solely from a patient's X-ray scan, a task beyond the capability of medical experts. If this fact is not known to the medical expert, automatic… ▽ More

    Submitted 2 June, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

  5. arXiv:2404.09683  [pdf, other

    eess.IV cs.CV cs.LG

    Post-Training Network Compression for 3D Medical Image Segmentation: Reducing Computational Efforts via Tucker Decomposition

    Authors: Tobias Weber, Jakob Dexl, David Rügamer, Michael Ingrisch

    Abstract: We address the computational barrier of deploying advanced deep learning segmentation models in clinical settings by studying the efficacy of network compression through tensor decomposition. We propose a post-training Tucker factorization that enables the decomposition of pre-existing models to reduce computational requirements without impeding segmentation accuracy. We applied Tucker decompositi… ▽ More

    Submitted 18 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  6. arXiv:2311.05540  [pdf, other

    cs.HC

    Usability and Adoption of Graphical Data-Driven Development Tools

    Authors: Thomas Weber, Sven Mayer

    Abstract: Software development of modern, data-driven applications still relies on tools that use interaction paradigms that have remained mostly unchanged for decades. While rich forms of interactions exist as an alternative to textual command input, they find little adoption in professional software creation. In this work, we compare graphical programming using direct manipulation to the traditional, text… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  7. arXiv:2311.01349  [pdf, other

    cs.LG cs.CY stat.ML

    Post-hoc Orthogonalization for Mitigation of Protected Feature Bias in CXR Embeddings

    Authors: Tobias Weber, Michael Ingrisch, Bernd Bischl, David Rügamer

    Abstract: Purpose: To analyze and remove protected feature effects in chest radiograph embeddings of deep learning models. Methods: An orthogonalization is utilized to remove the influence of protected features (e.g., age, sex, race) in CXR embeddings, ensuring feature-independent results. To validate the efficacy of the approach, we retrospectively study the MIMIC and CheXpert datasets using three pre-trai… ▽ More

    Submitted 11 June, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

  8. arXiv:2310.18091  [pdf, other

    cs.LG stat.ML

    Adversarial Anomaly Detection using Gaussian Priors and Nonlinear Anomaly Scores

    Authors: Fiete Lüer, Tobias Weber, Maxim Dolgich, Christian Böhm

    Abstract: Anomaly detection in imbalanced datasets is a frequent and crucial problem, especially in the medical domain where retrieving and labeling irregularities is often expensive. By combining the generative stability of a $β$-variational autoencoder (VAE) with the discriminative strengths of generative adversarial networks (GANs), we propose a novel model, $β$-VAEGAN. We investigate methods for composi… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: accepted at AI4TS @ ICDMW 2023

  9. arXiv:2307.02924  [pdf, other

    cs.RO cs.HC

    The Emotional Dilemma: Influence of a Human-like Robot on Trust and Cooperation

    Authors: Dennis Becker, Diana Rueda, Felix Beese, Brenda Scarleth Gutierrez Torres, Myriem Lafdili, Kyra Ahrens, Di Fu, Erik Strahl, Tom Weber, Stefan Wermter

    Abstract: Increasing anthropomorphic robot behavioral design could affect trust and cooperation positively. However, studies have shown contradicting results and suggest a task-dependent relationship between robots that display emotions and trust. Therefore, this study analyzes the effect of robots that display human-like emotions on trust, cooperation, and participants' emotions. In the between-group study… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: Accepted at 2023 32nd IEEE International Conference on Robot and Human Interactive Communication (RO-MAN)

  10. arXiv:2305.16376  [pdf, other

    eess.IV cs.CV cs.LG

    Constrained Probabilistic Mask Learning for Task-specific Undersampled MRI Reconstruction

    Authors: Tobias Weber, Michael Ingrisch, Bernd Bischl, David Rügamer

    Abstract: Undersampling is a common method in Magnetic Resonance Imaging (MRI) to subsample the number of data points in k-space, reducing acquisition times at the cost of decreased image quality. A popular approach is to employ undersampling patterns following various strategies, e.g., variable density sampling or radial trajectories. In this work, we propose a method that directly learns the undersampling… ▽ More

    Submitted 22 August, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: accepted at WACV 2024

  11. arXiv:2305.02054  [pdf

    cs.LG cs.AI cs.RO

    Map-based Experience Replay: A Memory-Efficient Solution to Catastrophic Forgetting in Reinforcement Learning

    Authors: Muhammad Burhan Hafez, Tilman Immisch, Tom Weber, Stefan Wermter

    Abstract: Deep Reinforcement Learning agents often suffer from catastrophic forgetting, forgetting previously found solutions in parts of the input space when training on new data. Replay Memories are a common solution to the problem, decorrelating and shuffling old and new training samples. They naively store state transitions as they come in, without regard for redundancy. We introduce a novel cognitive-i… ▽ More

    Submitted 28 August, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Journal ref: Frontiers in Neurorobotics 17:1127642 (2023)

  12. arXiv:2304.05823  [pdf, other

    q-bio.MN cs.LG q-bio.GN

    DiscoGen: Learning to Discover Gene Regulatory Networks

    Authors: Nan Rosemary Ke, Sara-Jane Dunn, Jorg Bornschein, Silvia Chiappa, Melanie Rey, Jean-Baptiste Lespiau, Albin Cassirer, Jane Wang, Theophane Weber, David Barrett, Matthew Botvinick, Anirudh Goyal, Mike Mozer, Danilo Rezende

    Abstract: Accurately inferring Gene Regulatory Networks (GRNs) is a critical and challenging task in biology. GRNs model the activatory and inhibitory interactions between genes and are inherently causal in nature. To accurately identify GRNs, perturbational data is required. However, most GRN discovery methods only operate on observational data. Recent advances in neural network-based causal discovery meth… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

  13. Automated wildlife image classification: An active learning tool for ecological applications

    Authors: Ludwig Bothmann, Lisa Wimmer, Omid Charrakh, Tobias Weber, Hendrik Edelhoff, Wibke Peters, Hien Nguyen, Caryl Benjamin, Annette Menzel

    Abstract: Wildlife camera trap images are being used extensively to investigate animal abundance, habitat associations, and behavior, which is complicated by the fact that experts must first classify the images manually. Artificial intelligence systems can take over this task but usually need a large number of already-labeled training images to achieve sufficient performance. This requirement necessitates h… ▽ More

    Submitted 2 August, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Journal ref: Ecological Informatics (2023) 102231

  14. arXiv:2303.14041  [pdf, other

    physics.ins-det cs.CG

    Motion Planning for Triple-Axis Spectrometers

    Authors: Tobias Weber

    Abstract: We present the free and open source software TAS-Paths, a novel system which calculates optimal, collision-free paths for the movement of triple-axis spectrometers. The software features an easy to use graphical user interface, but can also be scripted and used as a library. It allows the user to plan and visualise the motion of the instrument before the experiment and can be used during measureme… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: 6 pages, 4 figures

  15. arXiv:2303.11224  [pdf, other

    eess.IV cs.CV cs.LG

    Cascaded Latent Diffusion Models for High-Resolution Chest X-ray Synthesis

    Authors: Tobias Weber, Michael Ingrisch, Bernd Bischl, David Rügamer

    Abstract: While recent advances in large-scale foundational models show promising results, their application to the medical domain has not yet been explored in detail. In this paper, we progress into the realms of large-scale modeling in medical synthesis by proposing Cheff - a foundational cascaded latent diffusion model, which generates highly-realistic chest radiographs providing state-of-the-art quality… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: accepted at PAKDD 2023

  16. arXiv:2302.04798  [pdf, other

    cs.LG cs.AI stat.ML

    Equivariant MuZero

    Authors: Andreea Deac, Théophane Weber, George Papamakarios

    Abstract: Deep reinforcement learning repeatedly succeeds in closed, well-defined domains such as games (Chess, Go, StarCraft). The next frontier is real-world scenarios, where setups are numerous and varied. For this, agents need to learn the underlying rules governing the environment, so as to robustly generalise to conditions that differ from those they were trained on. Model-based reinforcement learning… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: 9 pages, 3 figures

  17. arXiv:2302.04009  [pdf, other

    cs.LG

    Investigating the role of model-based learning in exploration and transfer

    Authors: Jacob Walker, Eszter Vértes, Yazhe Li, Gabriel Dulac-Arnold, Ankesh Anand, Théophane Weber, Jessica B. Hamrick

    Abstract: State of the art reinforcement learning has enabled training agents on tasks of ever increasing complexity. However, the current paradigm tends to favor training agents from scratch on every new task or on collections of tasks with a view towards generalizing to novel task configurations. The former suffers from poor data efficiency while the latter is difficult when test tasks are out-of-distribu… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

  18. arXiv:2301.05747  [pdf, other

    cs.CV cs.AI

    Laser: Latent Set Representations for 3D Generative Modeling

    Authors: Pol Moreno, Adam R. Kosiorek, Heiko Strathmann, Daniel Zoran, Rosalia G. Schneider, Björn Winckler, Larisa Markeeva, Théophane Weber, Danilo J. Rezende

    Abstract: NeRF provides unparalleled fidelity of novel view synthesis: rendering a 3D scene from an arbitrary viewpoint. NeRF requires training on a large number of views that fully cover a scene, which limits its applicability. While these issues can be addressed by learning a prior over scenes in various forms, previous approaches have been either applied to overly simple scenes or struggling to render un… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

    Comments: See https://laser-nv-paper.github.io/ for video results

  19. arXiv:2212.14882  [pdf, other

    cs.CL cs.LG

    ChatGPT Makes Medicine Easy to Swallow: An Exploratory Case Study on Simplified Radiology Reports

    Authors: Katharina Jeblick, Balthasar Schachtner, Jakob Dexl, Andreas Mittermeier, Anna Theresa Stüber, Johanna Topalis, Tobias Weber, Philipp Wesp, Bastian Sabel, Jens Ricke, Michael Ingrisch

    Abstract: The release of ChatGPT, a language model capable of generating text that appears human-like and authentic, has gained significant attention beyond the research community. We expect that the convincing performance of ChatGPT incentivizes users to apply it to a variety of downstream tasks, including prompting the model to simplify their own medical reports. To investigate this phenomenon, we conduct… ▽ More

    Submitted 30 December, 2022; originally announced December 2022.

  20. arXiv:2206.05314  [pdf, other

    cs.LG cs.AI

    Large-Scale Retrieval for Reinforcement Learning

    Authors: Peter C. Humphreys, Arthur Guez, Olivier Tieleman, Laurent Sifre, Théophane Weber, Timothy Lillicrap

    Abstract: Effective decision making involves flexibly relating past experiences and relevant contextual information to a novel situation. In deep reinforcement learning (RL), the dominant paradigm is for an agent to amortise information that helps decision making into its network weights via gradient descent on training losses. Here, we pursue an alternative approach in which agents can utilise large-scale… ▽ More

    Submitted 16 December, 2022; v1 submitted 10 June, 2022; originally announced June 2022.

    Comments: Thirty-sixth Annual Conference on Neural Information Processing Systems (NeurIPS 2022), 16 pages

  21. GASP: Gated Attention For Saliency Prediction

    Authors: Fares Abawi, Tom Weber, Stefan Wermter

    Abstract: Saliency prediction refers to the computational task of modeling overt attention. Social cues greatly influence our attention, consequently altering our eye movements and behavior. To emphasize the efficacy of such features, we present a neural model for integrating social cues and weighting their influences. Our model consists of two stages. During the first stage, we detect two social cues by fo… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: International Joint Conference on Artificial Intelligence (IJCAI-21)

    Journal ref: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (2021) 584-591

  22. arXiv:2204.04875  [pdf, other

    stat.ML cs.LG

    Learning to Induce Causal Structure

    Authors: Nan Rosemary Ke, Silvia Chiappa, Jane Wang, Anirudh Goyal, Jorg Bornschein, Melanie Rey, Theophane Weber, Matthew Botvinic, Michael Mozer, Danilo Jimenez Rezende

    Abstract: The fundamental challenge in causal induction is to infer the underlying graph structure given observational and/or interventional data. Most existing causal induction algorithms operate by generating candidate graphs and evaluating them using either score-based methods (including continuous optimization) or independence tests. In our work, we instead treat the inference process as a black box and… ▽ More

    Submitted 7 October, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

  23. arXiv:2204.04501  [pdf, other

    cs.RO cs.LG

    Explain yourself! Effects of Explanations in Human-Robot Interaction

    Authors: Jakob Ambsdorf, Alina Munir, Yiyao Wei, Klaas Degkwitz, Harm Matthias Harms, Susanne Stannek, Kyra Ahrens, Dennis Becker, Erik Strahl, Tom Weber, Stefan Wermter

    Abstract: Recent developments in explainable artificial intelligence promise the potential to transform human-robot interaction: Explanations of robot decisions could affect user perceptions, justify their reliability, and increase trust. However, the effects on human perceptions of robots that explain their decisions have not been studied thoroughly. To analyze the effect of explainable robots, we conduct… ▽ More

    Submitted 14 June, 2022; v1 submitted 9 April, 2022; originally announced April 2022.

    Comments: Accepted at 2022 31st IEEE International Conference on Robot and Human Interactive Communication (RO-MAN)

  24. arXiv:2202.08417  [pdf, other

    cs.LG

    Retrieval-Augmented Reinforcement Learning

    Authors: Anirudh Goyal, Abram L. Friesen, Andrea Banino, Theophane Weber, Nan Rosemary Ke, Adria Puigdomenech Badia, Arthur Guez, Mehdi Mirza, Peter C. Humphreys, Ksenia Konyushkova, Laurent Sifre, Michal Valko, Simon Osindero, Timothy Lillicrap, Nicolas Heess, Charles Blundell

    Abstract: Most deep reinforcement learning (RL) algorithms distill experience into parametric behavior policies or value functions via gradient updates. While effective, this approach has several disadvantages: (1) it is computationally expensive, (2) it can take many updates to integrate experiences into the parametric model, (3) experiences that are not fully integrated do not appropriately influence the… ▽ More

    Submitted 24 May, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

  25. arXiv:2111.05149  [pdf, other

    cs.CV cs.LG

    Ethically aligned Deep Learning: Unbiased Facial Aesthetic Prediction

    Authors: Michael Danner, Thomas Weber, Leping Peng, Tobias Gerlach, Xueping Su, Matthias Rätsch

    Abstract: Facial beauty prediction (FBP) aims to develop a machine that automatically makes facial attractiveness assessment. In the past those results were highly correlated with human ratings, therefore also with their bias in annotating. As artificial intelligence can have racist and discriminatory tendencies, the cause of skews in the data must be identified. Development of training data and AI algorith… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: Peer reviewed and accepted at CEPE/IACAP 2021 as Extended Abstract

  26. arXiv:2111.01587  [pdf, other

    cs.LG cs.AI

    Procedural Generalization by Planning with Self-Supervised World Models

    Authors: Ankesh Anand, Jacob Walker, Yazhe Li, Eszter Vértes, Julian Schrittwieser, Sherjil Ozair, Théophane Weber, Jessica B. Hamrick

    Abstract: One of the key promises of model-based reinforcement learning is the ability to generalize using an internal model of the world to make predictions in novel environments and tasks. However, the generalization ability of model-based agents is not well understood because existing work has focused on model-free agents when benchmarking generalization. Here, we explicitly measure the generalization ab… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

  27. arXiv:2110.11312  [pdf, other

    cs.LG

    Towards modelling hazard factors in unstructured data spaces using gradient-based latent interpolation

    Authors: Tobias Weber, Michael Ingrisch, Bernd Bischl, David Rügamer

    Abstract: The application of deep learning in survival analysis (SA) allows utilizing unstructured and high-dimensional data types uncommon in traditional survival methods. This allows to advance methods in fields such as digital health, predictive maintenance, and churn analysis, but often yields less interpretable and intuitively understandable models due to the black-box character of deep learning-based… ▽ More

    Submitted 17 November, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021 Workshop, Deep Generative Models and Downstream Applications

  28. arXiv:2110.11303  [pdf, other

    cs.LG

    Survival-oriented embeddings for improving accessibility to complex data structures

    Authors: Tobias Weber, Michael Ingrisch, Matthias Fabritius, Bernd Bischl, David Rügamer

    Abstract: Deep learning excels in the analysis of unstructured data and recent advancements allow to extend these techniques to survival analysis. In the context of clinical radiology, this enables, e.g., to relate unstructured volumetric images to a risk score or a prognosis of life expectancy and support clinical decision making. Medical applications are, however, associated with high criticality and cons… ▽ More

    Submitted 3 November, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021 Workshop, Bridging the Gap: From Machine Learning Research to Clinical Practice

  29. arXiv:2105.03354  [pdf

    cs.AI cs.HC

    The future of human-AI collaboration: a taxonomy of design knowledge for hybrid intelligence systems

    Authors: Dominik Dellermann, Adrian Calma, Nikolaus Lipusch, Thorsten Weber, Sascha Weigel, Philipp Ebel

    Abstract: Recent technological advances, especially in the field of machine learning, provide astonishing progress on the road towards artificial general intelligence. However, tasks in current real-world business applications cannot yet be solved by machines alone. We, therefore, identify the need for developing socio-technological ensembles of humans and machines. Such systems possess the ability to accom… ▽ More

    Submitted 7 May, 2021; originally announced May 2021.

  30. arXiv:2104.06159  [pdf, other

    cs.LG cs.AI

    Muesli: Combining Improvements in Policy Optimization

    Authors: Matteo Hessel, Ivo Danihelka, Fabio Viola, Arthur Guez, Simon Schmitt, Laurent Sifre, Theophane Weber, David Silver, Hado van Hasselt

    Abstract: We propose a novel policy update that combines regularized policy optimization with model learning as an auxiliary loss. The update (henceforth Muesli) matches MuZero's state-of-the-art performance on Atari. Notably, Muesli does so without using deep search: it acts directly with a policy network and has computation speed comparable to model-free baselines. The Atari results are complemented by ex… ▽ More

    Submitted 31 March, 2022; v1 submitted 13 April, 2021; originally announced April 2021.

  31. arXiv:2102.12425  [pdf, other

    cs.LG

    Synthetic Returns for Long-Term Credit Assignment

    Authors: David Raposo, Sam Ritter, Adam Santoro, Greg Wayne, Theophane Weber, Matt Botvinick, Hado van Hasselt, Francis Song

    Abstract: Since the earliest days of reinforcement learning, the workhorse method for assigning credit to actions over time has been temporal-difference (TD) learning, which propagates credit backward timestep-by-timestep. This approach suffers when delays between actions and rewards are long and when intervening unrelated events contribute variance to long-term returns. We propose state-associative (SA) le… ▽ More

    Submitted 24 February, 2021; originally announced February 2021.

  32. Hierarchical Learning Using Deep Optimum-Path Forest

    Authors: Luis C. S. Afonso, Clayton R. Pereira, Silke A. T. Weber, Christian Hook, Alexandre X. Falcão, João P. Papa

    Abstract: Bag-of-Visual Words (BoVW) and deep learning techniques have been widely used in several domains, which include computer-assisted medical diagnoses. In this work, we are interested in developing tools for the automatic identification of Parkinson's disease using machine learning and the concept of BoVW. The proposed approach concerns a hierarchical-based learning technique to design visual diction… ▽ More

    Submitted 18 February, 2021; originally announced February 2021.

  33. arXiv:2102.02274  [pdf, other

    cs.LG cs.AI cs.MA

    Neural Recursive Belief States in Multi-Agent Reinforcement Learning

    Authors: Pol Moreno, Edward Hughes, Kevin R. McKee, Bernardo Avila Pires, Théophane Weber

    Abstract: In multi-agent reinforcement learning, the problem of learning to act is particularly difficult because the policies of co-players may be heavily conditioned on information only observed by them. On the other hand, humans readily form beliefs about the knowledge possessed by their peers and leverage beliefs to inform decision-making. Such abilities underlie individual success in a wide range of Ma… ▽ More

    Submitted 3 February, 2021; originally announced February 2021.

  34. Protecting Privacy and Transforming COVID-19 Case Surveillance Datasets for Public Use

    Authors: Brian Lee, Brandi Dupervil, Nicholas P. Deputy, Wil Duck, Stephen Soroka, Lyndsay Bottichio, Benjamin Silk, Jason Price, Patricia Sweeney, Jennifer Fuld, Todd Weber, Dan Pollock

    Abstract: Objectives: Federal open data initiatives that promote increased sharing of federally collected data are important for transparency, data quality, trust, and relationships with the public and state, tribal, local, and territorial (STLT) partners. These initiatives advance understanding of health conditions and diseases by providing data to more researchers, scientists, and policymakers for analysi… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

    Comments: 19 pages, 4 figures, 1 table, 5 supplements

  35. Mechanisation of Model-theoretic Conservative Extension for HOL with Ad-hoc Overloading

    Authors: Arve Gengelbach, Johannes Åman Pohjola, Tjark Weber

    Abstract: Definitions of new symbols merely abbreviate expressions in logical frameworks, and no new facts (regarding previously defined symbols) should hold because of a new definition. In Isabelle/HOL, definable symbols are types and constants. The latter may be ad-hoc overloaded, i.e. have different definitions for non-overlapping types. We prove that symbols that are independent of a new definition may… ▽ More

    Submitted 11 January, 2021; originally announced January 2021.

    Comments: In Proceedings LFMTP 2020, arXiv:2101.02835

    ACM Class: F.3.1; F.3.2

    Journal ref: EPTCS 332, 2021, pp. 1-17

  36. arXiv:2012.07969  [pdf, other

    stat.ML cs.LG

    A case for new neural network smoothness constraints

    Authors: Mihaela Rosca, Theophane Weber, Arthur Gretton, Shakir Mohamed

    Abstract: How sensitive should machine learning models be to input changes? We tackle the question of model smoothness and show that it is a useful inductive bias which aids generalization, adversarial robustness, generative modeling and reinforcement learning. We explore current methods of imposing smoothness constraints and observe they lack the flexibility to adapt to new tasks, they don't account for da… ▽ More

    Submitted 7 July, 2021; v1 submitted 14 December, 2020; originally announced December 2020.

  37. Discovering key topics from short, real-world medical inquiries via natural language processing and unsupervised learning

    Authors: Angelo Ziletti, Christoph Berns, Oliver Treichel, Thomas Weber, Jennifer Liang, Stephanie Kammerath, Marion Schwaerzler, Jagatheswari Virayah, David Ruau, Xin Ma, Andreas Mattern

    Abstract: Millions of unsolicited medical inquiries are received by pharmaceutical companies every year. It has been hypothesized that these inquiries represent a treasure trove of information, potentially giving insight into matters regarding medicinal products and the associated medical treatments. However, due to the large volume and specialized nature of the inquiries, it is difficult to perform timely,… ▽ More

    Submitted 8 December, 2020; originally announced December 2020.

    Journal ref: Front. Comput. Sci 88 (3) (2021)

  38. arXiv:2011.09464  [pdf, other

    cs.LG

    Counterfactual Credit Assignment in Model-Free Reinforcement Learning

    Authors: Thomas Mesnard, Théophane Weber, Fabio Viola, Shantanu Thakoor, Alaa Saade, Anna Harutyunyan, Will Dabney, Tom Stepleton, Nicolas Heess, Arthur Guez, Éric Moulines, Marcus Hutter, Lars Buesing, Rémi Munos

    Abstract: Credit assignment in reinforcement learning is the problem of measuring an action's influence on future rewards. In particular, this requires separating skill from luck, i.e. disentangling the effect of an action on rewards from that of external factors and subsequent actions. To achieve this, we adapt the notion of counterfactuals from causality theory to a model-free RL setup. The key idea is to… ▽ More

    Submitted 14 December, 2021; v1 submitted 18 November, 2020; originally announced November 2020.

  39. arXiv:2011.04021  [pdf, other

    cs.AI cs.LG

    On the role of planning in model-based deep reinforcement learning

    Authors: Jessica B. Hamrick, Abram L. Friesen, Feryal Behbahani, Arthur Guez, Fabio Viola, Sims Witherspoon, Thomas Anthony, Lars Buesing, Petar Veličković, Théophane Weber

    Abstract: Model-based planning is often thought to be necessary for deep, careful reasoning and generalization in artificial agents. While recent successes of model-based reinforcement learning (MBRL) with deep function approximation have strengthened this hypothesis, the resulting diversity of model-based methods has also made it difficult to track which components drive success and why. In this paper, we… ▽ More

    Submitted 17 March, 2021; v1 submitted 8 November, 2020; originally announced November 2020.

    Comments: Published at ICLR 2021

  40. arXiv:2010.11793  [pdf, other

    cs.LG cs.AI

    Metapath- and Entity-aware Graph Neural Network for Recommendation

    Authors: Muhammad Umer Anwaar, Zhiwei Han, Shyam Arumugaswamy, Rayyan Ahmad Khan, Thomas Weber, Tianming Qiu, Hao Shen, Yuanting Liu, Martin Kleinsteuber

    Abstract: In graph neural networks (GNNs), message passing iteratively aggregates nodes' information from their direct neighbors while neglecting the sequential nature of multi-hop node connections. Such sequential node connections e.g., metapaths, capture critical insights for downstream tasks. Concretely, in recommender systems (RSs), disregarding these insights leads to inadequate distillation of collabo… ▽ More

    Submitted 1 April, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

  41. arXiv:2010.07556  [pdf, other

    eess.IV cs.CV

    Encoder-decoder semantic segmentation models for electroluminescence images of thin-film photovoltaic modules

    Authors: Evgenii Sovetkin, Elbert Jan Achterberg, Thomas Weber, Bart E. Pieters

    Abstract: We consider a series of image segmentation methods based on the deep neural networks in order to perform semantic segmentation of electroluminescence (EL) images of thin-film modules. We utilize the encoder-decoder deep neural network architecture. The framework is general such that it can easily be extended to other types of images (e.g. thermography) or solar cell technologies (e.g. crystalline… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

  42. arXiv:2010.04602  [pdf, other

    cs.RO cs.AI

    Integrating Intrinsic and Extrinsic Explainability: The Relevance of Understanding Neural Networks for Human-Robot Interaction

    Authors: Tom Weber, Stefan Wermter

    Abstract: Explainable artificial intelligence (XAI) can help foster trust in and acceptance of intelligent and autonomous systems. Moreover, understanding the motivation for an agent's behavior results in better and more successful collaborations between robots and humans. However, not only can humans benefit from a robot's explanation but the robot itself can also benefit from explanations given to him. Cu… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.

    Comments: Fall Symposium AAAI 2020

  43. arXiv:2010.01298  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban

    Authors: Peter Karkus, Mehdi Mirza, Arthur Guez, Andrew Jaegle, Timothy Lillicrap, Lars Buesing, Nicolas Heess, Theophane Weber

    Abstract: Intelligent robots need to achieve abstract objectives using concrete, spatiotemporally complex sensory information and motor control. Tabula rasa deep reinforcement learning (RL) has tackled demanding tasks in terms of either visual, abstract, or physical reasoning, but solving these jointly remains a formidable challenge. One recent, unsolved benchmark task that integrates these challenges is Mu… ▽ More

    Submitted 3 October, 2020; originally announced October 2020.

  44. arXiv:2009.05524  [pdf, other

    cs.AI cs.LG

    Physically Embedded Planning Problems: New Challenges for Reinforcement Learning

    Authors: Mehdi Mirza, Andrew Jaegle, Jonathan J. Hunt, Arthur Guez, Saran Tunyasuvunakool, Alistair Muldal, Théophane Weber, Peter Karkus, Sébastien Racanière, Lars Buesing, Timothy Lillicrap, Nicolas Heess

    Abstract: Recent work in deep reinforcement learning (RL) has produced algorithms capable of mastering challenging games such as Go, chess, or shogi. In these works the RL agent directly observes the natural state of the game and controls that state directly with its actions. However, when humans play such games, they do not just reason about the moves but also interact with their physical environment. They… ▽ More

    Submitted 29 October, 2020; v1 submitted 11 September, 2020; originally announced September 2020.

    Comments: 17 pages + appendix. Updated text and references

  45. arXiv:2004.11410  [pdf, other

    cs.LG cs.AI stat.ML

    Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning

    Authors: Giambattista Parascandolo, Lars Buesing, Josh Merel, Leonard Hasenclever, John Aslanides, Jessica B. Hamrick, Nicolas Heess, Alexander Neitz, Theophane Weber

    Abstract: Standard planners for sequential decision making (including Monte Carlo planning, tree search, dynamic programming, etc.) are constrained by an implicit sequential planning assumption: The order in which a plan is constructed is the same in which it is executed. We consider alternatives to this assumption for the class of goal-directed Reinforcement Learning (RL) problems. Instead of an environmen… ▽ More

    Submitted 23 April, 2020; originally announced April 2020.

  46. arXiv:2002.08329  [pdf, other

    cs.LG stat.ML

    Value-driven Hindsight Modelling

    Authors: Arthur Guez, Fabio Viola, Théophane Weber, Lars Buesing, Steven Kapturowski, Doina Precup, David Silver, Nicolas Heess

    Abstract: Value estimation is a critical component of the reinforcement learning (RL) paradigm. The question of how to effectively learn value predictors from data is one of the major problems studied by the RL community, and different approaches exploit structure in the problem domain in different ways. Model learning can make use of the rich transition structure present in sequences of observations, but t… ▽ More

    Submitted 20 October, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: 9 pages + reference + appendix. NeurIPS 2020 version

  47. arXiv:2002.02836  [pdf, other

    cs.LG cs.AI stat.ML

    Causally Correct Partial Models for Reinforcement Learning

    Authors: Danilo J. Rezende, Ivo Danihelka, George Papamakarios, Nan Rosemary Ke, Ray Jiang, Theophane Weber, Karol Gregor, Hamza Merzic, Fabio Viola, Jane Wang, Jovana Mitrovic, Frederic Besse, Ioannis Antonoglou, Lars Buesing

    Abstract: In reinforcement learning, we can learn a model of future observations and rewards, and use it to plan the agent's next actions. However, jointly modeling future observations can be computationally expensive or even intractable if the observations are high-dimensional (e.g. images). For this reason, previous works have considered partial models, which model only part of the observation. In this pa… ▽ More

    Submitted 7 February, 2020; originally announced February 2020.

  48. arXiv:1912.02807  [pdf, other

    cs.LG stat.ML

    Combining Q-Learning and Search with Amortized Value Estimates

    Authors: Jessica B. Hamrick, Victor Bapst, Alvaro Sanchez-Gonzalez, Tobias Pfaff, Theophane Weber, Lars Buesing, Peter W. Battaglia

    Abstract: We introduce "Search with Amortized Value Estimates" (SAVE), an approach for combining model-free Q-learning with model-based Monte-Carlo Tree Search (MCTS). In SAVE, a learned prior over state-action values is used to guide MCTS, which estimates an improved set of state-action values. The new Q-estimates are then used in combination with real experience to update the prior. This effectively amort… ▽ More

    Submitted 10 January, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: Published as a conference paper at ICLR 2020

  49. arXiv:1910.11059  [pdf, other

    cs.HC eess.IV

    Interactive Image Restoration

    Authors: Zhiwei Han, Thomas Weber, Stefan Matthes, Yuanting Liu, Hao Shen

    Abstract: Machine learning and many of its applications are considered hard to approach due to their complexity and lack of transparency. One mission of human-centric machine learning is to improve algorithm transparency and user satisfaction while ensuring an acceptable task accuracy. In this work, we present an interactive image restoration framework, which exploits both image prior and human painting kno… ▽ More

    Submitted 24 October, 2019; originally announced October 2019.

    Comments: Human-centric Machine Learning Workshop, NeurIPS 2019

  50. arXiv:1910.09313  [pdf, other

    cs.IR cs.DL cs.LG stat.ML

    Using Supervised Learning to Classify Metadata of Research Data by Discipline of Research

    Authors: Tobias Weber, Dieter Kranzlmüller, Michael Fromm, Nelson Tavares de Sousa

    Abstract: Automated classification of metadata of research data by their discipline(s) of research can be used in scientometric research, by repository service providers, and in the context of research data aggregation services. Openly available metadata of the DataCite index for research data were used to compile a large training and evaluation set comprised of 609,524 records, which is published alongside… ▽ More

    Submitted 16 October, 2019; originally announced October 2019.