Skip to main content

Showing 1–47 of 47 results for author: Rügamer, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.05429  [pdf, other

    cs.LG cs.AI stat.CO stat.ML

    How Inverse Conditional Flows Can Serve as a Substitute for Distributional Regression

    Authors: Lucas Kook, Chris Kolb, Philipp Schiele, Daniel Dold, Marcel Arpogaus, Cornelius Fritz, Philipp F. Baumann, Philipp Kopper, Tobias Pielok, Emilio Dorigatti, David Rügamer

    Abstract: Neural network representations of simple models, such as linear regression, are being studied increasingly to better understand the underlying principles of deep learning algorithms. However, neural representations of distributional regression models, such as the Cox model, have received little attention so far. We close this gap by proposing a framework for distributional regression using inverse… ▽ More

    Submitted 10 July, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: Accepted at UAI 2024 https://www.auai.org/uai2024/accepted_papers

  2. arXiv:2405.02475  [pdf, other

    cs.LG cs.AI stat.CO stat.ME

    Generalizing Orthogonalization for Models with Non-Linearities

    Authors: David Rügamer, Chris Kolb, Tobias Weber, Lucas Kook, Thomas Nagler

    Abstract: The complexity of black-box algorithms can lead to various challenges, including the introduction of biases. These biases present immediate risks in the algorithms' application. It was, for instance, shown that neural networks can deduce racial information solely from a patient's X-ray scan, a task beyond the capability of medical experts. If this fact is not known to the medical expert, automatic… ▽ More

    Submitted 2 June, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

  3. arXiv:2405.02200  [pdf, other

    cs.LG stat.ML

    Position: Why We Must Rethink Empirical Research in Machine Learning

    Authors: Moritz Herrmann, F. Julian D. Lange, Katharina Eggensperger, Giuseppe Casalicchio, Marcel Wever, Matthias Feurer, David Rügamer, Eyke Hüllermeier, Anne-Laure Boulesteix, Bernd Bischl

    Abstract: We warn against a common but incomplete understanding of empirical research in machine learning that leads to non-replicable results, makes findings unreliable, and threatens to undermine progress in the field. To overcome this alarming situation, we call for more awareness of the plurality of ways of gaining knowledge experimentally but also of some epistemic limitations. In particular, we argue… ▽ More

    Submitted 25 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Comments: 20 pages, accepted for publication at ICML 2024, camera-ready version

  4. arXiv:2404.09683  [pdf, other

    eess.IV cs.CV cs.LG

    Post-Training Network Compression for 3D Medical Image Segmentation: Reducing Computational Efforts via Tucker Decomposition

    Authors: Tobias Weber, Jakob Dexl, David Rügamer, Michael Ingrisch

    Abstract: We address the computational barrier of deploying advanced deep learning segmentation models in clinical settings by studying the efficacy of network compression through tensor decomposition. We propose a post-training Tucker factorization that enables the decomposition of pre-existing models to reduce computational requirements without impeding segmentation accuracy. We applied Tucker decompositi… ▽ More

    Submitted 18 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  5. arXiv:2403.13150  [pdf, other

    cs.LG cs.AI stat.CO stat.ML

    Training Survival Models using Scoring Rules

    Authors: Philipp Kopper, David Rügamer, Raphael Sonabend, Bernd Bischl, Andreas Bender

    Abstract: Survival Analysis provides critical insights for partially incomplete time-to-event data in various domains. It is also an important example of probabilistic machine learning. The probabilistic nature of the predictions can be exploited by using (proper) scoring rules in the model fitting process instead of likelihood-based optimization. Our proposal does so in a generic manner and can be used for… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  6. arXiv:2403.10923  [pdf, other

    cs.LG cs.AI stat.CO stat.ML

    Interpretable Machine Learning for TabPFN

    Authors: David Rundel, Julius Kobialka, Constantin von Crailsheim, Matthias Feurer, Thomas Nagler, David Rügamer

    Abstract: The recently developed Prior-Data Fitted Networks (PFNs) have shown very promising results for applications in low-data regimes. The TabPFN model, a special case of PFNs for tabular data, is able to achieve state-of-the-art performance on a variety of classification tasks while producing posterior predictive distributions in mere seconds by in-context learning without the need for learning paramet… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

  7. arXiv:2402.01484  [pdf, other

    cs.LG stat.CO stat.ML

    Connecting the Dots: Is Mode-Connectedness the Key to Feasible Sample-Based Inference in Bayesian Neural Networks?

    Authors: Emanuel Sommer, Lisa Wimmer, Theodore Papamarkou, Ludwig Bothmann, Bernd Bischl, David Rügamer

    Abstract: A major challenge in sample-based inference (SBI) for Bayesian neural networks is the size and structure of the networks' parameter space. Our work shows that successful SBI is possible by embracing the characteristic relationship between weight and function space, uncovering a systematic link between overparameterization and the difficulty of the sampling problem. Through extensive experiments, w… ▽ More

    Submitted 27 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  8. arXiv:2402.01090  [pdf, other

    stat.ML cs.LG stat.CO

    Scalable Higher-Order Tensor Product Spline Models

    Authors: David Rügamer

    Abstract: In the current era of vast data and transparent machine learning, it is essential for techniques to operate at a large scale while providing a clear mathematical comprehension of the internal workings of the method. Although there already exist interpretable semi-parametric regression methods for large-scale applications that take into account non-linearity in the data, the complexity of the model… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted at the 27th International Conference on Artificial Intelligence and Statistics (AISTATS) 2024. arXiv admin note: substantial text overlap with arXiv:2205.14515

  9. arXiv:2402.00809  [pdf, other

    cs.LG stat.ML

    Position: Bayesian Deep Learning is Needed in the Age of Large-Scale AI

    Authors: Theodore Papamarkou, Maria Skoularidou, Konstantina Palla, Laurence Aitchison, Julyan Arbel, David Dunson, Maurizio Filippone, Vincent Fortuin, Philipp Hennig, José Miguel Hernández-Lobato, Aliaksandr Hubin, Alexander Immer, Theofanis Karaletsos, Mohammad Emtiyaz Khan, Agustinus Kristiadi, Yingzhen Li, Stephan Mandt, Christopher Nemeth, Michael A. Osborne, Tim G. J. Rudner, David Rügamer, Yee Whye Teh, Max Welling, Andrew Gordon Wilson, Ruqi Zhang

    Abstract: In the current landscape of deep learning research, there is a predominant emphasis on achieving high predictive accuracy in supervised tasks involving large image and language datasets. However, a broader perspective reveals a multitude of overlooked metrics, tasks, and data types, such as uncertainty, active and continual learning, and scientific data, that demand attention. Bayesian deep learni… ▽ More

    Submitted 2 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  10. arXiv:2401.12950  [pdf, other

    cs.LG stat.ML

    Bayesian Semi-structured Subspace Inference

    Authors: Daniel Dold, David Rügamer, Beate Sick, Oliver Dürr

    Abstract: Semi-structured regression models enable the joint modeling of interpretable structured and complex unstructured feature effects. The structured model part is inspired by statistical models and can be used to infer the input-output relationship for features of particular importance. The complex unstructured part defines an arbitrary deep neural network and thereby provides enough flexibility to ac… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: Accepted at AISTATS 2024

  11. arXiv:2311.01349  [pdf, other

    cs.LG cs.CY stat.ML

    Post-hoc Orthogonalization for Mitigation of Protected Feature Bias in CXR Embeddings

    Authors: Tobias Weber, Michael Ingrisch, Bernd Bischl, David Rügamer

    Abstract: Purpose: To analyze and remove protected feature effects in chest radiograph embeddings of deep learning models. Methods: An orthogonalization is utilized to remove the influence of protected features (e.g., age, sex, race) in CXR embeddings, ensuring feature-independent results. To validate the efficacy of the approach, we retrospectively study the MIMIC and CheXpert datasets using three pre-trai… ▽ More

    Submitted 11 June, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

  12. arXiv:2308.01684  [pdf, other

    cs.CL

    Baby's CoThought: Leveraging Large Language Models for Enhanced Reasoning in Compact Models

    Authors: Zheyu Zhang, Han Yang, Bolei Ma, David Rügamer, Ercong Nie

    Abstract: Large Language Models (LLMs) demonstrate remarkable performance on a variety of natural language understanding (NLU) tasks, primarily due to their in-context learning ability. This ability could be applied to building babylike models, i.e. models at small scales, improving training efficiency. In this paper, we propose a "CoThought" pipeline, which efficiently trains smaller "baby" language models… ▽ More

    Submitted 23 October, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: CoNLL 2023 BabyLM Challenge

  13. arXiv:2307.03571  [pdf, other

    cs.LG math.OC stat.ML

    Smoothing the Edges: Smooth Optimization for Sparse Regularization using Hadamard Overparametrization

    Authors: Chris Kolb, Christian L. Müller, Bernd Bischl, David Rügamer

    Abstract: We present a framework for smooth optimization of explicitly regularized objectives for (structured) sparsity. These non-smooth and possibly non-convex problems typically rely on solvers tailored to specific models and regularizers. In contrast, our method enables fully differentiable and approximation-free optimization and is thus compatible with the ubiquitous gradient descent paradigm in deep l… ▽ More

    Submitted 26 April, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

  14. arXiv:2306.00522  [pdf, other

    cs.LG stat.ML

    A New PHO-rmula for Improved Performance of Semi-Structured Networks

    Authors: David Rügamer

    Abstract: Recent advances to combine structured regression models and deep neural networks for better interpretability, more expressiveness, and statistically valid uncertainty quantification demonstrate the versatility of semi-structured neural networks (SSNs). We show that techniques to properly identify the contributions of the different model components in SSNs, however, lead to suboptimal network estim… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: ICML 2023

  15. arXiv:2305.16376  [pdf, other

    eess.IV cs.CV cs.LG

    Constrained Probabilistic Mask Learning for Task-specific Undersampled MRI Reconstruction

    Authors: Tobias Weber, Michael Ingrisch, Bernd Bischl, David Rügamer

    Abstract: Undersampling is a common method in Magnetic Resonance Imaging (MRI) to subsample the number of data points in k-space, reducing acquisition times at the cost of decreased image quality. A popular approach is to employ undersampling patterns following various strategies, e.g., variable density sampling or radial trajectories. In this work, we propose a method that directly learns the undersampling… ▽ More

    Submitted 22 August, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: accepted at WACV 2024

  16. arXiv:2304.07250  [pdf, other

    cs.CV cs.AI

    Fusing Structure from Motion and Simulation-Augmented Pose Regression from Optical Flow for Challenging Indoor Environments

    Authors: Felix Ott, Lucas Heublein, David Rügamer, Bernd Bischl, Christopher Mutschler

    Abstract: The localization of objects is a crucial task in various applications such as robotics, virtual and augmented reality, and the transportation of goods in warehouses. Recent advances in deep learning have enabled the localization using monocular visual cameras. While structure from motion (SfM) predicts the absolute pose from a point cloud, absolute pose regression (APR) methods learn a semantic un… ▽ More

    Submitted 9 June, 2024; v1 submitted 14 April, 2023; originally announced April 2023.

    MSC Class: 68U01 ACM Class: I.2.9; I.2.10; I.4.1; I.4.10; I.5.4

  17. arXiv:2304.02902  [pdf, other

    stat.ML cs.LG

    Towards Efficient MCMC Sampling in Bayesian Neural Networks by Exploiting Symmetry

    Authors: Jonas Gregor Wiese, Lisa Wimmer, Theodore Papamarkou, Bernd Bischl, Stephan Günnemann, David Rügamer

    Abstract: Bayesian inference in deep neural networks is challenging due to the high-dimensional, strongly multi-modal parameter posterior density landscape. Markov chain Monte Carlo approaches asymptotically recover the true posterior but are considered prohibitively expensive for large modern architectures. Local methods, which have emerged as a popular alternative, focus on specific parameter regions that… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

  18. arXiv:2303.11224  [pdf, other

    eess.IV cs.CV cs.LG

    Cascaded Latent Diffusion Models for High-Resolution Chest X-ray Synthesis

    Authors: Tobias Weber, Michael Ingrisch, Bernd Bischl, David Rügamer

    Abstract: While recent advances in large-scale foundational models show promising results, their application to the medical domain has not yet been explored in detail. In this paper, we progress into the realms of large-scale modeling in medical synthesis by proposing Cheff - a foundational cascaded latent diffusion model, which generates highly-realistic chest radiographs providing state-of-the-art quality… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: accepted at PAKDD 2023

  19. arXiv:2301.06293  [pdf, other

    cs.CV

    Representation Learning for Tablet and Paper Domain Adaptation in Favor of Online Handwriting Recognition

    Authors: Felix Ott, David Rügamer, Lucas Heublein, Bernd Bischl, Christopher Mutschler

    Abstract: The performance of a machine learning model degrades when it is applied to data from a similar but different domain than the data it has initially been trained on. The goal of domain adaptation (DA) is to mitigate this domain shift problem by searching for an optimal feature transformation to learn a domain-invariant representation. Such a domain shift can appear in handwriting recognition (HWR) a… ▽ More

    Submitted 16 January, 2023; originally announced January 2023.

    Comments: Accepted at IAPR Intl. Workshop on Multimodal Pattern Recognition of Social Signals in Human Computer Interaction (MPRSS), Montreal, Canada, August 2022

    MSC Class: 49Q22; 62M10 ACM Class: I.2.4

  20. arXiv:2211.02730  [pdf, other

    stat.ML cs.LG

    Uncertainty-aware predictive modeling for fair data-driven decisions

    Authors: Patrick Kaiser, Christoph Kern, David Rügamer

    Abstract: Both industry and academia have made considerable progress in developing trustworthy and responsible machine learning (ML) systems. While critical concepts like fairness and explainability are often addressed, the safety of systems is typically not sufficiently taken into account. By viewing data-driven decision systems as socio-technical systems, we draw on the uncertainty in ML literature to sho… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

  21. arXiv:2210.07723  [pdf, other

    stat.ML cs.CR cs.LG

    Privacy-Preserving and Lossless Distributed Estimation of High-Dimensional Generalized Additive Mixed Models

    Authors: Daniel Schalk, Bernd Bischl, David Rügamer

    Abstract: Various privacy-preserving frameworks that respect the individual's privacy in the analysis of data have been developed in recent years. However, available model classes such as simple statistics or generalized linear models lack the flexibility required for a good approximation of the underlying data-generating process in practice. In this paper, we propose an algorithm for a distributed, privacy… ▽ More

    Submitted 10 March, 2023; v1 submitted 14 October, 2022; originally announced October 2022.

  22. arXiv:2208.14919  [pdf, other

    cs.LG cs.NE stat.ML

    ARMA Cell: A Modular and Effective Approach for Neural Autoregressive Modeling

    Authors: Philipp Schiele, Christoph Berninger, David Rügamer

    Abstract: The autoregressive moving average (ARMA) model is a classical, and arguably one of the most studied approaches to model time series data. It has compelling theoretical properties and is widely used among practitioners. More recent deep learning approaches popularize recurrent neural networks (RNNs) and, in particular, Long Short-Term Memory (LSTM) cells that have become one of the best performing… ▽ More

    Submitted 11 January, 2024; v1 submitted 31 August, 2022; originally announced August 2022.

    ACM Class: G.3

  23. arXiv:2208.00919  [pdf, other

    cs.CV

    Benchmarking Visual-Inertial Deep Multimodal Fusion for Relative Pose Regression and Odometry-aided Absolute Pose Regression

    Authors: Felix Ott, Nisha Lakshmana Raichur, David Rügamer, Tobias Feigl, Heiko Neumann, Bernd Bischl, Christopher Mutschler

    Abstract: Visual-inertial localization is a key problem in computer vision and robotics applications such as virtual reality, self-driving cars, and aerial vehicles. The goal is to estimate an accurate pose of an object when either the environment or the dynamics are known. Absolute pose regression (APR) techniques directly regress the absolute pose from an image input in a known scene using convolutional a… ▽ More

    Submitted 4 August, 2023; v1 submitted 1 August, 2022; originally announced August 2022.

    Comments: Under review

    MSC Class: 68T40; 65D19 ACM Class: I.4; I.5.1

  24. arXiv:2206.08640  [pdf, other

    cs.CV cs.AI

    Uncertainty-aware Evaluation of Time-Series Classification for Online Handwriting Recognition with Domain Shift

    Authors: Andreas Klaß, Sven M. Lorenz, Martin W. Lauer-Schmaltz, David Rügamer, Bernd Bischl, Christopher Mutschler, Felix Ott

    Abstract: For many applications, analyzing the uncertainty of a machine learning model is indispensable. While research of uncertainty quantification (UQ) techniques is very advanced for computer vision applications, UQ methods for spatio-temporal data are less studied. In this paper, we focus on models for online handwriting recognition, one particular type of spatio-temporal data. The data is observed fro… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    MSC Class: 62F15 ACM Class: H.1.1

  25. arXiv:2205.14515  [pdf, other

    stat.CO cs.LG stat.ML

    Additive Higher-Order Factorization Machines

    Authors: David Rügamer

    Abstract: In the age of big data and interpretable machine learning, approaches need to work at scale and at the same time allow for a clear mathematical understanding of the method's inner workings. While there exist inherently interpretable semi-parametric regression techniques for large-scale applications to account for non-linearity in the data, their model complexity is still often restricted. One of t… ▽ More

    Submitted 1 February, 2023; v1 submitted 28 May, 2022; originally announced May 2022.

  26. arXiv:2205.13080  [pdf, other

    stat.ML cs.LG stat.CO

    Factorized Structured Regression for Large-Scale Varying Coefficient Models

    Authors: David Rügamer, Andreas Bender, Simon Wiegrebe, Daniel Racek, Bernd Bischl, Christian L. Müller, Clemens Stachl

    Abstract: Recommender Systems (RS) pervade many aspects of our everyday digital life. Proposed to work at scale, state-of-the-art RS allow the modeling of thousands of interactions and facilitate highly individualized recommendations. Conceptually, many RS can be viewed as instances of statistical regression models that incorporate complex feature effects and potentially non-Gaussian outcomes. Such structur… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

  27. arXiv:2204.03342  [pdf, other

    cs.LG cs.AI

    Domain Adaptation for Time-Series Classification to Mitigate Covariate Shift

    Authors: Felix Ott, David Rügamer, Lucas Heublein, Bernd Bischl, Christopher Mutschler

    Abstract: The performance of a machine learning model degrades when it is applied to data from a similar but different domain than the data it has initially been trained on. To mitigate this domain shift problem, domain adaptation (DA) techniques search for an optimal transformation that converts the (current) input data from a source domain to a target domain to learn a domain-invariant representation that… ▽ More

    Submitted 15 July, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

    MSC Class: 49Q22; 62M10 ACM Class: I.2.4

  28. Auxiliary Cross-Modal Representation Learning with Triplet Loss Functions for Online Handwriting Recognition

    Authors: Felix Ott, David Rügamer, Lucas Heublein, Bernd Bischl, Christopher Mutschler

    Abstract: Cross-modal representation learning learns a shared embedding between two or more modalities to improve performance in a given task compared to using only one of the modalities. Cross-modal representation learning from different data types -- such as images and time-series data (e.g., audio or text data) -- requires a deep metric learning loss that minimizes the distance between the modality embed… ▽ More

    Submitted 3 August, 2023; v1 submitted 16 February, 2022; originally announced February 2022.

    MSC Class: 68T30; 68T35 ACM Class: I.2.4

    Journal ref: IEEE Access, volume 11, pages 94148-94172, August 2023

  29. arXiv:2202.07423  [pdf, other

    stat.ML cs.LG

    DeepPAMM: Deep Piecewise Exponential Additive Mixed Models for Complex Hazard Structures in Survival Analysis

    Authors: Philipp Kopper, Simon Wiegrebe, Bernd Bischl, Andreas Bender, David Rügamer

    Abstract: Survival analysis (SA) is an active field of research that is concerned with time-to-event outcomes and is prevalent in many domains, particularly biomedical applications. Despite its importance, SA remains challenging due to small-scale data sets and complex outcome distributions, concealed by truncation and censoring processes. The piecewise exponential additive mixed model (PAMM) is a model cla… ▽ More

    Submitted 12 February, 2022; originally announced February 2022.

    Comments: 13 pages, 2 figures, This work has been accepted by the 26th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD2022)

  30. Benchmarking Online Sequence-to-Sequence and Character-based Handwriting Recognition from IMU-Enhanced Pens

    Authors: Felix Ott, David Rügamer, Lucas Heublein, Tim Hamann, Jens Barth, Bernd Bischl, Christopher Mutschler

    Abstract: Purpose. Handwriting is one of the most frequently occurring patterns in everyday life and with it come challenging applications such as handwriting recognition (HWR), writer identification, and signature verification. In contrast to offline HWR that only uses spatial information (i.e., images), online HWR (OnHWR) uses richer spatio-temporal information (i.e., trajectory data or inertial data). Wh… ▽ More

    Submitted 21 September, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Accepted for International Journal on Document Analysis and Recognition (IJDAR)

    MSC Class: 68T30; 68T10 ACM Class: I.5.4

  31. arXiv:2111.05303  [pdf, other

    cs.LG physics.ao-ph stat.ML

    Identifying the atmospheric drivers of drought and heat using a smoothed deep learning approach

    Authors: Magdalena Mittermeier, Maximilian Weigert, David Rügamer

    Abstract: Europe was hit by several, disastrous heat and drought events in recent summers. Besides thermodynamic influences, such hot and dry extremes are driven by certain atmospheric situations including anticyclonic conditions. Effects of climate change on atmospheric circulations are complex and many open research questions remain in this context, e.g., on future trends of anticyclonic conditions. Based… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021: Tackling Climate Change with Machine Learning

  32. arXiv:2110.11312  [pdf, other

    cs.LG

    Towards modelling hazard factors in unstructured data spaces using gradient-based latent interpolation

    Authors: Tobias Weber, Michael Ingrisch, Bernd Bischl, David Rügamer

    Abstract: The application of deep learning in survival analysis (SA) allows utilizing unstructured and high-dimensional data types uncommon in traditional survival methods. This allows to advance methods in fields such as digital health, predictive maintenance, and churn analysis, but often yields less interpretable and intuitively understandable models due to the black-box character of deep learning-based… ▽ More

    Submitted 17 November, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021 Workshop, Deep Generative Models and Downstream Applications

  33. arXiv:2110.11303  [pdf, other

    cs.LG

    Survival-oriented embeddings for improving accessibility to complex data structures

    Authors: Tobias Weber, Michael Ingrisch, Matthias Fabritius, Bernd Bischl, David Rügamer

    Abstract: Deep learning excels in the analysis of unstructured data and recent advancements allow to extend these techniques to survival analysis. In the context of clinical radiology, this enables, e.g., to relate unstructured volumetric images to a risk score or a prognosis of life expectancy and support clinical decision making. Medical applications are, however, associated with high criticality and cons… ▽ More

    Submitted 3 November, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021 Workshop, Bridging the Gap: From Machine Learning Research to Clinical Practice

  34. arXiv:2110.08248  [pdf, other

    cs.LG

    Probabilistic Time Series Forecasts with Autoregressive Transformation Models

    Authors: David Rügamer, Philipp F. M. Baumann, Thomas Kneib, Torsten Hothorn

    Abstract: Probabilistic forecasting of time series is an important matter in many applications and research fields. In order to draw conclusions from a probabilistic forecast, we must ensure that the model class used to approximate the true forecasting distribution is expressive enough. Yet, characteristics of the model itself, such as its uncertainty or its feature-outcome relationship are not of lesser im… ▽ More

    Submitted 9 July, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

  35. arXiv:2110.03513  [pdf, other

    stat.CO cs.LG

    Accelerated Componentwise Gradient Boosting using Efficient Data Representation and Momentum-based Optimization

    Authors: Daniel Schalk, Bernd Bischl, David Rügamer

    Abstract: Componentwise boosting (CWB), also known as model-based boosting, is a variant of gradient boosting that builds on additive models as base learners to ensure interpretability. CWB is thus often used in research areas where models are employed as tools to explain relationships in data. One downside of CWB is its computational complexity in terms of memory and runtime. In this paper, we propose two… ▽ More

    Submitted 29 October, 2021; v1 submitted 7 October, 2021; originally announced October 2021.

  36. arXiv:2109.05583  [pdf, ps, other

    stat.ML cs.LG

    Automatic Componentwise Boosting: An Interpretable AutoML System

    Authors: Stefan Coors, Daniel Schalk, Bernd Bischl, David Rügamer

    Abstract: In practice, machine learning (ML) workflows require various different steps, from data preprocessing, missing value imputation, model selection, to model tuning as well as model evaluation. Many of these steps rely on human ML experts. AutoML - the field of automating these ML pipelines - tries to help practitioners to apply ML off-the-shelf without any expert knowledge. Most modern AutoML system… ▽ More

    Submitted 16 October, 2021; v1 submitted 12 September, 2021; originally announced September 2021.

    Comments: 6 pages, 4 figures, ECML-PKDD Workshop on Automating Data Science 2021

  37. arXiv:2109.05232  [pdf, other

    cs.CV

    Joint Debiased Representation Learning and Imbalanced Data Clustering

    Authors: Mina Rezaei, Emilio Dorigatti, David Ruegamer, Bernd Bischl

    Abstract: One of the most promising approaches for unsupervised learning is combining deep representation learning and deep clustering. Some recent works propose to simultaneously learn representation using deep neural networks and perform clustering by defining a clustering loss on top of embedded features. However, these approaches are sensitive to imbalanced data and out-of-distribution samples. As a con… ▽ More

    Submitted 6 September, 2022; v1 submitted 11 September, 2021; originally announced September 2021.

  38. arXiv:2107.14330  [pdf, ps, other

    cs.CY cs.LG

    Developing Open Source Educational Resources for Machine Learning and Data Science

    Authors: Ludwig Bothmann, Sven Strickroth, Giuseppe Casalicchio, David Rügamer, Marius Lindauer, Fabian Scheipl, Bernd Bischl

    Abstract: Education should not be a privilege but a common good. It should be openly accessible to everyone, with as few barriers as possible; even more so for key technologies such as Machine Learning (ML) and Data Science (DS). Open Educational Resources (OER) are a crucial factor for greater educational equity. In this paper, we describe the specific requirements for OER in ML and DS and argue that it is… ▽ More

    Submitted 10 August, 2021; v1 submitted 28 July, 2021; originally announced July 2021.

    Comments: 6 pages

    Journal ref: Proceedings of the Third Teaching Machine Learning and Artificial Intelligence Workshop, PMLR 207:1-6, 2022

  39. arXiv:2104.02705  [pdf, other

    stat.ML cs.LG stat.CO

    deepregression: a Flexible Neural Network Framework for Semi-Structured Deep Distributional Regression

    Authors: David Rügamer, Chris Kolb, Cornelius Fritz, Florian Pfisterer, Philipp Kopper, Bernd Bischl, Ruolin Shen, Christina Bukas, Lisa Barros de Andrade e Sousa, Dominik Thalmeier, Philipp Baumann, Lucas Kook, Nadja Klein, Christian L. Müller

    Abstract: In this paper we describe the implementation of semi-structured deep distributional regression, a flexible framework to learn conditional distributions based on the combination of additive regression models and deep networks. Our implementation encompasses (1) a modular neural network building system based on the deep learning library \pkg{TensorFlow} for the fusion of various statistical and deep… ▽ More

    Submitted 10 March, 2022; v1 submitted 6 April, 2021; originally announced April 2021.

  40. Deep Semi-Supervised Learning for Time Series Classification

    Authors: Jann Goschenhofer, Rasmus Hvingelby, David Rügamer, Janek Thomas, Moritz Wagner, Bernd Bischl

    Abstract: While Semi-supervised learning has gained much attention in computer vision on image data, yet limited research exists on its applicability in the time series domain. In this work, we investigate the transferability of state-of-the-art deep semi-supervised models from image to time series classification. We discuss the necessary model adaptations, in particular an appropriate model backbone archit… ▽ More

    Submitted 16 February, 2022; v1 submitted 6 February, 2021; originally announced February 2021.

    Journal ref: 2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)

  41. arXiv:2101.00661  [pdf, other

    cs.LG stat.AP

    Combining Graph Neural Networks and Spatio-temporal Disease Models to Predict COVID-19 Cases in Germany

    Authors: Cornelius Fritz, Emilio Dorigatti, David Rügamer

    Abstract: During 2020, the infection rate of COVID-19 has been investigated by many scholars from different research fields. In this context, reliable and interpretable forecasts of disease incidents are a vital tool for policymakers to manage healthcare resources. Several experts have called for the necessity to account for human mobility to explain the spread of COVID-19. Existing approaches are often app… ▽ More

    Submitted 3 January, 2021; originally announced January 2021.

  42. arXiv:2011.05824  [pdf, other

    cs.LG cs.AI stat.ML

    Semi-Structured Deep Piecewise Exponential Models

    Authors: Philipp Kopper, Sebastian Pölsterl, Christian Wachinger, Bernd Bischl, Andreas Bender, David Rügamer

    Abstract: We propose a versatile framework for survival analysis that combines advanced concepts from statistics with deep learning. The presented framework is based on piecewise exponential models and thereby supports various survival tasks, such as competing risks and multi-state modeling, and further allows for estimation of time-varying effects and time-varying features. To also include multiple data so… ▽ More

    Submitted 1 March, 2021; v1 submitted 11 November, 2020; originally announced November 2020.

    Comments: 8 pages, 3 figures, Accepted at the AAAI spring symposium: Survival Prediction

  43. Deep Conditional Transformation Models

    Authors: Philipp F. M. Baumann, Torsten Hothorn, David Rügamer

    Abstract: Learning the cumulative distribution function (CDF) of an outcome variable conditional on a set of features remains challenging, especially in high-dimensional settings. Conditional transformation models provide a semi-parametric approach that allows to model a large class of conditional CDFs without an explicit parametric distribution assumption and with only a few parameters. Existing estimation… ▽ More

    Submitted 6 April, 2021; v1 submitted 15 October, 2020; originally announced October 2020.

    Journal ref: Machine Learning and Knowledge Discovery in Databases. Research Track. ECML PKDD 2021

  44. arXiv:2010.06889  [pdf, other

    stat.CO cs.LG stat.ML

    Neural Mixture Distributional Regression

    Authors: David Rügamer, Florian Pfisterer, Bernd Bischl

    Abstract: We present neural mixture distributional regression (NMDR), a holistic framework to estimate complex finite mixtures of distributional regressions defined by flexible additive predictors. Our framework is able to handle a large number of mixtures of potentially different distributions in high-dimensional settings, allows for efficient and scalable optimization and can be applied to recent concepts… ▽ More

    Submitted 14 October, 2020; originally announced October 2020.

  45. arXiv:2006.15442  [pdf, other

    stat.ML cs.LG stat.CO

    A General Machine Learning Framework for Survival Analysis

    Authors: Andreas Bender, David Rügamer, Fabian Scheipl, Bernd Bischl

    Abstract: The modeling of time-to-event data, also known as survival analysis, requires specialized methods that can deal with censoring and truncation, time-varying features and effects, and that extend to settings with multiple competing events. However, many machine learning methods for survival analysis only consider the standard setting with right-censored data and proportional hazards assumption. The… ▽ More

    Submitted 17 April, 2021; v1 submitted 27 June, 2020; originally announced June 2020.

  46. arXiv:2002.05777  [pdf, other

    stat.ML cs.LG stat.ME

    Semi-Structured Distributional Regression -- Extending Structured Additive Models by Arbitrary Deep Neural Networks and Data Modalities

    Authors: David Rügamer, Chris Kolb, Nadja Klein

    Abstract: Combining additive models and neural networks allows to broaden the scope of statistical regression and extend deep learning-based approaches by interpretable structured additive predictors at the same time. Existing attempts uniting the two modeling approaches are, however, limited to very specific combinations and, more importantly, involve an identifiability issue. As a consequence, interpretab… ▽ More

    Submitted 9 July, 2022; v1 submitted 13 February, 2020; originally announced February 2020.

  47. arXiv:1805.01852  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Inference for $L_2$-Boosting

    Authors: David Rügamer, Sonja Greven

    Abstract: We propose a statistical inference framework for the component-wise functional gradient descent algorithm (CFGD) under normality assumption for model errors, also known as $L_2$-Boosting. The CFGD is one of the most versatile tools to analyze data, because it scales well to high-dimensional data sets, allows for a very flexible definition of additive regression models and incorporates inbuilt vari… ▽ More

    Submitted 4 June, 2019; v1 submitted 4 May, 2018; originally announced May 2018.