Showing 1–50 of 50 results for author: Klein, N

  1. arXiv:2406.18454

    cs.CY stat.AP

    From Counting Stations to City-Wide Estimates: Data-Driven Bicycle Volume Extrapolation

    Authors: Silke K. Kaiser, Nadja Klein, Lynn H. Kaack

    Abstract: Shifting to cycling in urban areas reduces greenhouse gas emissions and improves public health. Street-level bicycle volume information would aid cities in planning targeted infrastructure improvements to encourage cycling and provide civil society with evidence to advocate for cyclists' needs. Yet, the data currently available to cities and citizens often only comes from sparsely located counting…

    Submitted 1 August, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2406.03900

    stat.ME stat.AP

    Enhanced variable selection for boosting sparser and less complex models in distributional copula regression

    Authors: Annika Strömer, Nadja Klein, Christian Staerk, Florian Faschingbauer, Hannah Klinkhammer, Andreas Mayr

    Abstract: Structured additive distributional copula regression allows modeling the joint distribution of multivariate outcomes by relating all distribution parameters to covariates. Estimation via statistical boosting enables accounting for high-dimensional data and incorporating data-driven variable selection, both of which are useful given the complexity of the model class. However, as known from univaria…

    Submitted 6 June, 2024; originally announced June 2024.

  3. arXiv:2403.02194

    stat.ME

    Boosting Distributional Copula Regression for Bivariate Binary, Discrete and Mixed Responses

    Authors: Guillermo Briseño Sanchez, Nadja Klein, Hannah Klinkhammer, Andreas Mayr

    Abstract: Motivated by challenges in the analysis of biomedical data and observational studies, we develop statistical boosting for the general class of bivariate distributional copula regression with arbitrary marginal distributions, which is suited to model binary, count, continuous or mixed outcomes. In our framework, the joint distribution of arbitrary, bivariate responses is modelled through a parametr…

    Submitted 4 March, 2024; originally announced March 2024.

  4. arXiv:2401.11804

    stat.ME

    Regression Copulas for Multivariate Responses

    Authors: Nadja Klein, Michael Stanley Smith, David Nott, Ryan Chisholm

    Abstract: We propose a novel distributional regression model for a multivariate response vector based on a copula process over the covariate space. It uses the implicit copula of a Gaussian multivariate regression, which we call a "regression copula". To allow for large covariate vectors, their coefficients are regularized using a novel multivariate extension of the horseshoe prior. Bayesian inference and…

    Submitted 5 March, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

  5. arXiv:2401.06523

    stat.ML cs.LG math.PR math.ST

    Boosting Causal Additive Models

    Authors: Maximilian Kertel, Nadja Klein

    Abstract: We present a boosting-based method to learn additive Structural Equation Models (SEMs) from observational data, with a focus on the theoretical aspects of determining the causal order among variables. We introduce a family of score functions based on arbitrary regression techniques, for which we establish necessary conditions to consistently favor the true causal ordering. Our analysis reveals tha…

    Submitted 12 January, 2024; originally announced January 2024.

  6. arXiv:2401.03881

    stat.ME stat.AP

    Density regression via Dirichlet process mixtures of normal structured additive regression models

    Authors: María Xosé Rodríguez-Álvarez, Vanda Inácio, Nadja Klein

    Abstract: Within Bayesian nonparametrics, dependent Dirichlet process mixture models provide a highly flexible approach for conducting inference about the conditional density function. However, several formulations of this class make either rather restrictive modelling assumptions or involve intricate algorithms for posterior inference, thus preventing their widespread use. In response to these challenges,…

    Submitted 13 May, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

  7. arXiv:2401.00840

    stat.ME

    Bayesian Effect Selection in Additive Models with an Application to Time-to-Event Data

    Authors: Paul Bach, Nadja Klein

    Abstract: Accurately selecting and estimating smooth functional effects in additive models with potentially many functions is a challenging task. We introduce a novel Demmler-Reinsch basis expansion to model the functional effects that allows us to orthogonally decompose an effect into its linear and nonlinear parts. We show that our representation allows both parts to be estimated consistently, as opposed to co…

    Submitted 1 January, 2024; originally announced January 2024.

  8. arXiv:2312.00439

    stat.ME

    Modeling the Ratio of Correlated Biomarkers Using Copula Regression

    Authors: Moritz Berger, Nadja Klein, Michael Wagner, Matthias Schmid

    Abstract: Modeling the ratio of two dependent components as a function of covariates is a frequently pursued objective in observational research. Despite the high relevance of this topic in medical studies, where biomarker ratios are often used as surrogate endpoints for specific diseases, existing models are based on oversimplified assumptions, assuming e.g. independence or strictly positive associations…

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 32 pages, 6 figures, 5 tables

  9. arXiv:2311.07371

    stat.ME

    Scalable Estimation for Structured Additive Distributional Regression Through Variational Inference

    Authors: Jana Kleinemeier, Nadja Klein

    Abstract: Structured additive distributional regression models offer a versatile framework for estimating complete conditional distributions by relating all parameters of a parametric distribution to covariates. Although these models efficiently leverage information in vast and intricate data sets, they often result in highly parameterized models with many unknowns. Standard estimation methods, like Bayesia…

    Submitted 13 November, 2023; originally announced November 2023.

  10. arXiv:2311.07156

    stat.ME stat.AP

    Deep mixture of linear mixed models for complex longitudinal data

    Authors: Lucas Kock, Nadja Klein, David J. Nott

    Abstract: Mixtures of linear mixed models are widely used for modelling longitudinal data for which observation times differ between subjects. In typical applications, temporal trends are described using a basis expansion, with basis coefficients treated as random effects varying by subject. Additional random effects can describe variation between mixture components, or other known sources of variation in c…

    Submitted 13 November, 2023; originally announced November 2023.

  11. arXiv:2306.02711

    stat.ME

    Truly Multivariate Structured Additive Distributional Regression

    Authors: Lucas Kock, Nadja Klein

    Abstract: Generalized additive models for location, scale and shape (GAMLSS) are a popular extension to mean regression models where each parameter of an arbitrary distribution is modelled through covariates. While such models have been developed for univariate and bivariate responses, the truly multivariate case remains extremely challenging for both computational and theoretical reasons. Alternative appro…

    Submitted 5 June, 2023; originally announced June 2023.

  12. arXiv:2305.11575

    stat.ML cs.LG

    The Deep Promotion Time Cure Model

    Authors: Victor Medina-Olivares, Stefan Lessmann, Nadja Klein

    Abstract: We propose a novel method for predicting time-to-event in the presence of cure fractions based on flexible survival models integrated into a deep neural network framework. Our approach allows for non-linear relationships and high-dimensional interactions between covariates and survival and is suitable for large-scale applications. Furthermore, we allow the method to incorporate an identified pred…

    Submitted 19 May, 2023; originally announced May 2023.

  13. arXiv:2305.06625

    stat.ML cs.LG

    Dropout Regularization in Extended Generalized Linear Models based on Double Exponential Families

    Authors: Benedikt Lütke Schwienhorst, Lucas Kock, Nadja Klein, David J. Nott

    Abstract: Even though dropout is a popular regularization technique, its theoretical properties are not fully understood. In this paper we study dropout regularization in extended generalized linear models based on double exponential families, for which the dispersion parameter can vary with the features. A theoretical analysis shows that dropout regularization prefers rare but important features in both th…

    Submitted 28 July, 2024; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: Added a real-world data application and comments on approximations in the appendix

  14. arXiv:2304.08673

    cs.LG stat.ML

    Semi-supervised Learning of Pushforwards For Domain Translation & Adaptation

    Authors: Nishant Panda, Natalie Klein, Dominic Yang, Patrick Gasda, Diane Oyen

    Abstract: Given two probability densities on related data spaces, we seek a map pushing one density to the other while satisfying application-dependent constraints. For maps to have utility in a broad application space (including domain translation, domain adaptation, and generative modeling), the map must be available to apply on out-of-sample data points and should correspond to a probabilistic model over…

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: 19 pages, 7 figures

  15. arXiv:2301.05593

    stat.CO stat.ML

    Scalable Estimation for Structured Additive Distributional Regression

    Authors: Nikolaus Umlauf, Johannes Seiler, Mattias Wetscher, Thorsten Simon, Stefan Lang, Nadja Klein

    Abstract: Recently, fitting probabilistic models has gained importance in many areas, but estimation of such distributional models with very large data sets is a difficult task. In particular, the use of rather complex models can easily lead to memory-related efficiency problems that can make estimation infeasible even on high-performance computers. We therefore propose a novel backfitting algorithm, which…

    Submitted 13 January, 2023; originally announced January 2023.

  16. arXiv:2212.01613

    stat.ME

    Accounting for Time Dependency in Meta-Analyses of Concordance Probability Estimates

    Authors: Matthias Schmid, Tim Friede, Nadja Klein, Leonie Weinhold

    Abstract: Recent years have seen the development of many novel scoring tools for disease prognosis and prediction. To become accepted for use in clinical applications, these tools have to be validated on external data. In practice, validation is often hampered by logistical issues, resulting in multiple small-sized validation studies. It is therefore necessary to synthesize the results of these studies usin…

    Submitted 3 December, 2022; originally announced December 2022.

  17. arXiv:2211.16218

    stat.CO stat.ME

    Anisotropic multidimensional smoothing using Bayesian tensor product P-splines

    Authors: Paul Bach, Nadja Klein

    Abstract: We introduce a highly efficient fully Bayesian approach for anisotropic multidimensional smoothing. The main challenge in this context is the Markov chain Monte Carlo update of the smoothing parameters as their full conditional posterior comprises a pseudo-determinant that appears to be intractable at first sight. As a consequence, most existing implementations are computationally feasible only fo…

    Submitted 29 November, 2022; originally announced November 2022.

  18. arXiv:2211.00348

    cs.LG stat.ML

    Informed Priors for Knowledge Integration in Trajectory Prediction

    Authors: Christian Schlauch, Nadja Klein, Christian Wirth

    Abstract: Informed machine learning methods allow the integration of prior knowledge into learning systems. This can increase accuracy and robustness or reduce data needs. However, existing methods often assume hard constraining knowledge that does not require trading off prior knowledge against observations but can be used to directly reduce the problem space. Other approaches use specific, architectural c…

    Submitted 1 November, 2022; originally announced November 2022.

    ACM Class: I.2.6

  19. arXiv:2211.00080

    cs.LG eess.SP stat.AP

    Denoising neural networks for magnetic resonance spectroscopy

    Authors: Natalie Klein, Amber J. Day, Harris Mason, Michael W. Malone, Sinead A. Williamson

    Abstract: In many scientific applications, measured time series are corrupted by noise or distortions. Traditional denoising techniques often fail to recover the signal of interest, particularly when the signal-to-noise ratio is low or when certain assumptions on the signal and noise are violated. In this work, we demonstrate that deep learning-based denoising methods can outperform traditional techniques w…

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: 5 pages with appendix

  20. arXiv:2210.10389

    stat.ME stat.CO stat.ML

    Distributional Adaptive Soft Regression Trees

    Authors: Nikolaus Umlauf, Nadja Klein

    Abstract: Random forests are an ensemble method relevant for many problems, such as regression or classification. They are popular due to their good predictive performance (compared to, e.g., decision trees) requiring only minimal tuning of hyperparameters. They are built via aggregation of multiple regression trees during training and are usually calculated recursively using hard splitting rules. Recently…

    Submitted 19 October, 2022; originally announced October 2022.

  21. arXiv:2207.08470

    stat.ME

    Boosting Multivariate Structured Additive Distributional Regression Models

    Authors: Annika Strömer, Nadja Klein, Christian Staerk, Hannah Klinkhammer, Andreas Mayr

    Abstract: We develop a model-based boosting approach for multivariate distributional regression within the framework of generalized additive models for location, scale, and shape. Our approach enables the simultaneous modeling of all distribution parameters of an arbitrary parametric distribution of a multivariate response conditional on explanatory variables, while being applicable to potentially high-dime…

    Submitted 18 July, 2022; originally announced July 2022.

  22. arXiv:2204.14214

    stat.ME stat.CO

    Actor Heterogeneity and Explained Variance in Network Models -- A Scalable Approach through Variational Approximations

    Authors: Nadja Klein, Göran Kauermann

    Abstract: The analysis of network data has gained considerable interest in recent years. This also includes the analysis of large, high-dimensional networks with hundreds and thousands of nodes. While exponential random graph models serve as a workhorse for network data analyses, their applicability to very large networks is problematic via classical inference such as maximum likelihood or exact Bayesian esti…

    Submitted 12 September, 2023; v1 submitted 29 April, 2022; originally announced April 2022.

  23. arXiv:2202.12851

    stat.ME

    Boosting Distributional Copula Regression

    Authors: Nicolai Hans, Nadja Klein, Florian Faschingbauer, Michael Schneider, Andreas Mayr

    Abstract: Capturing complex dependence structures between outcome variables (e.g., study endpoints) is of high relevance in contemporary biomedical data problems and medical research. Distributional copula regression provides a flexible tool to model the joint distribution of multiple outcome variables by disentangling the marginal response distributions and their dependence structure. In a regression setup…

    Submitted 25 February, 2022; originally announced February 2022.

  24. arXiv:2202.01657

    stat.ME stat.AP

    Deselection of Base-Learners for Statistical Boosting -- with an Application to Distributional Regression

    Authors: Annika Strömer, Christian Staerk, Nadja Klein, Leonie Weinhold, Stephanie Titze, Andreas Mayr

    Abstract: We present a new procedure for enhanced variable selection for component-wise gradient boosting. Statistical boosting is a computational approach that emerged from machine learning, which allows fitting regression models in the presence of high-dimensional data. Furthermore, the algorithm can lead to data-driven variable selection. In practice, however, the final models typically tend to include to…

    Submitted 3 February, 2022; originally announced February 2022.

  25. arXiv:2110.01050

    stat.ML cs.CV cs.LG

    Marginally calibrated response distributions for end-to-end learning in autonomous driving

    Authors: Clara Hoffmann, Nadja Klein

    Abstract: End-to-end learners for autonomous driving are deep neural networks that predict the instantaneous steering angle directly from images of the street ahead. These learners must provide reliable uncertainty estimates for their predictions in order to meet safety requirements and initiate a switch to manual control in areas of high uncertainty. Yet end-to-end learners typically only deliver poi…

    Submitted 3 October, 2021; originally announced October 2021.

    Comments: 17 pages, 9 figures

  26. arXiv:2109.04288

    math.ST stat.ME

    Posterior Concentration Rates for Bayesian Penalized Splines

    Authors: Paul Bach, Nadja Klein

    Abstract: Despite their widespread use in practice, the asymptotic properties of Bayesian penalized splines have not been investigated so far. We close this gap and study posterior concentration rates for Bayesian penalized splines in a Gaussian nonparametric regression model. A key feature of the approach is the hyperprior on the smoothing variance, which allows for adaptive smoothing in practice but compl…

    Submitted 23 March, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

  27. arXiv:2108.08709

    cs.LG math.SP stat.ML

    Neural density estimation and uncertainty quantification for laser induced breakdown spectroscopy spectra

    Authors: Katiana Kontolati, Natalie Klein, Nishant Panda, Diane Oyen

    Abstract: Constructing probability densities for inference in high-dimensional spectral data is often intractable. In this work, we use normalizing flows on structured spectral latent spaces to estimate such densities, enabling downstream inference tasks. In addition, we evaluate a method for uncertainty quantification when predicting unobserved state vectors associated with each spectrum. We demonstrate th…

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: 5 pages, 3 figures

  28. arXiv:2106.03737

    stat.ME

    A multivariate Gaussian random field prior against spatial confounding

    Authors: Isa Marques, Thomas Kneib, Nadja Klein

    Abstract: Spatial models are used in a variety of research areas, such as environmental sciences, epidemiology, or physics. A common phenomenon in many spatial regression models is spatial confounding. This phenomenon takes place when spatially indexed covariates modeling the mean of the response are correlated with the spatial random effect. As a result, estimates for regression coefficients of the covariates…

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: Submitted to Environmetrics and currently under review

  29. arXiv:2105.10890

    stat.ME stat.AP stat.CO

    Bayesian Effect Selection for Additive Quantile Regression with an Analysis to Air Pollution Thresholds

    Authors: Nadja Klein, Jorge Mateu

    Abstract: Statistical techniques used in air pollution modelling usually lack the possibility to understand which predictors affect air pollution in which functional form; and are not able to regress on exceedances over certain thresholds imposed by authorities directly. The latter naturally induce conditional quantiles and reflect the seriousness of particular events. In the present paper we focus on this…

    Submitted 23 May, 2021; originally announced May 2021.

  30. arXiv:2105.09003

    stat.ME math.ST

    Flexible Specification Testing in Quantile Regression Models

    Authors: Tim Kutzker, Nadja Klein, Dominik Wied

    Abstract: We propose three novel consistent specification tests for quantile regression models which generalize former tests in three ways. First, we allow the covariate effects to be quantile-dependent and nonlinear. Second, we allow parameterizing the conditional quantile functions by appropriate basis functions, rather than parametrically. We are hence able to test for functional forms beyond linearity,…

    Submitted 5 December, 2021; v1 submitted 19 May, 2021; originally announced May 2021.

  31. Variational Inference and Sparsity in High-Dimensional Deep Gaussian Mixture Models

    Authors: Lucas Kock, Nadja Klein, David J. Nott

    Abstract: Gaussian mixture models are a popular tool for model-based clustering, and mixtures of factor analyzers are Gaussian mixture models having parsimonious factor covariance structure for mixture components. There are several recent extensions of mixture of factor analyzers to deep mixtures, where the Gaussian model for the latent factors is replaced by a mixture of factor analyzers. This construction…

    Submitted 1 September, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

    Journal ref: Stat Comput 32, 70 (2022)

  32. arXiv:2104.14243

    stat.ME stat.AP

    Bivariate Analysis of Birth Weight and Gestational Age Depending on Environmental Exposures: Bayesian Distributional Regression with Copulas

    Authors: Jonathan Rathjens, Arthur Kolbe, Jürgen Hölzer, Katja Ickstadt, Nadja Klein

    Abstract: In this article, we analyze perinatal data with birth weight (BW) as the response variable of primary interest. Gestational age (GA) is usually an important covariate and included in polynomial form. However, in opposition to this univariate regression, bivariate modeling of BW and GA is recommended to distinguish effects on each, on both, and between them. Rather than a parametric bivariate distrib…

    Submitted 31 October, 2021; v1 submitted 29 April, 2021; originally announced April 2021.

    Comments: 27 pages, 7 figures (some of them composed from several pdf files)

  33. arXiv:2104.10070

    stat.AP q-bio.NC stat.ME

    Cross-population coupling of neural activity based on Gaussian process current source densities

    Authors: Natalie Klein, Joshua H. Siegle, Tobias Teichert, Robert E. Kass

    Abstract: Because local field potentials (LFPs) arise from multiple sources in different spatial locations, they do not easily reveal coordinated activity across neural populations on a trial-to-trial basis. As we show here, however, once disparate source signals are decoupled, their trial-to-trial fluctuations become more accessible, and cross-population correlations become more apparent. To decouple sourc…

    Submitted 8 November, 2021; v1 submitted 20 April, 2021; originally announced April 2021.

    Comments: Accepted for publication in PLOS Computational Biology

  34. arXiv:2104.02705

    stat.ML cs.LG stat.CO

    deepregression: a Flexible Neural Network Framework for Semi-Structured Deep Distributional Regression

    Authors: David Rügamer, Chris Kolb, Cornelius Fritz, Florian Pfisterer, Philipp Kopper, Bernd Bischl, Ruolin Shen, Christina Bukas, Lisa Barros de Andrade e Sousa, Dominik Thalmeier, Philipp Baumann, Lucas Kook, Nadja Klein, Christian L. Müller

    Abstract: In this paper we describe the implementation of semi-structured deep distributional regression, a flexible framework to learn conditional distributions based on the combination of additive regression models and deep networks. Our implementation encompasses (1) a modular neural network building system based on the deep learning library TensorFlow for the fusion of various statistical and deep…

    Submitted 10 March, 2022; v1 submitted 6 April, 2021; originally announced April 2021.

  35. arXiv:2012.11016

    stat.ME

    Bayesian Conditional Transformation Models

    Authors: Manuel Carlan, Thomas Kneib, Nadja Klein

    Abstract: Recent developments in statistical regression methodology shift away from pure mean regression towards distributional regression models. One important strand thereof is that of conditional transformation models (CTMs). CTMs infer the entire conditional distribution directly by applying a transformation function to the response conditionally on a set of covariates towards a simple log-concave refer…

    Submitted 21 May, 2022; v1 submitted 20 December, 2020; originally announced December 2020.

  36. arXiv:2010.01844

    stat.ME econ.EM stat.AP stat.CO stat.ML

    Deep Distributional Time Series Models and the Probabilistic Forecasting of Intraday Electricity Prices

    Authors: Nadja Klein, Michael Stanley Smith, David J. Nott

    Abstract: Recurrent neural networks (RNNs) with rich feature vectors of past values can provide accurate point forecasts for series that exhibit complex serial dependence. We propose two approaches to constructing deep time series probabilistic models based on a variant of RNN called an echo state network (ESN). The first is where the output layer of the ESN has stochastic disturbances and a shrinkage prior…

    Submitted 27 May, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

    Journal ref: Journal of Applied Econometrics (2023), 38(4), 493-511

  37. arXiv:2002.05777

    stat.ML cs.LG stat.ME

    Semi-Structured Distributional Regression -- Extending Structured Additive Models by Arbitrary Deep Neural Networks and Data Modalities

    Authors: David Rügamer, Chris Kolb, Nadja Klein

    Abstract: Combining additive models and neural networks makes it possible to broaden the scope of statistical regression and extend deep learning-based approaches by interpretable structured additive predictors at the same time. Existing attempts uniting the two modeling approaches are, however, limited to very specific combinations and, more importantly, involve an identifiability issue. As a consequence, interpretab…

    Submitted 9 July, 2022; v1 submitted 13 February, 2020; originally announced February 2020.

  38. arXiv:1911.08725

    stat.CO

    Assessment and adjustment of approximate inference algorithms using the law of total variance

    Authors: Xuejun Yu, David J. Nott, Minh-Ngoc Tran, Nadja Klein

    Abstract: A common method for assessing validity of Bayesian sampling or approximate inference methods makes use of simulated data replicates for parameters drawn from the prior. Under continuity assumptions, quantiles of functions of the simulated parameter values within corresponding posterior distributions are uniformly distributed. Checking for uniformity when a posterior density is approximated numeric…

    Submitted 20 November, 2019; originally announced November 2019.

  39. arXiv:1910.11044

    stat.ME stat.ML

    Torus Graphs for Multivariate Phase Coupling Analysis

    Authors: Natalie Klein, Josue Orellana, Scott Brincat, Earl K. Miller, Robert E. Kass

    Abstract: Angular measurements are often modeled as circular random variables, where there are natural circular analogues of moments, including correlation. Because a product of circles is a torus, a d-dimensional vector of circular random variables lies on a d-dimensional torus. For such vectors we present here a class of graphical models, which we call torus graphs, based on the full exponential family wi…

    Submitted 24 October, 2019; originally announced October 2019.

    Comments: N.K. and J.O. contributed equally to this work. Peer reviewed version, in press at The Annals of Applied Statistics. 10 main Figures, supplementary text appended with 11 supplementary figures

  40. arXiv:1909.11784

    stat.CO cs.LG stat.ME stat.ML

    bamlss: A Lego Toolbox for Flexible Bayesian Regression (and Beyond)

    Authors: Nikolaus Umlauf, Nadja Klein, Thorsten Simon, Achim Zeileis

    Abstract: Over the last decades, the challenges in applied regression and in predictive modeling have been changing considerably: (1) More flexible model specifications are needed as big(ger) data become available, facilitated by more powerful computing infrastructure. (2) Full probabilistic modeling rather than predicting just means or expectations is crucial in many applications. (3) Interest in Bayesian…

    Submitted 25 September, 2019; originally announced September 2019.

    Comments: 48 pages, 12 figures

  41. arXiv:1909.01274

    stat.AP

    In Search of Lost Edges: A Case Study on Reconstructing Financial Networks

    Authors: Michael Lebacher, Samantha Cook, Nadja Klein, Göran Kauermann

    Abstract: To capture the systemic complexity of international financial systems, network data is an important prerequisite. However, dyadic data is often not available, raising the need for methods that allow for reconstructing networks based on limited information. In this paper, we review different methods that are designed for the estimation of matrices from their marginals and potentially exogeno…

    Submitted 4 September, 2019; v1 submitted 3 September, 2019; originally announced September 2019.

  42. arXiv:1908.09482

    stat.ME stat.CO stat.ML

    Marginally-calibrated deep distributional regression

    Authors: Nadja Klein, David J. Nott, Michael Stanley Smith

    Abstract: Deep neural network (DNN) regression models are widely used in applications requiring state-of-the-art predictive accuracy. However, until recently there has been little work on accurate uncertainty quantification for predictions from such models. We add to this literature by outlining an approach to constructing predictive distributions that are 'marginally calibrated'. This is where the long run…

    Submitted 3 September, 2020; v1 submitted 26 August, 2019; originally announced August 2019.

    Journal ref: Journal of Computational and Graphical Statistics (2020)

  43. Bayesian Variable Selection for Non-Gaussian Responses: A Marginally Calibrated Copula Approach

    Authors: Nadja Klein, Michael Stanley Smith

    Abstract: We propose a new highly flexible and tractable Bayesian approach to undertake variable selection in non-Gaussian regression models. It uses a copula decomposition for the joint distribution of observations on the dependent variable. This allows the marginal distribution of the dependent variable to be calibrated accurately using a nonparametric or other estimator. The family of copulas employed ar…

    Submitted 3 September, 2020; v1 submitted 10 July, 2019; originally announced July 2019.

    Journal ref: Biometrics (2020)

  44. Bayesian Inference for Regression Copulas

    Authors: Michael Stanley Smith, Nadja Klein

    Abstract: We propose a new semi-parametric distributional regression smoother that is based on a copula decomposition of the joint distribution of the vector of response values. The copula is high-dimensional and constructed by inversion of a pseudo regression, where the conditional mean and variance are semi-parametric functions of covariates modeled using regularized basis functions. By integrating out th…

    Submitted 24 January, 2020; v1 submitted 10 July, 2019; originally announced July 2019.

    Comments: Journal of Business & Economic Statistics (2020)

  45. Multivariate Conditional Transformation Models

    Authors: Nadja Klein, Torsten Hothorn, Luisa Barbanti, Thomas Kneib

    Abstract: Regression models describing the joint distribution of multivariate response variables conditional on covariate information have become an important aspect of contemporary regression analysis. However, a limitation of such models is that they often rely on rather simplistic assumptions, e.g. a constant dependency structure that is not allowed to vary with the covariates or the restriction to linea…

    Submitted 3 September, 2020; v1 submitted 7 June, 2019; originally announced June 2019.

    Journal ref: Scandinavian Journal of Statistics (2022); 49: 116-142

  46. Bayesian Effect Selection in Structured Additive Distributional Regression Models

    Authors: Nadja Klein, Manuel Carlan, Thomas Kneib, Stefan Lang, Helga Wagner

    Abstract: We propose a novel spike and slab prior specification with scaled beta prime marginals for the importance parameters of regression coefficients to allow for general effect selection within the class of structured additive distributional regression. This enables us to model effects on all distributional parameters for arbitrary parametric distributions, and to consider various effect types such as…

    Submitted 27 February, 2019; originally announced February 2019.

    Journal ref: Bayesian Anal., Advance publication (2020), 29 pages

  47. arXiv:1805.07395  [pdf]

    stat.AP

    More green space is related to less antidepressant prescription rates in the Netherlands: A Bayesian geoadditive quantile regression approach

    Authors: Marco Helbich, Nadja Klein, Hannah Roberts, Paulien Hagedoorn, Peter Groenewegen

    Abstract: Exposure to green space seems to be beneficial for self-reported mental health. In this study we used an objective health indicator, namely antidepressant prescription rates. Current studies rely exclusively upon mean regression models assuming linear associations. It is, however, plausible that the presence of green space is non-linearly related with different quantiles of the outcome antidepress…

    Submitted 8 June, 2018; v1 submitted 18 May, 2018; originally announced May 2018.

    Journal ref: Environmental Research 2018

  48. Implicit Copulas from Bayesian Regularized Regression Smoothers

    Authors: Nadja Klein, Michael Stanley Smith

    Abstract: We show how to extract the implicit copula of a response vector from a Bayesian regularized regression smoother with Gaussian disturbances. The copula can be used to compare smoothers that employ different shrinkage priors and function bases. We illustrate with three popular choices of shrinkage priors --- a pairwise prior, the horseshoe prior and a g prior augmented with a point mass as employed…

    Submitted 14 May, 2018; v1 submitted 27 April, 2018; originally announced April 2018.

    Journal ref: Bayesian Anal. 14 (2019), no. 4, 1143--1171

  49. arXiv:1609.02686  [pdf, other]

    stat.ML stat.ME

    Boosting Joint Models for Longitudinal and Time-to-Event Data

    Authors: Elisabeth Waldmann, David Taylor-Robinson, Nadja Klein, Thomas Kneib, Tania Pressler, Matthias Schmid, Andreas Mayr

    Abstract: Joint models for longitudinal and time-to-event data have gained a lot of attention in the last few years as they are a helpful technique to approach a common data structure in clinical studies where longitudinal outcomes are recorded alongside event times. Those two processes are often linked and the two outcomes should thus be modeled jointly in order to prevent the potential bias introduced by…

    Submitted 22 December, 2016; v1 submitted 9 September, 2016; originally announced September 2016.

  50. Bayesian structured additive distributional regression with an application to regional income inequality in Germany

    Authors: Nadja Klein, Thomas Kneib, Stefan Lang, Alexander Sohn

    Abstract: We propose a generic Bayesian framework for inference in distributional regression models in which each parameter of a potentially complex response distribution and not only the mean is related to a structured additive predictor. The latter is composed additively of a variety of different functional effect types such as nonlinear effects, spatial effects, random coefficients, interaction surfaces…

    Submitted 17 September, 2015; originally announced September 2015.

    Comments: Published at http://dx.doi.org/10.1214/15-AOAS823 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS823

    Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 2, 1024-1052