Skip to main content

Showing 1–22 of 22 results for author: Lessmann, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.13009  [pdf, other

    stat.ML cs.LG

    Fighting Sampling Bias: A Framework for Training and Evaluating Credit Scoring Models

    Authors: Nikita Kozodoi, Stefan Lessmann, Morteza Alamgir, Luis Moreira-Matias, Konstantinos Papakonstantinou

    Abstract: Scoring models support decision-making in financial institutions. Their estimation and evaluation are based on the data of previously accepted applicants with known repayment behavior. This creates sampling bias: the available labeled data offers a partial picture of the distribution of candidate borrowers, which the model is supposed to score. The paper addresses the adverse effect of sampling bi… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

  2. arXiv:2403.15886  [pdf, other

    cs.CL cs.AI cs.LG

    Leveraging Zero-Shot Prompting for Efficient Language Model Distillation

    Authors: Lukas Vöge, Vincent Gurgul, Stefan Lessmann

    Abstract: This paper introduces a novel approach for efficiently distilling LLMs into smaller, application-specific models, significantly reducing operational costs and manual labor. Addressing the challenge of deploying computationally intensive LLMs in specific applications or edge devices, this technique utilizes LLMs' reasoning capabilities to generate labels and natural language rationales for unlabele… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  3. arXiv:2312.05234  [pdf, other

    stat.ML cs.LG

    The impact of heteroskedasticity on uplift modeling

    Authors: Björn Bokelmann, Stefan Lessmann

    Abstract: There are various applications, where companies need to decide to which individuals they should best allocate treatment. To support such decisions, uplift models are applied to predict treatment effects on an individual level. Based on the predicted treatment effects, individuals can be ranked and treatment allocation can be prioritized according to this ranking. An implicit assumption, which has… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 10 pages, 4 figures

  4. arXiv:2311.14759  [pdf, other

    q-fin.ST cs.LG stat.ML

    Forecasting Cryptocurrency Prices Using Deep Learning: Integrating Financial, Blockchain, and Text Data

    Authors: Vincent Gurgul, Stefan Lessmann, Wolfgang Karl Härdle

    Abstract: This paper explores the application of Machine Learning (ML) and Natural Language Processing (NLP) techniques in cryptocurrency price forecasting, specifically Bitcoin (BTC) and Ethereum (ETH). Focusing on news and social media data, primarily from Twitter and Reddit, we analyse the influence of public sentiment on cryptocurrency valuations using advanced deep learning NLP methods. Alongside conve… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

  5. arXiv:2308.02680  [pdf, other

    cs.CY cs.LG

    Fair Models in Credit: Intersectional Discrimination and the Amplification of Inequity

    Authors: Savina Kim, Stefan Lessmann, Galina Andreeva, Michael Rovatsos

    Abstract: The increasing usage of new data sources and machine learning (ML) technology in credit modeling raises concerns with regards to potentially unfair decision-making that rely on protected characteristics (e.g., race, sex, age) or other socio-economic and demographic data. The authors demonstrate the impact of such algorithmic bias in the microfinance context. Difficulties in assessing credit are di… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  6. arXiv:2307.11845  [pdf, other

    cs.CL cs.AI cs.CV q-fin.CP

    Multimodal Document Analytics for Banking Process Automation

    Authors: Christopher Gerling, Stefan Lessmann

    Abstract: Traditional banks face increasing competition from FinTechs in the rapidly evolving financial ecosystem. Raising operational efficiency is vital to address this challenge. Our study aims to improve the efficiency of document-intensive business processes in banking. To that end, we first review the landscape of business documents in the retail segment. Banking documents often contain text, layout,… ▽ More

    Submitted 26 November, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: A Preprint

  7. arXiv:2305.11575  [pdf, other

    stat.ML cs.LG

    The Deep Promotion Time Cure Model

    Authors: Victor Medina-Olivares, Stefan Lessmann, Nadja Klein

    Abstract: We propose a novel method for predicting time-to-event in the presence of cure fractions based on flexible survivals models integrated into a deep neural network framework. Our approach allows for non-linear relationships and high-dimensional interactions between covariates and survival and is suitable for large-scale applications. Furthermore, we allow the method to incorporate an identified pred… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

  8. arXiv:2211.00921  [pdf, other

    q-fin.RM cs.LG

    A Data-driven Case-based Reasoning in Bankruptcy Prediction

    Authors: Wei Li, Wolfgang Karl Härdle, Stefan Lessmann

    Abstract: There has been intensive research regarding machine learning models for predicting bankruptcy in recent years. However, the lack of interpretability limits their growth and practical implementation. This study proposes a data-driven explainable case-based reasoning (CBR) system for bankruptcy prediction. Empirical results from a comparative study show that the proposed approach performs superior t… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

  9. arXiv:2204.05781  [pdf

    q-fin.ST cs.LG

    Forecasting Cryptocurrency Returns from Sentiment Signals: An Analysis of BERT Classifiers and Weak Supervision

    Authors: Duygu Ider, Stefan Lessmann

    Abstract: Anticipating price developments in financial markets is a topic of continued interest in forecasting. Funneled by advancements in deep learning and natural language processing (NLP) together with the availability of vast amounts of textual data in form of news articles, social media postings, etc., an increasing number of studies incorporate text-based predictors in forecasting models. We contribu… ▽ More

    Submitted 19 March, 2023; v1 submitted 6 April, 2022; originally announced April 2022.

    Comments: 29 pages

  10. arXiv:2112.08060  [pdf, other

    cs.LG cs.CV stat.ML

    Leveraging Image-based Generative Adversarial Networks for Time Series Generation

    Authors: Justin Hellermann, Stefan Lessmann

    Abstract: Generative models for images have gained significant attention in computer vision and natural language processing due to their ability to generate realistic samples from complex data distributions. To leverage the advances of image-based generative models for the time series domain, we propose a two-dimensional image representation for time series, the Extended Intertemporal Return Plot (XIRP). Ou… ▽ More

    Submitted 31 August, 2023; v1 submitted 15 December, 2021; originally announced December 2021.

  11. arXiv:2111.11344  [pdf, other

    cs.LG stat.ML

    Modeling Irregular Time Series with Continuous Recurrent Units

    Authors: Mona Schirmer, Mazin Eltayeb, Stefan Lessmann, Maja Rudolph

    Abstract: Recurrent neural networks (RNNs) are a popular choice for modeling sequential data. Modern RNN architectures assume constant time-intervals between observations. However, in many datasets (e.g. medical records) observation times are irregular and can carry important information. To address this challenge, we propose continuous recurrent units (CRUs) -- a neural architecture that can naturally hand… ▽ More

    Submitted 26 July, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

    Comments: Accepted at ICML 2022, Baltimore, Maryland

  12. arXiv:2105.14599  [pdf

    cs.IR

    Personalization in E-Grocery: Top-N versus Top-k Rankings

    Authors: Franziska Scherpinski, Stefan Lessmann

    Abstract: Business success in e-commerce depends on customer perceived value. A customer with high perceived value buys, returns, and recommends items. The perceived value is at risk whenever the information load harms users' shopping experience. In e-grocery, shoppers face an overwhelming number of items, the majority of which is irrelevant for the shopper. Recommender systems (RS) enable businesses to mas… ▽ More

    Submitted 30 May, 2021; originally announced May 2021.

    MSC Class: 62P20 ACM Class: H.3.3

  13. arXiv:2103.01907  [pdf, other

    stat.ML cs.LG q-fin.RM

    Fairness in Credit Scoring: Assessment, Implementation and Profit Implications

    Authors: Nikita Kozodoi, Johannes Jacob, Stefan Lessmann

    Abstract: The rise of algorithmic decision-making has spawned much research on fair machine learning (ML). Financial institutions use ML for building risk scorecards that support a range of credit-related decisions. Yet, the literature on fair ML in credit scoring is scarce. The paper makes three contributions. First, we revisit statistical fairness criteria and examine their adequacy for credit scoring. Se… ▽ More

    Submitted 17 June, 2022; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: Accepted to European Journal of Operational Research

  14. arXiv:2101.03336  [pdf

    cs.LG

    Interpretable Multiple Treatment Revenue Uplift Modeling

    Authors: Robin M. Gubela, Stefan Lessmann

    Abstract: Big data and business analytics are critical drivers of business and societal transformations. Uplift models support a firm's decision-making by predicting the change of a customer's behavior due to a treatment. Prior work examines models for single treatments and binary customer responses. The paper extends corresponding approaches by developing uplift models for multiple treatments and continuou… ▽ More

    Submitted 9 January, 2021; originally announced January 2021.

    Journal ref: Proceedings of the 26th Americas Conference on Information Systems (AMCIS 2020)

  15. arXiv:2008.09202  [pdf, other

    cs.LG

    Conditional Wasserstein GAN-based Oversampling of Tabular Data for Imbalanced Learning

    Authors: Justin Engelmann, Stefan Lessmann

    Abstract: Class imbalance is a common problem in supervised learning and impedes the predictive performance of classification models. Popular countermeasures include oversampling the minority class. Standard methods like SMOTE rely on finding nearest neighbours and linear interpolations which are problematic in case of high-dimensional, complex data distributions. Generative Adversarial Networks (GANs) have… ▽ More

    Submitted 20 August, 2020; originally announced August 2020.

  16. Response Transformation and Profit Decomposition for Revenue Uplift Modeling

    Authors: Robin M. Gubela, Stefan Lessmann, Szymon Jaroszewicz

    Abstract: Uplift models support decision-making in marketing campaign planning. Estimating the causal effect of a marketing treatment, an uplift model facilitates targeting communication to responsive customers and efficient allocation of marketing budgets. Research into uplift models focuses on conversion models to maximize incremental sales. The paper introduces uplift modeling strategies for maximizing i… ▽ More

    Submitted 20 November, 2019; originally announced November 2019.

    Comments: 53 pages including online appendix

    Journal ref: European Journal of Operational Research 2019

  17. arXiv:1910.00393  [pdf, other

    cs.LG stat.AP stat.ML

    Affordable Uplift: Supervised Randomization in Controlled Experiments

    Authors: Johannes Haupt, Daniel Jacob, Robin M. Gubela, Stefan Lessmann

    Abstract: Customer scoring models are the core of scalable direct marketing. Uplift models provide an estimate of the incremental benefit from a treatment that is used for operational decision-making. Training and monitoring of uplift models require experimental data. However, the collection of data under randomized treatment assignment is costly, since random targeting deviates from an established targetin… ▽ More

    Submitted 1 October, 2019; originally announced October 2019.

    MSC Class: 68U35

  18. arXiv:1909.11114  [pdf, other

    stat.AP cs.LG stat.ML

    Churn Prediction with Sequential Data and Deep Neural Networks. A Comparative Analysis

    Authors: C. Gary Mena, Arno De Caigny, Kristof Coussement, Koen W. De Bock, Stefan Lessmann

    Abstract: Off-the-shelf machine learning algorithms for prediction such as regularized logistic regression cannot exploit the information of time-varying features without previously using an aggregation procedure of such sequential data. However, recurrent neural networks provide an alternative approach by which time-varying features can be readily used for modeling. This paper assesses the performance of n… ▽ More

    Submitted 24 September, 2019; originally announced September 2019.

  19. arXiv:1909.06108  [pdf, ps, other

    stat.ML cs.LG q-fin.RM

    Shallow Self-Learning for Reject Inference in Credit Scoring

    Authors: Nikita Kozodoi, Panagiotis Katsas, Stefan Lessmann, Luis Moreira-Matias, Konstantinos Papakonstantinou

    Abstract: Credit scoring models support loan approval decisions in the financial services industry. Lenders train these models on data from previously granted credit applications, where the borrowers' repayment behavior has been observed. This approach creates sample bias. The scoring model (i.e., classifier) is trained on accepted cases only. Applying the resulting model to screen credit applications from… ▽ More

    Submitted 13 September, 2019; originally announced September 2019.

    Comments: Preprint of the paper accepted to ECML PKDD 2019

    Journal ref: ECML PKDD 2019. Lecture Notes in Computer Science, vol 11908. Springer, Cham

  20. arXiv:1901.01726  [pdf

    cs.SE

    Evaluating software defect prediction performance: an updated benchmarking study

    Authors: Libo Li, Stefan Lessmann, Bart Baesens

    Abstract: Accurately predicting faulty software units helps practitioners target faulty units and prioritize their efforts to maintain software quality. Prior studies use machine-learning models to detect faulty software code. We revisit past studies and point out potential improvements. Our new study proposes a revised benchmarking configuration. The configuration considers many new dimensions, such as cla… ▽ More

    Submitted 7 January, 2019; originally announced January 2019.

  21. arXiv:1812.06175  [pdf, other

    q-fin.RM cs.LG stat.AP

    Can Deep Learning Predict Risky Retail Investors? A Case Study in Financial Risk Behavior Forecasting

    Authors: Yaodong Yang, Alisa Kolesnikova, Stefan Lessmann, Tiejun Ma, Ming-Chien Sung, Johnnie E. V. Johnson

    Abstract: The paper examines the potential of deep learning to support decisions in financial risk management. We develop a deep learning model for predicting whether individual spread traders secure profits from future trades. This task embodies typical modeling challenges faced in risk and behavior forecasting. Conventional machine learning requires data that is representative of the feature-target relati… ▽ More

    Submitted 17 November, 2019; v1 submitted 14 December, 2018; originally announced December 2018.

    Comments: Within the "equal" contribution, Yaodong Yang contributed the core deep learning algorithm along with its experimental results, and the first draft of the manuscript (including Figure 1,2,3,4,7,8,9,11, and Table 3)

  22. Robust identification of email tracking: A machine learning approach

    Authors: Johannes Haupt, Benedict Bender, Benjamin Fabian, Stefan Lessmann

    Abstract: Email tracking allows email senders to collect fine-grained behavior and location data on email recipients, who are uniquely identifiable via their email address. Such tracking invades user privacy in that email tracking techniques gather data without user consent or awareness. Striving to increase privacy in email communication, this paper develops a detection engine to be the core of a selective… ▽ More

    Submitted 11 June, 2018; originally announced June 2018.

    Comments: Accepted publication, In press, European Journal of Operational Research, 2018