Showing 1–6 of 6 results for author: Eickhoff, S B

  1. arXiv:2311.04179  [pdf]

    cs.LG cs.AI

    On Leakage in Machine Learning Pipelines

    Authors: Leonard Sasse, Eliana Nicolaisen-Sobesky, Juergen Dukart, Simon B. Eickhoff, Michael Götz, Sami Hamdan, Vera Komeyer, Abhijit Kulkarni, Juha Lahnakoski, Bradley C. Love, Federico Raimondo, Kaustubh R. Patil

    Abstract: Machine learning (ML) provides powerful tools for predictive modeling. ML's popularity stems from the promise of sample-level prediction with applications across a variety of fields from physics and marketing to healthcare. However, if not properly implemented and evaluated, ML pipelines may contain leakage typically resulting in overoptimistic performance estimates and failure to generalize to ne…

    Submitted 5 March, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: second draft

  2. arXiv:2210.09232  [pdf, other]

    cs.LG cs.AI stat.ML

    Confound-leakage: Confound Removal in Machine Learning Leads to Leakage

    Authors: Sami Hamdan, Bradley C. Love, Georg G. von Polier, Susanne Weis, Holger Schwender, Simon B. Eickhoff, Kaustubh R. Patil

    Abstract: Machine learning (ML) approaches to data analysis are now widely adopted in many fields including epidemiology and medicine. To apply these approaches, confounds must first be removed as is commonly done by featurewise removal of their variance by linear regression before applying ML. Here, we show this common approach to confound removal biases ML models, leading to misleading results. Specifical…

    Submitted 27 October, 2022; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: Revised Introduction, added CoI, results unchanged

  3. arXiv:2208.07081  [pdf, other]

    stat.ME cs.LG stat.ML

    Predictive Data Calibration for Linear Correlation Significance Testing

    Authors: Kaustubh R. Patil, Simon B. Eickhoff, Robert Langner

    Abstract: Inferring linear relationships lies at the heart of many empirical investigations. A measure of linear dependence should correctly evaluate the strength of the relationship as well as qualify whether it is meaningful for the population. Pearson's correlation coefficient (PCC), the de facto measure for bivariate relationships, is known to lack in both regards. The estimated strength $r$ ma…

    Submitted 15 August, 2022; originally announced August 2022.

    ACM Class: G.3; I.2.6; J.3

  4. arXiv:2207.11352  [pdf, other]

    eess.IV cs.CV

    Deep neural network heatmaps capture Alzheimer's disease patterns reported in a large meta-analysis of neuroimaging studies

    Authors: Di Wang, Nicolas Honnorat, Peter T. Fox, Kerstin Ritter, Simon B. Eickhoff, Sudha Seshadri, Mohamad Habes

    Abstract: Deep neural networks currently provide the most advanced and accurate machine learning models to distinguish between structural MRI scans of subjects with Alzheimer's disease and healthy controls. Unfortunately, the subtle brain alterations captured by these models are difficult to interpret because of the complexity of these multi-layer and non-linear models. Several heatmap methods have been pro…

    Submitted 22 July, 2022; originally announced July 2022.

  5. arXiv:2101.10091  [pdf]

    cs.CY

    JTrack: A Digital Biomarker Platform for Remote Monitoring in Neurological and Psychiatric Diseases

    Authors: Mehran Sahandi Far, Michael Stolz, Jona M. Fischer, Simon B. Eickhoff, Juergen Dukart

    Abstract: Objective: Health-related data being collected by smartphones offer a promising complementary approach to in-clinic assessments. Here we introduce the JTrack platform as a secure, reliable and extendable open-source solution for remote monitoring in daily-life and digital phenotyping. Method: JTrack consists of an Android-based smartphone application and a web-based project management dashboard. A…

    Submitted 2 February, 2021; v1 submitted 18 January, 2021; originally announced January 2021.

    Comments: package name for application is changed

  6. arXiv:1912.06686  [pdf, other]

    q-bio.NC cs.CV eess.IV

    Systematic Misestimation of Machine Learning Performance in Neuroimaging Studies of Depression

    Authors: Claas Flint, Micah Cearns, Nils Opel, Ronny Redlich, David M. A. Mehler, Daniel Emden, Nils R. Winter, Ramona Leenings, Simon B. Eickhoff, Tilo Kircher, Axel Krug, Igor Nenadic, Volker Arolt, Scott Clark, Bernhard T. Baune, Xiaoyi Jiang, Udo Dannlowski, Tim Hahn

    Abstract: We currently observe a disconcerting phenomenon in machine learning studies in psychiatry: While we would expect larger samples to yield better results due to the availability of more data, larger machine learning studies consistently show much weaker performance than the numerous small-scale studies. Here, we systematically investigated this effect focusing on one of the most heavily studied ques…

    Submitted 3 May, 2021; v1 submitted 13 December, 2019; originally announced December 2019.

    Journal ref: Neuropsychopharmacology 46 (2021) 1510-1517