Zum Hauptinhalt springen

Showing 1–1 of 1 results for author: Lahnakoski, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.04179  [pdf

    cs.LG cs.AI

    On Leakage in Machine Learning Pipelines

    Authors: Leonard Sasse, Eliana Nicolaisen-Sobesky, Juergen Dukart, Simon B. Eickhoff, Michael Götz, Sami Hamdan, Vera Komeyer, Abhijit Kulkarni, Juha Lahnakoski, Bradley C. Love, Federico Raimondo, Kaustubh R. Patil

    Abstract: Machine learning (ML) provides powerful tools for predictive modeling. ML's popularity stems from the promise of sample-level prediction with applications across a variety of fields from physics and marketing to healthcare. However, if not properly implemented and evaluated, ML pipelines may contain leakage typically resulting in overoptimistic performance estimates and failure to generalize to ne… ▽ More

    Submitted 5 March, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: second draft