Skip to main content

Showing 1–4 of 4 results for author: Rocks, J W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2303.14151  [pdf, other

    cs.LG stat.ML

    Double Descent Demystified: Identifying, Interpreting & Ablating the Sources of a Deep Learning Puzzle

    Authors: Rylan Schaeffer, Mikail Khona, Zachary Robertson, Akhilan Boopathy, Kateryna Pistunova, Jason W. Rocks, Ila Rani Fiete, Oluwasanmi Koyejo

    Abstract: Double descent is a surprising phenomenon in machine learning, in which as the number of model parameters grows relative to the number of data, test error drops as models grow ever larger into the highly overparameterized (data undersampled) regime. This drop in test error flies against classical learning theory on overfitting and has arguably underpinned the success of large models in machine lea… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

  2. arXiv:2203.05443  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Bias-variance decomposition of overparameterized regression with random linear features

    Authors: Jason W. Rocks, Pankaj Mehta

    Abstract: In classical statistics, the bias-variance trade-off describes how varying a model's complexity (e.g., number of fit parameters) affects its ability to make accurate predictions. According to this trade-off, optimal performance is achieved when a model is expressive enough to capture trends in the data, yet not so complex that it overfits idiosyncratic features of the training data. Recently, it h… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

    Comments: 10 pages (double column), 3 figures, 11 pages of appendices (single column)

    Journal ref: Phys. Rev. E 106, 025304 (2022)

  3. arXiv:2103.14108  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    The Geometry of Over-parameterized Regression and Adversarial Perturbations

    Authors: Jason W. Rocks, Pankaj Mehta

    Abstract: Classical regression has a simple geometric description in terms of a projection of the training labels onto the column space of the design matrix. However, for over-parameterized models -- where the number of fit parameters is large enough to perfectly fit the training data -- this picture becomes uninformative. Here, we present an alternative geometric interpretation of regression that applies t… ▽ More

    Submitted 23 April, 2021; v1 submitted 25 March, 2021; originally announced March 2021.

    Comments: 11 pages (single column), 4 figures, 10 pages of supporting material

  4. arXiv:2010.13933  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Memorizing without overfitting: Bias, variance, and interpolation in over-parameterized models

    Authors: Jason W. Rocks, Pankaj Mehta

    Abstract: The bias-variance trade-off is a central concept in supervised learning. In classical statistics, increasing the complexity of a model (e.g., number of parameters) reduces bias but also increases variance. Until recently, it was commonly believed that optimal performance is achieved at intermediate model complexities which strike a balance between bias and variance. Modern Deep Learning methods fl… ▽ More

    Submitted 24 February, 2022; v1 submitted 26 October, 2020; originally announced October 2020.

    Comments: 21 pages (double column), 6 figures, 32 pages of supplemental material (single column)

    Journal ref: Phys. Rev. Research 4, 013201 (2022)