Zum Hauptinhalt springen

Showing 1–1 of 1 results for author: DiAchille, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:1806.09077  [pdf, other

    stat.ML cs.LG

    Beyond Backprop: Online Alternating Minimization with Auxiliary Variables

    Authors: Anna Choromanska, Benjamin Cowen, Sadhana Kumaravel, Ronny Luss, Mattia Rigotti, Irina Rish, Brian Kingsbury, Paolo DiAchille, Viatcheslav Gurev, Ravi Tejwani, Djallel Bouneffouf

    Abstract: Despite significant recent advances in deep neural networks, training them remains a challenge due to the highly non-convex nature of the objective function. State-of-the-art methods rely on error backpropagation, which suffers from several well-known issues, such as vanishing and exploding gradients, inability to handle non-differentiable nonlinearities and to parallelize weight-updates across la… ▽ More

    Submitted 5 June, 2019; v1 submitted 23 June, 2018; originally announced June 2018.

    Comments: First six authors contributed equally to this work: A.C. - theory, manuscript, B.C. - code, experiments, S.K. - code, experiments, R.L. - algorithm, experiments, M.R. - code, experiments, I.R. - algorithm, manuscript