Zum Hauptinhalt springen

Showing 1–1 of 1 results for author: Silk, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2307.11684  [pdf, other

    cs.LG

    Minibatching Offers Improved Generalization Performance for Second Order Optimizers

    Authors: Eric Silk, Swarnita Chakraborty, Nairanjana Dasgupta, Anand D. Sarwate, Andrew Lumsdaine, Tony Chiang

    Abstract: Training deep neural networks (DNNs) used in modern machine learning is computationally expensive. Machine learning scientists, therefore, rely on stochastic first-order methods for training, coupled with significant hand-tuning, to obtain good performance. To better understand performance variability of different stochastic algorithms, including second-order methods, we conduct an empirical study… ▽ More

    Submitted 25 May, 2023; originally announced July 2023.

    Comments: 14 pages, 6 figures, 5 tables