Zum Hauptinhalt springen

Showing 1–1 of 1 results for author: Day, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.07765  [pdf, other

    cs.LG hep-ph hep-th stat.ML

    Feature Learning and Generalization in Deep Networks with Orthogonal Weights

    Authors: Hannah Day, Yonatan Kahn, Daniel A. Roberts

    Abstract: Fully-connected deep neural networks with weights initialized from independent Gaussian distributions can be tuned to criticality, which prevents the exponential growth or decay of signals propagating through the network. However, such networks still exhibit fluctuations that grow linearly with the depth of the network, which may impair the training of networks with width comparable to depth. We s… ▽ More

    Submitted 12 June, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: v2: numerical experiments updated with more data, plots updated to match, conclusions unchanged. 30+12 pages, 20 figures

    Report number: MIT-CTP/5625