Zum Hauptinhalt springen

Showing 1–3 of 3 results for author: Kahn, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.02264  [pdf, other

    hep-ph cs.LG

    Scaling Laws in Jet Classification

    Authors: Joshua Batson, Yonatan Kahn

    Abstract: We demonstrate the emergence of scaling laws in the benchmark top versus QCD jet classification problem in collider physics. Six distinct physically-motivated classifiers exhibit power-law scaling of the binary cross-entropy test loss as a function of training set size, with distinct power law indices. This result highlights the importance of comparing classifiers as a function of dataset size rat… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 10+2 pages, 7 figures, 1 table

  2. arXiv:2310.07765  [pdf, other

    cs.LG hep-ph hep-th stat.ML

    Feature Learning and Generalization in Deep Networks with Orthogonal Weights

    Authors: Hannah Day, Yonatan Kahn, Daniel A. Roberts

    Abstract: Fully-connected deep neural networks with weights initialized from independent Gaussian distributions can be tuned to criticality, which prevents the exponential growth or decay of signals propagating through the network. However, such networks still exhibit fluctuations that grow linearly with the depth of the network, which may impair the training of networks with width comparable to depth. We s… ▽ More

    Submitted 12 June, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: v2: numerical experiments updated with more data, plots updated to match, conclusions unchanged. 30+12 pages, 20 figures

    Report number: MIT-CTP/5625

  3. arXiv:2102.08380  [pdf, other

    hep-ph cs.LG stat.ML

    Topological Obstructions to Autoencoding

    Authors: Joshua Batson, C. Grace Haaf, Yonatan Kahn, Daniel A. Roberts

    Abstract: Autoencoders have been proposed as a powerful tool for model-independent anomaly detection in high-energy physics. The operating principle is that events which do not belong to the space of training data will be reconstructed poorly, thus flagging them as anomalies. We point out that in a variety of examples of interest, the connection between large reconstruction error and anomalies is not so cle… ▽ More

    Submitted 3 May, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: 24 + 20 pages, 26 figures; no autoencoders were harmed in the making of this project. v2: JHEP published version

    Report number: MIT-CTP/5264

    Journal ref: JHEP04(2021)280