Zum Hauptinhalt springen

Showing 1–9 of 9 results for author: Hidary, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:1910.02071  [pdf, other

    quant-ph cs.LG

    Quantum Hamiltonian-Based Models and the Variational Quantum Thermalizer Algorithm

    Authors: Guillaume Verdon, Jacob Marks, Sasha Nanda, Stefan Leichenauer, Jack Hidary

    Abstract: We introduce a new class of generative quantum-neural-network-based models called Quantum Hamiltonian-Based Models (QHBMs). In doing so, we establish a paradigmatic approach for quantum-probabilistic hybrid variational learning, where we efficiently decompose the tasks of learning classical and quantum correlations in a way which maximizes the utility of both classical and quantum processors. In a… ▽ More

    Submitted 4 October, 2019; originally announced October 2019.

    Comments: 13 + 8 pages, 9 figures

  2. arXiv:1909.12264  [pdf, other

    quant-ph cs.LG

    Quantum Graph Neural Networks

    Authors: Guillaume Verdon, Trevor McCourt, Enxhell Luzhnica, Vikash Singh, Stefan Leichenauer, Jack Hidary

    Abstract: We introduce Quantum Graph Neural Networks (QGNN), a new class of quantum neural network ansatze which are tailored to represent quantum processes which have a graph structure, and are particularly suitable to be executed on distributed quantum systems over a quantum network. Along with this general class of ansatze, we introduce further specialized architectures, namely, Quantum Graph Recurrent N… ▽ More

    Submitted 26 September, 2019; originally announced September 2019.

    Comments: 8 pages

  3. arXiv:1906.06329  [pdf, other

    cs.LG cond-mat.str-el cs.CV physics.comp-ph stat.ML

    TensorNetwork for Machine Learning

    Authors: Stavros Efthymiou, Jack Hidary, Stefan Leichenauer

    Abstract: We demonstrate the use of tensor networks for image classification with the TensorNetwork open source library. We explain in detail the encoding of image data into a matrix product state form, and describe how to contract the network in a way that is parallelizable and well-suited to automatic gradients for optimization. Applying the technique to the MNIST and Fashion-MNIST datasets we find out-of… ▽ More

    Submitted 7 June, 2019; originally announced June 2019.

    Comments: 9 pages, 8 figures. All code can be found at https://github.com/google/tensornetwork

  4. arXiv:1905.01331  [pdf, other

    cond-mat.str-el cs.LG hep-th physics.comp-ph stat.ML

    TensorNetwork on TensorFlow: A Spin Chain Application Using Tree Tensor Networks

    Authors: Ashley Milsted, Martin Ganahl, Stefan Leichenauer, Jack Hidary, Guifre Vidal

    Abstract: TensorNetwork is an open source library for implementing tensor network algorithms in TensorFlow. We describe a tree tensor network (TTN) algorithm for approximating the ground state of either a periodic quantum spin chain (1D) or a lattice model on a thin torus (2D), and implement the algorithm using TensorNetwork. We use a standard energy minimization procedure over a TTN ansatz with bond dimens… ▽ More

    Submitted 3 May, 2019; originally announced May 2019.

    Comments: All code can be found at https://github.com/google/tensornetwork

  5. arXiv:1905.01330  [pdf, other

    physics.comp-ph cond-mat.str-el cs.LG hep-th stat.ML

    TensorNetwork: A Library for Physics and Machine Learning

    Authors: Chase Roberts, Ashley Milsted, Martin Ganahl, Adam Zalcman, Bruce Fontaine, Yijian Zou, Jack Hidary, Guifre Vidal, Stefan Leichenauer

    Abstract: TensorNetwork is an open source library for implementing tensor network algorithms. Tensor networks are sparse data structures originally designed for simulating quantum many-body physics, but are currently also applied in a number of other research areas, including machine learning. We demonstrate the use of the API with applications both physics and machine learning, with details appearing in co… ▽ More

    Submitted 3 May, 2019; originally announced May 2019.

    Comments: The TensorNetwork library can be found at https://github.com/google/tensornetwork

  6. arXiv:1903.04991  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Theory III: Dynamics and Generalization in Deep Networks

    Authors: Andrzej Banburski, Qianli Liao, Brando Miranda, Lorenzo Rosasco, Fernanda De La Torre, Jack Hidary, Tomaso Poggio

    Abstract: The key to generalization is controlling the complexity of the network. However, there is no obvious control of complexity -- such as an explicit regularization term -- in the training of deep networks for classification. We will show that a classical form of norm control -- but kind of hidden -- is present in deep networks trained with gradient descent techniques on exponential-type losses. In pa… ▽ More

    Submitted 10 April, 2020; v1 submitted 12 March, 2019; originally announced March 2019.

    Comments: 47 pages, 11 figures. This replaces previous versions of Theory III, that appeared on Arxiv [arXiv:1806.11379, arXiv:1801.00173] or on the CBMM site. v5: Changes throughout the paper to the presentation and tightening some of the statements

  7. arXiv:1807.09659  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    A Surprising Linear Relationship Predicts Test Performance in Deep Networks

    Authors: Qianli Liao, Brando Miranda, Andrzej Banburski, Jack Hidary, Tomaso Poggio

    Abstract: Given two networks with the same training loss on a dataset, when would they have drastically different test losses and errors? Better understanding of this question of generalization may improve practical applications of deep networks. In this paper we show that with cross-entropy loss it is surprisingly simple to induce significantly different generalization performances for two networks that ha… ▽ More

    Submitted 25 July, 2018; originally announced July 2018.

  8. arXiv:1806.11379  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Theory IIIb: Generalization in Deep Networks

    Authors: Tomaso Poggio, Qianli Liao, Brando Miranda, Andrzej Banburski, Xavier Boix, Jack Hidary

    Abstract: A main puzzle of deep neural networks (DNNs) revolves around the apparent absence of "overfitting", defined in this paper as follows: the expected error does not get worse when increasing the number of neurons or of iterations of gradient descent. This is surprising because of the large capacity demonstrated by DNNs to fit randomly labeled data and the absence of explicit regularization. Recent re… ▽ More

    Submitted 29 June, 2018; originally announced June 2018.

    Comments: 38 pages, 7 figures

  9. arXiv:1801.00173  [pdf, other

    cs.LG

    Theory of Deep Learning III: explaining the non-overfitting puzzle

    Authors: Tomaso Poggio, Kenji Kawaguchi, Qianli Liao, Brando Miranda, Lorenzo Rosasco, Xavier Boix, Jack Hidary, Hrushikesh Mhaskar

    Abstract: A main puzzle of deep networks revolves around the absence of overfitting despite large overparametrization and despite the large capacity demonstrated by zero training error on randomly labeled data. In this note, we show that the dynamics associated to gradient descent minimization of nonlinear networks is topologically equivalent, near the asymptotically stable minima of the empirical error, to… ▽ More

    Submitted 16 January, 2018; v1 submitted 30 December, 2017; originally announced January 2018.