Zum Hauptinhalt springen

Showing 1–3 of 3 results for author: Adelman, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2104.08002  [pdf, other

    cs.LG cs.AI cs.DC

    Efficient and Generic 1D Dilated Convolution Layer for Deep Learning

    Authors: Narendra Chaudhary, Sanchit Misra, Dhiraj Kalamkar, Alexander Heinecke, Evangelos Georganas, Barukh Ziv, Menachem Adelman, Bharat Kaul

    Abstract: Convolutional neural networks (CNNs) have found many applications in tasks involving two-dimensional (2D) data, such as image classification and image processing. Therefore, 2D convolution layers have been heavily optimized on CPUs and GPUs. However, in many applications - for example genomics and speech recognition, the data can be one-dimensional (1D). Such applications can benefit from optimize… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

  2. Tensor Processing Primitives: A Programming Abstraction for Efficiency and Portability in Deep Learning & HPC Workloads

    Authors: Evangelos Georganas, Dhiraj Kalamkar, Sasikanth Avancha, Menachem Adelman, Deepti Aggarwal, Cristina Anderson, Alexander Breuer, Jeremy Bruestle, Narendra Chaudhary, Abhisek Kundu, Denise Kutnick, Frank Laub, Vasimuddin Md, Sanchit Misra, Ramanarayan Mohanty, Hans Pabst, Brian Retford, Barukh Ziv, Alexander Heinecke

    Abstract: During the past decade, novel Deep Learning (DL) algorithms, workloads and hardware have been developed to tackle a wide range of problems. Despite the advances in workload and hardware ecosystems, the programming methodology of DL systems is stagnant. DL workloads leverage either highly-optimized, yet platform-specific and inflexible kernels from DL libraries, or in the case of novel operators, r… ▽ More

    Submitted 30 November, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

  3. arXiv:1805.08079  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Faster Neural Network Training with Approximate Tensor Operations

    Authors: Menachem Adelman, Kfir Y. Levy, Ido Hakimi, Mark Silberstein

    Abstract: We propose a novel technique for faster deep neural network training which systematically applies sample-based approximation to the constituent tensor operations, i.e., matrix multiplications and convolutions. We introduce new sampling techniques, study their theoretical properties, and prove that they provide the same convergence guarantees when applied to SGD training. We apply approximate tenso… ▽ More

    Submitted 25 October, 2021; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: NeurIPS 2021 camera ready