Zum Hauptinhalt springen

Showing 1–4 of 4 results for author: Goyal, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2104.05573  [pdf, other

    cs.PL

    AI Powered Compiler Techniques for DL Code Optimization

    Authors: Sanket Tavarageri, Gagandeep Goyal, Sasikanth Avancha, Bharat Kaul, Ramakrishna Upadrasta

    Abstract: Creating high performance implementations of deep learning primitives on CPUs is a challenging task. Multiple considerations including multi-level cache hierarchy, and wide SIMD units of CPU platforms influence the choice of program transformations to apply for performance optimization. In this paper, we present machine learning powered compiler techniques to optimize loop nests. We take a two-pro… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: arXiv admin note: text overlap with arXiv:2006.02230, arXiv:2002.02145

  2. arXiv:2006.02230  [pdf, other

    cs.DC cs.AI cs.PL

    PolyDL: Polyhedral Optimizations for Creation of High Performance DL primitives

    Authors: Sanket Tavarageri, Alexander Heinecke, Sasikanth Avancha, Gagandeep Goyal, Ramakrishna Upadrasta, Bharat Kaul

    Abstract: Deep Neural Networks (DNNs) have revolutionized many aspects of our lives. The use of DNNs is becoming ubiquitous including in softwares for image recognition, speech recognition, speech synthesis, language translation, to name a few. he training of DNN architectures however is computationally expensive. Once the model is created, its use in the intended application - the inference task, is comput… ▽ More

    Submitted 17 November, 2020; v1 submitted 2 June, 2020; originally announced June 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:2002.02145

  3. arXiv:2002.02145  [pdf, other

    cs.PL cs.LG

    PolyScientist: Automatic Loop Transformations Combined with Microkernels for Optimization of Deep Learning Primitives

    Authors: Sanket Tavarageri, Alexander Heinecke, Sasikanth Avancha, Gagandeep Goyal, Ramakrishna Upadrasta, Bharat Kaul

    Abstract: At the heart of deep learning training and inferencing are computationally intensive primitives such as convolutions which form the building blocks of deep neural networks. Researchers have taken two distinct approaches to creating high performance implementations of deep learning kernels, namely, 1) library development exemplified by Intel MKL-DNN for CPUs, 2) automatic compilation represented by… ▽ More

    Submitted 6 February, 2020; originally announced February 2020.

  4. arXiv:1906.03918  [pdf, other

    cs.CV

    The role of ego vision in view-invariant action recognition

    Authors: Gaurvi Goyal, Nicoletta Noceti, Francesca Odone, Alessandra Sciutti

    Abstract: Analysis and interpretation of egocentric video data is becoming more and more important with the increasing availability and use of wearable cameras. Exploring and fully understanding affinities and differences between ego and allo (or third-person) vision is paramount for the design of effective methods to process, analyse and interpret egocentric data. In addition, a deeper understanding of ego… ▽ More

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: Accepted for presentation at EPIC@CVPR2019 workshop