Zum Hauptinhalt springen

Showing 1–7 of 7 results for author: Tavarageri, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2104.05573  [pdf, other

    cs.PL

    AI Powered Compiler Techniques for DL Code Optimization

    Authors: Sanket Tavarageri, Gagandeep Goyal, Sasikanth Avancha, Bharat Kaul, Ramakrishna Upadrasta

    Abstract: Creating high performance implementations of deep learning primitives on CPUs is a challenging task. Multiple considerations including multi-level cache hierarchy, and wide SIMD units of CPU platforms influence the choice of program transformations to apply for performance optimization. In this paper, we present machine learning powered compiler techniques to optimize loop nests. We take a two-pro… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: arXiv admin note: text overlap with arXiv:2006.02230, arXiv:2002.02145

  2. arXiv:2006.02230  [pdf, other

    cs.DC cs.AI cs.PL

    PolyDL: Polyhedral Optimizations for Creation of High Performance DL primitives

    Authors: Sanket Tavarageri, Alexander Heinecke, Sasikanth Avancha, Gagandeep Goyal, Ramakrishna Upadrasta, Bharat Kaul

    Abstract: Deep Neural Networks (DNNs) have revolutionized many aspects of our lives. The use of DNNs is becoming ubiquitous including in softwares for image recognition, speech recognition, speech synthesis, language translation, to name a few. he training of DNN architectures however is computationally expensive. Once the model is created, its use in the intended application - the inference task, is comput… ▽ More

    Submitted 17 November, 2020; v1 submitted 2 June, 2020; originally announced June 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:2002.02145

  3. arXiv:2002.02145  [pdf, other

    cs.PL cs.LG

    PolyScientist: Automatic Loop Transformations Combined with Microkernels for Optimization of Deep Learning Primitives

    Authors: Sanket Tavarageri, Alexander Heinecke, Sasikanth Avancha, Gagandeep Goyal, Ramakrishna Upadrasta, Bharat Kaul

    Abstract: At the heart of deep learning training and inferencing are computationally intensive primitives such as convolutions which form the building blocks of deep neural networks. Researchers have taken two distinct approaches to creating high performance implementations of deep learning kernels, namely, 1) library development exemplified by Intel MKL-DNN for CPUs, 2) automatic compilation represented by… ▽ More

    Submitted 6 February, 2020; originally announced February 2020.

  4. arXiv:1906.08168  [pdf, other

    cs.DC cs.LG

    Automatic Model Parallelism for Deep Neural Networks with Compiler and Hardware Support

    Authors: Sanket Tavarageri, Srinivas Sridharan, Bharat Kaul

    Abstract: The deep neural networks (DNNs) have been enormously successful in tasks that were hitherto in the human-only realm such as image recognition, and language translation. Owing to their success the DNNs are being explored for use in ever more sophisticated tasks. One of the ways that the DNNs are made to scale for the complex undertakings is by increasing their size -- deeper and wider networks can… ▽ More

    Submitted 11 June, 2019; originally announced June 2019.

  5. arXiv:1905.12292  [pdf, other

    cs.PL cs.LG cs.PF

    Categorization of Program Regions for Agile Compilation using Machine Learning and Hardware Support

    Authors: Sanket Tavarageri

    Abstract: A compiler processes the code written in a high level language and produces machine executable code. The compiler writers often face the challenge of keeping the compilation times reasonable. That is because aggressive optimization passes which potentially will give rise to high performance are often expensive in terms of running time and memory footprint. Consequently the compiler designers arriv… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

  6. arXiv:1809.05877  [pdf, other

    cs.LG stat.ML

    A Data Analytics Framework for Aggregate Data Analysis

    Authors: Sanket Tavarageri, Nag Mani, Anand Ramasubramanian, Jaskiran Kalsi

    Abstract: In many contexts, we have access to aggregate data, but individual level data is unavailable. For example, medical studies sometimes report only aggregate statistics about disease prevalence because of privacy concerns. Even so, many a time it is desirable, and in fact could be necessary to infer individual level characteristics from aggregate data. For instance, other researchers who want to perf… ▽ More

    Submitted 16 September, 2018; originally announced September 2018.

  7. arXiv:1409.1914  [pdf, ps, other

    cs.DC cs.PL

    A Tale of Three Runtimes

    Authors: Nicolas Vasilache, Muthu Baskaran, Tom Henretty, Benoit Meister, M. Harper Langston, Sanket Tavarageri, Richard Lethin

    Abstract: This contribution discusses the automatic generation of event-driven, tuple-space based programs for task-oriented execution models from a sequential C specification. We developed a hierarchical mapping solution using auto-parallelizing compiler technology to target three different runtimes relying on event-driven tasks (EDTs). Our solution benefits from the important observation that loop types e… ▽ More

    Submitted 5 September, 2014; originally announced September 2014.