Zum Hauptinhalt springen

Showing 1–1 of 1 results for author: Pipralia, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.09453  [pdf, other

    cs.LG cs.AR cs.CL

    Weight Block Sparsity: Training, Compilation, and AI Engine Accelerators

    Authors: Paolo D'Alberto, Taehee Jeong, Akshai Jain, Shreyas Manjunath, Mrinal Sarmah, Samuel Hsu, Yaswanth Raparti, Nitesh Pipralia

    Abstract: Nowadays, increasingly larger Deep Neural Networks (DNNs) are being developed, trained, and utilized. These networks require significant computational resources, putting a strain on both advanced and limited devices. Our solution is to implement {\em weight block sparsity}, which is a structured sparsity that is friendly to hardware. By zeroing certain sections of the convolution and fully connect… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 12 pages, 10 figures, 1 table

    ACM Class: C.5; D.3.4