Zum Hauptinhalt springen

Showing 1–27 of 27 results for author: Venkataramani, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.15866  [pdf, other

    cs.LG cs.AI cs.AR

    SmartQuant: CXL-based AI Model Store in Support of Runtime Configurable Weight Quantization

    Authors: Rui Xie, Asad Ul Haq, Linsen Ma, Krystal Sun, Sanchari Sen, Swagath Venkataramani, Liu Liu, Tong Zhang

    Abstract: Recent studies have revealed that, during the inference on generative AI models such as transformer, the importance of different weights exhibits substantial context-dependent variations. This naturally manifests a promising potential of adaptively configuring weight quantization to improve the generative AI inference efficiency. Although configurable weight quantization can readily leverage the h… ▽ More

    Submitted 17 August, 2024; v1 submitted 17 July, 2024; originally announced July 2024.

  2. arXiv:2402.04325  [pdf, other

    cs.LG cs.AI cs.CR

    Enhance DNN Adversarial Robustness and Efficiency via Injecting Noise to Non-Essential Neurons

    Authors: Zhenyu Liu, Garrett Gagnon, Swagath Venkataramani, Liu Liu

    Abstract: Deep Neural Networks (DNNs) have revolutionized a wide range of industries, from healthcare and finance to automotive, by offering unparalleled capabilities in data analysis and decision-making. Despite their transforming impact, DNNs face two critical challenges: the vulnerability to adversarial attacks and the increasing computational costs associated with more complex and larger models. In this… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  3. Approximate Computing and the Efficient Machine Learning Expedition

    Authors: Jörg Henkel, Hai Li, Anand Raghunathan, Mehdi B. Tahoori, Swagath Venkataramani, Xiaoxuan Yang, Georgios Zervakis

    Abstract: Approximate computing (AxC) has been long accepted as a design alternative for efficient system implementation at the cost of relaxed accuracy requirements. Despite the AxC research activities in various application domains, AxC thrived the past decade when it was applied in Machine Learning (ML). The by definition approximate notion of ML models but also the increased computational overheads asso… ▽ More

    Submitted 2 October, 2022; originally announced October 2022.

    Comments: Accepted for publication at the International Conference on Computer-Aided Design (ICCAD) 2022

  4. arXiv:2206.09072  [pdf, other

    eess.AS cs.SD

    Semi-supervised Time Domain Target Speaker Extraction with Attention

    Authors: Zhepei Wang, Ritwik Giri, Shrikant Venkataramani, Umut Isik, Jean-Marc Valin, Paris Smaragdis, Mike Goodwin, Arvindh Krishnaswamy

    Abstract: In this work, we propose Exformer, a time-domain architecture for target speaker extraction. It consists of a pre-trained speaker embedder network and a separator network based on transformer encoder blocks. We study multiple methods to combine speaker information with the input mixture, and the resulting Exformer architecture obtains superior extraction performance compared to prior time-domain n… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

  5. arXiv:2206.07917  [pdf, other

    eess.AS cs.SD

    To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets

    Authors: Jean-Marc Valin, Ritwik Giri, Shrikant Venkataramani, Umut Isik, Arvindh Krishnaswamy

    Abstract: In real life, room effect, also known as room reverberation, and the present background noise degrade the quality of speech. Recently, deep learning-based speech enhancement approaches have shown a lot of promise and surpassed traditional denoising and dereverberation methods. It is also well established that these state-of-the-art denoising algorithms significantly improve the quality of speech a… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: 5 pages

  6. arXiv:2206.07882  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization

    Authors: Andrea Fasoli, Chia-Yu Chen, Mauricio Serrano, Swagath Venkataramani, George Saon, Xiaodong Cui, Brian Kingsbury, Kailash Gopalakrishnan

    Abstract: We report on aggressive quantization strategies that greatly accelerate inference of Recurrent Neural Network Transducers (RNN-T). We use a 4 bit integer representation for both weights and activations and apply Quantization Aware Training (QAT) to retrain the full model (acoustic encoder and language model) and achieve near-iso-accuracy. We show that customized quantization schemes that are tailo… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: 5 pages, 2 figures, 1 table. Paper accepted to Interspeech 2022

    ACM Class: I.2.6

  7. arXiv:2108.12074  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    4-bit Quantization of LSTM-based Speech Recognition Models

    Authors: Andrea Fasoli, Chia-Yu Chen, Mauricio Serrano, Xiao Sun, Naigang Wang, Swagath Venkataramani, George Saon, Xiaodong Cui, Brian Kingsbury, Wei Zhang, Zoltán Tüske, Kailash Gopalakrishnan

    Abstract: We investigate the impact of aggressive low-precision representations of weights and activations in two families of large LSTM-based architectures for Automatic Speech Recognition (ASR): hybrid Deep Bidirectional LSTM - Hidden Markov Models (DBLSTM-HMMs) and Recurrent Neural Network - Transducers (RNN-Ts). Using a 4-bit integer representation, a naïve quantization approach applied to the LSTM port… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: 5 pages, 3 figures, Andrea Fasoli and Chia-Yu Chen equally contributed to this work. Paper accepted to Interspeech 2021

    ACM Class: I.2.6

  8. arXiv:2104.11125  [pdf, other

    cs.LG

    ScaleCom: Scalable Sparsified Gradient Compression for Communication-Efficient Distributed Training

    Authors: Chia-Yu Chen, Jiamin Ni, Songtao Lu, Xiaodong Cui, Pin-Yu Chen, Xiao Sun, Naigang Wang, Swagath Venkataramani, Vijayalakshmi Srinivasan, Wei Zhang, Kailash Gopalakrishnan

    Abstract: Large-scale distributed training of Deep Neural Networks (DNNs) on state-of-the-art platforms is expected to be severely communication constrained. To overcome this limitation, numerous gradient compression techniques have been proposed and have demonstrated high compression ratios. However, most existing methods do not scale well to large scale distributed systems (due to gradient build-up) and/o… ▽ More

    Submitted 20 April, 2021; originally announced April 2021.

    Comments: NeurIPS2020 accepted https://proceedings.neurips.cc/paper/2020/hash/9d58963592071dbf38a0fa114269959c-Abstract.html

  9. arXiv:2006.10388  [pdf, other

    eess.AS cs.SD

    Self-supervised Learning for Speech Enhancement

    Authors: Yu-Che Wang, Shrikant Venkataramani, Paris Smaragdis

    Abstract: Supervised learning for single-channel speech enhancement requires carefully labeled training examples where the noisy mixture is input into the network and the network is trained to produce an output close to the ideal target. To relax the conditions on the training data, we consider the task of training speech enhancement networks in a self-supervised manner. We first use a limited training set… ▽ More

    Submitted 18 June, 2020; originally announced June 2020.

  10. arXiv:2002.09286  [pdf, other

    eess.AS cs.LG cs.NE cs.SD stat.ML

    Efficient Trainable Front-Ends for Neural Speech Enhancement

    Authors: Jonah Casebeer, Umut Isik, Shrikant Venkataramani, Arvindh Krishnaswamy

    Abstract: Many neural speech enhancement and source separation systems operate in the time-frequency domain. Such models often benefit from making their Short-Time Fourier Transform (STFT) front-ends trainable. In current literature, these are implemented as large Discrete Fourier Transform matrices; which are prohibitively inefficient for low-compute systems. We present an efficient, trainable front-end ba… ▽ More

    Submitted 19 February, 2020; originally announced February 2020.

    Comments: 5 pages, 5 figures, ICASSP 2020

  11. arXiv:1911.00102  [pdf, other

    cs.SD eess.AS

    End-to-end Non-Negative Autoencoders for Sound Source Separation

    Authors: Shrikant Venkataramani, Efthymios Tzinis, Paris Smaragdis

    Abstract: Discriminative models for source separation have recently been shown to produce impressive results. However, when operating on sources outside of the training set, these models can not perform as well and are cumbersome to update. Classical methods like Non-negative Matrix Factorization (NMF) provide modular approaches to source separation that can be easily updated to adapt to new mixture scenari… ▽ More

    Submitted 31 October, 2019; originally announced November 2019.

  12. arXiv:1910.09804  [pdf, other

    cs.LG cs.CL cs.SD eess.AS stat.ML

    Two-Step Sound Source Separation: Training on Learned Latent Targets

    Authors: Efthymios Tzinis, Shrikant Venkataramani, Zhepei Wang, Cem Subakan, Paris Smaragdis

    Abstract: In this paper, we propose a two-step training procedure for source separation via a deep neural network. In the first step we learn a transform (and it's inverse) to a latent space where masking-based separation performance using oracles is optimal. For the second step, we train a separation module that operates on the previously learned space. In order to do so, we also make use of a scale-invari… ▽ More

    Submitted 23 October, 2019; v1 submitted 22 October, 2019; originally announced October 2019.

    Comments: Submitted to ICASSP 2020

    Journal ref: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  13. arXiv:1905.00151  [pdf, other

    cs.SD eess.AS

    A Style Transfer Approach to Source Separation

    Authors: Shrikant Venkataramani, Efthymios Tzinis, Paris Smaragdis

    Abstract: Training neural networks for source separation involves presenting a mixture recording at the input of the network and updating network parameters in order to produce an output that resembles the clean source. Consequently, supervised source separation depends on the availability of paired mixture-clean training examples. In this paper, we interpret source separation as a style transfer problem. W… ▽ More

    Submitted 9 May, 2019; v1 submitted 30 April, 2019; originally announced May 2019.

  14. arXiv:1811.03076  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Class-conditional embeddings for music source separation

    Authors: Prem Seetharaman, Gordon Wichern, Shrikant Venkataramani, Jonathan Le Roux

    Abstract: Isolating individual instruments in a musical mixture has a myriad of potential applications, and seems imminently achievable given the levels of performance reached by recent deep learning methods. While most musical source separation techniques learn an independent model for each instrument, we propose using a common embedding space for the time-frequency bins of all instruments in a mixture ins… ▽ More

    Submitted 7 November, 2018; originally announced November 2018.

    Comments: 5 pages

  15. arXiv:1811.01532  [pdf, other

    cs.DC

    Workload-aware Automatic Parallelization for Multi-GPU DNN Training

    Authors: Sungho Shin, Youngmin Jo, Jungwook Choi, Swagath Venkataramani, Vijayalakshmi Srinivasan, Wonyong Sung

    Abstract: Deep neural networks (DNNs) have emerged as successful solutions for variety of artificial intelligence applications, but their very large and deep models impose high computational requirements during training. Multi-GPU parallelization is a popular option to accelerate demanding computations in DNN training, but most state-of-the-art multi-GPU deep learning frameworks not only require users to ha… ▽ More

    Submitted 6 February, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: This paper is accepted in ICASSP2019

  16. arXiv:1811.01531  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Unsupervised Deep Clustering for Source Separation: Direct Learning from Mixtures using Spatial Information

    Authors: Efthymios Tzinis, Shrikant Venkataramani, Paris Smaragdis

    Abstract: We present a monophonic source separation system that is trained by only observing mixtures with no ground truth separation information. We use a deep clustering approach which trains on multi-channel mixtures and learns to project spectrogram bins to source clusters that correlate with various spatial features. We show that using such a training process we can obtain separation performance that i… ▽ More

    Submitted 9 November, 2018; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: Submitted to ICASSP 2019 (v1: November 5th 2018)

    Journal ref: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  17. arXiv:1810.02568  [pdf, other

    eess.AS cs.LG cs.SD eess.SP

    End-to-end Networks for Supervised Single-channel Speech Separation

    Authors: Shrikant Venkataramani, Paris Smaragdis

    Abstract: The performance of single channel source separation algorithms has improved greatly in recent times with the development and deployment of neural networks. However, many such networks continue to operate on the magnitude spectrogram of a mixture, and produce an estimate of source magnitude spectrograms, to perform source separation. In this paper, we interpret these steps as additional neural netw… ▽ More

    Submitted 5 October, 2018; originally announced October 2018.

  18. arXiv:1807.06964  [pdf, other

    cs.CV

    Bridging the Accuracy Gap for 2-bit Quantized Neural Networks (QNN)

    Authors: Jungwook Choi, Pierce I-Jen Chuang, Zhuo Wang, Swagath Venkataramani, Vijayalakshmi Srinivasan, Kailash Gopalakrishnan

    Abstract: Deep learning algorithms achieve high classification accuracy at the expense of significant computation cost. In order to reduce this cost, several quantization schemes have gained attention recently with some focusing on weight quantization, and others focusing on quantizing activations. This paper proposes novel techniques that target weight and activation quantizations separately resulting in a… ▽ More

    Submitted 17 July, 2018; originally announced July 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1805.06085

  19. arXiv:1806.00511  [pdf, other

    eess.AS cs.SD eess.SP

    Performance Based Cost Functions for End-to-End Speech Separation

    Authors: Shrikant Venkataramani, Ryley Higa, Paris Smaragdis

    Abstract: Recent neural network strategies for source separation attempt to model audio signals by processing their waveforms directly. Mean squared error (MSE) that measures the Euclidean distance between waveforms of denoised speech and the ground-truth speech, has been a natural cost-function for these approaches. However, MSE is not a perceptually motivated measure and may result in large perceptual dis… ▽ More

    Submitted 1 June, 2018; originally announced June 2018.

  20. arXiv:1805.06085  [pdf, other

    cs.CV cs.AI

    PACT: Parameterized Clipping Activation for Quantized Neural Networks

    Authors: Jungwook Choi, Zhuo Wang, Swagath Venkataramani, Pierce I-Jen Chuang, Vijayalakshmi Srinivasan, Kailash Gopalakrishnan

    Abstract: Deep learning algorithms achieve high classification accuracy at the expense of significant computation cost. To address this cost, a number of quantization schemes have been proposed - but most of these techniques focused on quantizing weights, which are relatively smaller in size compared to activations. This paper proposes a novel quantization scheme for activations during training - that enabl… ▽ More

    Submitted 17 July, 2018; v1 submitted 15 May, 2018; originally announced May 2018.

  21. arXiv:1711.06315  [pdf, other

    cs.DC cs.AR cs.CV

    SparCE: Sparsity aware General Purpose Core Extensions to Accelerate Deep Neural Networks

    Authors: Sanchari Sen, Shubham Jain, Swagath Venkataramani, Anand Raghunathan

    Abstract: Deep Neural Networks (DNNs) have emerged as the method of choice for solving a wide range of machine learning tasks. The enormous computational demands posed by DNNs have most commonly been addressed through the design of custom accelerators. However, these accelerators are prohibitive in many design scenarios (e.g., wearable devices and IoT sensors), due to stringent area/cost constraints. Accele… ▽ More

    Submitted 29 November, 2017; v1 submitted 6 November, 2017; originally announced November 2017.

  22. arXiv:1709.07908  [pdf, other

    cs.SD eess.AS

    Neural Network Alternatives to Convolutive Audio Models for Source Separation

    Authors: Shrikant Venkataramani, Y. Cem Subakan, Paris Smaragdis

    Abstract: Convolutive Non-Negative Matrix Factorization model factorizes a given audio spectrogram using frequency templates with a temporal dimension. In this paper, we present a convolutional auto-encoder model that acts as a neural network alternative to convolutive NMF. Using the modeling flexibility granted by neural networks, we also explore the idea of using a Recurrent Neural Network in the encoder.… ▽ More

    Submitted 20 September, 2017; originally announced September 2017.

    Comments: Published in MLSP 2017

  23. arXiv:1705.02514  [pdf, other

    cs.SD

    End-to-end Source Separation with Adaptive Front-Ends

    Authors: Shrikant Venkataramani, Jonah Casebeer, Paris Smaragdis

    Abstract: Source separation and other audio applications have traditionally relied on the use of short-time Fourier transforms as a front-end frequency domain representation step. The unavailability of a neural network equivalent to forward and inverse transforms hinders the implementation of end-to-end learning systems for these applications. We present an auto-encoder neural network that can act as an equ… ▽ More

    Submitted 30 October, 2017; v1 submitted 6 May, 2017; originally announced May 2017.

    Comments: 4 figures, 4 pages

  24. arXiv:1704.01137  [pdf, other

    cs.NE cs.CV cs.LG

    DyVEDeep: Dynamic Variable Effort Deep Neural Networks

    Authors: Sanjay Ganapathy, Swagath Venkataramani, Balaraman Ravindran, Anand Raghunathan

    Abstract: Deep Neural Networks (DNNs) have advanced the state-of-the-art in a variety of machine learning tasks and are deployed in increasing numbers of products and services. However, the computational requirements of training and evaluating large-scale DNNs are growing at a much faster pace than the capabilities of the underlying hardware platforms that they are executed upon. In this work, we propose Dy… ▽ More

    Submitted 4 April, 2017; originally announced April 2017.

  25. arXiv:1609.03296  [pdf, other

    cs.SD

    A Neural Network Alternative to Non-Negative Audio Models

    Authors: Paris Smaragdis, Shrikant Venkataramani

    Abstract: We present a neural network that can act as an equivalent to a Non-Negative Matrix Factorization (NMF), and further show how it can be used to perform supervised source separation. Due to the extensibility of this approach we show how we can achieve better source separation performance as compared to NMF-based methods, and propose a variety of derivative architectures that can be used for further… ▽ More

    Submitted 12 September, 2016; originally announced September 2016.

  26. arXiv:1602.08557  [pdf

    cs.NE

    Multiplier-less Artificial Neurons Exploiting Error Resiliency for Energy-Efficient Neural Computing

    Authors: Syed Shakib Sarwar, Swagath Venkataramani, Anand Raghunathan, Kaushik Roy

    Abstract: Large-scale artificial neural networks have shown significant promise in addressing a wide range of classification and recognition applications. However, their large computational requirements stretch the capabilities of computing platforms. The fundamental components of these neural networks are the neurons and its synapses. The core of a digital hardware neuron consists of multiplier, accumulato… ▽ More

    Submitted 27 February, 2016; originally announced February 2016.

    Comments: Accepted in Design, Automation and Test in Europe 2016 conference (DATE-2016)

    Journal ref: In Design, Automation & Test in Europe Conference & Exhibition (DATE), 2016, pp. 145-150

  27. arXiv:1509.08970  [pdf, other

    cs.CV

    Energy-Efficient Object Detection using Semantic Decomposition

    Authors: Priyadarshini Panda, Swagath Venkataramani, Abhronil Sengupta, Anand Raghunathan, Kaushik Roy

    Abstract: Machine-learning algorithms offer immense possibilities in the development of several cognitive applications. In fact, large scale machine-learning classifiers now represent the state-of-the-art in a wide range of object detection/classification problems. However, the network complexities of large-scale classifiers present them as one of the most challenging and energy intensive workloads across t… ▽ More

    Submitted 20 September, 2016; v1 submitted 29 September, 2015; originally announced September 2015.

    Comments: 10 pages, 13 figures, 3 algorithms, Submitted to IEEE TVLSI(Under Review)