Zum Hauptinhalt springen

Showing 1–14 of 14 results for author: Aketi, S A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.13961  [pdf, other

    cs.LG cs.DC cs.MA

    SADDLe: Sharpness-Aware Decentralized Deep Learning with Heterogeneous Data

    Authors: Sakshi Choudhary, Sai Aparna Aketi, Kaushik Roy

    Abstract: Decentralized training enables learning with distributed datasets generated at different locations without relying on a central server. In realistic scenarios, the data distribution across these sparsely connected learning agents can be significantly heterogeneous, leading to local model over-fitting and poor global model generalization. Another challenge is the high communication cost of training… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  2. arXiv:2404.05919  [pdf, other

    cs.LG

    AdaGossip: Adaptive Consensus Step-size for Decentralized Deep Learning with Communication Compression

    Authors: Sai Aparna Aketi, Abolfazl Hashemi, Kaushik Roy

    Abstract: Decentralized learning is crucial in supporting on-device learning over large distributed datasets, eliminating the need for a central server. However, the communication overhead remains a major bottleneck for the practical realization of such decentralized setups. To tackle this issue, several algorithms for decentralized training with compressed communication have been proposed in the literature… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 11 pages, 3 figures, 8 tables. arXiv admin note: text overlap with arXiv:2305.04792, arXiv:2310.15890

  3. Towards Two-Stream Foveation-based Active Vision Learning

    Authors: Timur Ibrayev, Amitangshu Mukherjee, Sai Aparna Aketi, Kaushik Roy

    Abstract: Deep neural network (DNN) based machine perception frameworks process the entire input in a one-shot manner to provide answers to both "what object is being observed" and "where it is located". In contrast, the "two-stream hypothesis" from neuroscience explains the neural processing in the human visual cortex as an active vision system that utilizes two separate regions of the brain to answer the… ▽ More

    Submitted 20 April, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

    Comments: Accepted version of the article, 18 pages, 14 figures

    Journal ref: IEEE Transactions on Cognitive and Developmental Systems, 2024

  4. arXiv:2403.03292  [pdf, other

    cs.LG cs.DC

    Averaging Rate Scheduler for Decentralized Learning on Heterogeneous Data

    Authors: Sai Aparna Aketi, Sakshi Choudhary, Kaushik Roy

    Abstract: State-of-the-art decentralized learning algorithms typically require the data distribution to be Independent and Identically Distributed (IID). However, in practical scenarios, the data distribution across the agents can have significant heterogeneity. In this work, we propose averaging rate scheduling as a simple yet effective way to reduce the impact of heterogeneity in decentralized learning. O… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 9 pages, 3 figures, 4 tables. arXiv admin note: text overlap with arXiv:2305.04792

  5. arXiv:2310.15890  [pdf, other

    cs.LG

    Cross-feature Contrastive Loss for Decentralized Deep Learning on Heterogeneous Data

    Authors: Sai Aparna Aketi, Kaushik Roy

    Abstract: The current state-of-the-art decentralized learning algorithms mostly assume the data distribution to be Independent and Identically Distributed (IID). However, in practical scenarios, the distributed datasets can have significantly heterogeneous data distributions across the agents. In this work, we present a novel approach for decentralized learning on heterogeneous data, where data-free knowled… ▽ More

    Submitted 5 December, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: 12 pages, 7 figures, 11 tables. arXiv admin note: text overlap with arXiv:2305.04792

    Journal ref: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024

  6. arXiv:2305.04792  [pdf, other

    cs.LG cs.MA

    Global Update Tracking: A Decentralized Learning Algorithm for Heterogeneous Data

    Authors: Sai Aparna Aketi, Abolfazl Hashemi, Kaushik Roy

    Abstract: Decentralized learning enables the training of deep learning models over large distributed datasets generated at different locations, without the need for a central server. However, in practical scenarios, the data distribution across these devices can be significantly different, leading to a degradation in model performance. In this paper, we focus on designing a decentralized learning algorithm… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 22 pages, 10 tables, 3 figures

  7. arXiv:2304.04326  [pdf, other

    cs.LG cs.DC

    Homogenizing Non-IID datasets via In-Distribution Knowledge Distillation for Decentralized Learning

    Authors: Deepak Ravikumar, Gobinda Saha, Sai Aparna Aketi, Kaushik Roy

    Abstract: Decentralized learning enables serverless training of deep neural networks (DNNs) in a distributed manner on multiple nodes. This allows for the use of large datasets, as well as the ability to train with a wide variety of data sources. However, one of the key challenges with decentralized learning is heterogeneity in the data distribution across the nodes. In this paper, we propose In-Distributio… ▽ More

    Submitted 24 February, 2024; v1 submitted 9 April, 2023; originally announced April 2023.

  8. arXiv:2303.15378  [pdf, other

    cs.LG cs.DC cs.MA

    CoDeC: Communication-Efficient Decentralized Continual Learning

    Authors: Sakshi Choudhary, Sai Aparna Aketi, Gobinda Saha, Kaushik Roy

    Abstract: Training at the edge utilizes continuously evolving data generated at different locations. Privacy concerns prohibit the co-location of this spatially as well as temporally distributed data, deeming it crucial to design training algorithms that enable efficient continual learning over decentralized private data. Decentralized learning allows serverless training with spatially distributed data. A f… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

  9. arXiv:2209.14390  [pdf, other

    cs.LG cs.DC cs.MA

    Neighborhood Gradient Clustering: An Efficient Decentralized Learning Method for Non-IID Data Distributions

    Authors: Sai Aparna Aketi, Sangamesh Kodge, Kaushik Roy

    Abstract: Decentralized learning over distributed datasets can have significantly different data distributions across the agents. The current state-of-the-art decentralized algorithms mostly assume the data distributions to be Independent and Identically Distributed. This paper focuses on improving decentralized learning over non-IID data. We propose \textit{Neighborhood Gradient Clustering (NGC)}, a novel… ▽ More

    Submitted 20 March, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: 29 pages, 5 figures, 16 tables. arXiv admin note: text overlap with arXiv:2103.02051 by other authors

  10. Low Precision Decentralized Distributed Training over IID and non-IID Data

    Authors: Sai Aparna Aketi, Sangamesh Kodge, Kaushik Roy

    Abstract: Decentralized distributed learning is the key to enabling large-scale machine learning (training) on edge devices utilizing private user-generated local data, without relying on the cloud. However, the practical realization of such on-device training is limited by the communication and compute bottleneck. In this paper, we propose and show the convergence of low precision decentralized training th… ▽ More

    Submitted 11 September, 2022; v1 submitted 17 November, 2021; originally announced November 2021.

    Comments: 11 pages, 7 figures, 9 tables

    Journal ref: Neural Networks 2022

  11. arXiv:2102.05715  [pdf, other

    cs.LG cs.AI cs.CV cs.DC

    Sparse-Push: Communication- & Energy-Efficient Decentralized Distributed Learning over Directed & Time-Varying Graphs with non-IID Datasets

    Authors: Sai Aparna Aketi, Amandeep Singh, Jan Rabaey

    Abstract: Current deep learning (DL) systems rely on a centralized computing paradigm which limits the amount of available training data, increases system latency, and adds privacy and security constraints. On-device learning, enabled by decentralized and distributed training of DL models over peer-to-peer wirelessly connected edge devices, not only alleviate the above limitations but also enable next-gen a… ▽ More

    Submitted 11 February, 2021; v1 submitted 10 February, 2021; originally announced February 2021.

    Comments: 12 pages, 7 figures, 7 tables

  12. arXiv:2002.11052  [pdf, other

    cs.LG cs.CV stat.ML

    Relevant-features based Auxiliary Cells for Energy Efficient Detection of Natural Errors

    Authors: Sai Aparna Aketi, Priyadarshini Panda, Kaushik Roy

    Abstract: Deep neural networks have demonstrated state-of-the-art performance on many classification tasks. However, they have no inherent capability to recognize when their predictions are wrong. There have been several efforts in the recent past to detect natural errors but the suggested mechanisms pose additional energy requirements. To address this issue, we propose an ensemble of classifiers at hidden… ▽ More

    Submitted 25 February, 2020; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: 16 pages, 3 figures, 6 tables

  13. arXiv:2002.09958  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    Gradual Channel Pruning while Training using Feature Relevance Scores for Convolutional Neural Networks

    Authors: Sai Aparna Aketi, Sourjya Roy, Anand Raghunathan, Kaushik Roy

    Abstract: The enormous inference cost of deep neural networks can be scaled down by network compression. Pruning is one of the predominant approaches used for deep network compression. However, existing pruning techniques have one or more of the following limitations: 1) Additional energy cost on top of the compute heavy training stage due to pruning and fine-tuning stages, 2) Layer-wise pruning based on th… ▽ More

    Submitted 29 April, 2020; v1 submitted 23 February, 2020; originally announced February 2020.

    Comments: 15 pages, 2 figures, 4 tables

  14. SERAD: Soft Error Resilient Asynchronous Design using a Bundled Data Protocol

    Authors: Sai Aparna Aketi, Smriti Gupta, Huimei Cheng, Joycee Mekie, Peter A. Beerel

    Abstract: The risk of soft errors due to radiation continues to be a significant challenge for engineers trying to build systems that can handle harsh environments. Building systems that are Radiation Hardened by Design (RHBD) is the preferred approach, but existing techniques are expensive in terms of performance, power, and/or area. This paper introduces a novel soft-error resilient asynchronous bundled-d… ▽ More

    Submitted 12 January, 2020; originally announced January 2020.