Showing 1–50 of 74 results for author: Joshi, G

  1. arXiv:2406.00302  [pdf, other]

    cs.LG cs.DC

    FedAST: Federated Asynchronous Simultaneous Training

    Authors: Baris Askin, Pranay Sharma, Carlee Joe-Wong, Gauri Joshi

    Abstract: Federated Learning (FL) enables edge devices or clients to collaboratively train machine learning (ML) models without sharing their private data. Much of the existing work in FL focuses on efficiently learning a model for a single task. In this paper, we study simultaneous training of multiple FL models using a common set of clients. The few existing simultaneous training methods employ synchronou…

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Accepted to UAI 2024

  2. arXiv:2403.12329  [pdf, other]

    cs.LG cs.DC stat.ML

    FedFisher: Leveraging Fisher Information for One-Shot Federated Learning

    Authors: Divyansh Jhunjhunwala, Shiqiang Wang, Gauri Joshi

    Abstract: Standard federated learning (FL) algorithms typically require multiple rounds of communication between the server and the clients, which has several drawbacks, including requiring constant network connectivity, repeated investment of computational resources, and susceptibility to privacy attacks. One-Shot FL is a new paradigm that aims to address this challenge by enabling the server to train a gl…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted to AISTATS 2024

  3. arXiv:2402.05876  [pdf, other]

    cs.LG cs.MA stat.ML

    Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices

    Authors: Jiin Woo, Laixi Shi, Gauri Joshi, Yuejie Chi

    Abstract: Offline reinforcement learning (RL), which seeks to learn an optimal policy using offline data, has garnered significant interest due to its potential in critical applications where online data collection is infeasible or expensive. This work explores the benefit of federated learning for offline RL, aiming at collaboratively leveraging offline datasets at multiple agents. Focusing on finite-horiz…

    Submitted 8 February, 2024; originally announced February 2024.

  4. arXiv:2402.01147  [pdf, other]

    cs.LG cs.PF

    Efficient Reinforcement Learning for Routing Jobs in Heterogeneous Queueing Systems

    Authors: Neharika Jali, Guannan Qu, Weina Wang, Gauri Joshi

    Abstract: We consider the problem of efficiently routing jobs that arrive into a central queue to a system of heterogeneous servers. Unlike homogeneous systems, a threshold policy, that routes jobs to the slow server(s) when the queue length exceeds a certain threshold, is known to be optimal for the one-fast-one-slow two-server system. But an optimal policy for the multi-server system is unknown and non-tr…

    Submitted 21 April, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: AISTATS 2024; Corrected typos

  5. arXiv:2401.06432  [pdf, other]

    cs.LG cs.DC

    Heterogeneous LoRA for Federated Fine-tuning of On-Device Foundation Models

    Authors: Yae Jee Cho, Luyang Liu, Zheng Xu, Aldi Fahrezi, Gauri Joshi

    Abstract: Foundation models (FMs) adapt well to specific domains or tasks with fine-tuning, and federated learning (FL) enables the potential for privacy-preserving fine-tuning of the FMs with on-device local data. For federated fine-tuning of FMs, we consider the FMs with small to medium parameter sizes of single digit billion at maximum, referred to as on-device FMs (ODFMs) that can be deployed on devices…

    Submitted 20 February, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

  6. arXiv:2310.18868  [pdf, other]

    cs.DC cs.LG

    Correlation Aware Sparsified Mean Estimation Using Random Projection

    Authors: Shuli Jiang, Pranay Sharma, Gauri Joshi

    Abstract: We study the problem of communication-efficient distributed vector mean estimation, a commonly used subroutine in distributed optimization and Federated Learning (FL). Rand-$k$ sparsification is a commonly used technique to reduce communication cost, where each client sends $k < d$ of its coordinates to the server. However, Rand-$k$ is agnostic to any correlations that might exist between clients…

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: 32 pages, 13 figures. Proceedings of the 37th Conference on Neural Information Processing Systems (NeurIPS 2023), New Orleans, USA

  7. arXiv:2310.18784  [pdf, other]

    cs.LG math.OC math.ST stat.ML

    High-probability Convergence Bounds for Nonlinear Stochastic Gradient Descent Under Heavy-tailed Noise

    Authors: Aleksandar Armacki, Pranay Sharma, Gauri Joshi, Dragana Bajovic, Dusan Jakovetic, Soummya Kar

    Abstract: We study high-probability convergence guarantees of learning on streaming data in the presence of heavy-tailed noise. In the proposed scenario, the model is updated in an online fashion, as new information is observed, without storing any additional data. To combat the heavy-tailed noise, we consider a general framework of nonlinear stochastic gradient descent (SGD), providing several strong resul…

    Submitted 30 April, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

    Comments: 30 pages, 3 figures

  8. arXiv:2307.08809  [pdf, other]

    cs.LG cs.AI cs.CV

    Local or Global: Selective Knowledge Assimilation for Federated Learning with Limited Labels

    Authors: Yae Jee Cho, Gauri Joshi, Dimitrios Dimitriadis

    Abstract: Many existing FL methods assume clients with fully-labeled data, while in realistic settings, clients have limited labels due to the expensive and laborious process of labeling. Limited labeled local data of the clients often leads to their local model having poor generalization abilities to their larger unlabeled local data, such as having class-distribution mismatch with the unlabeled data. As a…

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: To appear in the proceedings of ICCV 2023

  9. arXiv:2305.10697  [pdf, other]

    cs.LG stat.ML

    The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond

    Authors: Jiin Woo, Gauri Joshi, Yuejie Chi

    Abstract: When the data used for reinforcement learning (RL) are collected by multiple agents in a distributed manner, federated versions of RL algorithms allow collaborative learning without the need for agents to share their local data. In this paper, we consider federated Q-learning, which aims to learn an optimal Q-function by periodically aggregating local Q-estimates trained on local data alone. Focus…

    Submitted 12 December, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Short version at ICML 2023

  10. arXiv:2302.04249  [pdf, other]

    cs.LG cs.DC math.OC stat.ML

    Federated Minimax Optimization with Client Heterogeneity

    Authors: Pranay Sharma, Rohan Panda, Gauri Joshi

    Abstract: Minimax optimization has seen a surge in interest with the advent of modern applications such as GANs, and it is inherently more challenging than simple minimization. The difficulty is exacerbated by the training data residing at multiple edge devices or clients, especially when these clients can have heterogeneous datasets and local computation capabilities. We propose a general federate…

    Submitted 9 February, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

    Comments: 52 pages, 8 figures

  11. arXiv:2302.03109  [pdf, other]

    cs.LG cs.DC

    On the Convergence of Federated Averaging with Cyclic Client Participation

    Authors: Yae Jee Cho, Pranay Sharma, Gauri Joshi, Zheng Xu, Satyen Kale, Tong Zhang

    Abstract: Federated Averaging (FedAvg) and its variants are the most popular optimization algorithms in federated learning (FL). Previous convergence analyses of FedAvg either assume full client participation or partial client participation where the clients can be uniformly sampled. However, in practical cross-device FL systems, only a subset of clients that satisfy local criteria such as battery status, n…

    Submitted 6 February, 2023; originally announced February 2023.

  12. arXiv:2301.09604  [pdf, other]

    cs.LG

    FedExP: Speeding Up Federated Averaging via Extrapolation

    Authors: Divyansh Jhunjhunwala, Shiqiang Wang, Gauri Joshi

    Abstract: Federated Averaging (FedAvg) remains the most popular algorithm for Federated Learning (FL) optimization due to its simple implementation, stateless nature, and privacy guarantees combined with secure aggregation. Recent work has sought to generalize the vanilla averaging in FedAvg to a generalized gradient descent step by treating client updates as pseudo-gradients and using a server step size. W…

    Submitted 6 March, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: Accepted to ICLR 2023. V2 fixes minor typos and cleans up proofs

  13. arXiv:2207.14130  [pdf, other]

    cs.LG

    FedVARP: Tackling the Variance Due to Partial Client Participation in Federated Learning

    Authors: Divyansh Jhunjhunwala, Pranay Sharma, Aushim Nagarkatti, Gauri Joshi

    Abstract: Data-heterogeneous federated learning (FL) systems suffer from two significant sources of convergence error: 1) client drift error caused by performing multiple local optimization steps at clients, and 2) partial client participation error caused by the fact that only a small subset of the edge clients participate in every training round. We find that among these, only the former has received sign…

    Submitted 28 July, 2022; originally announced July 2022.

    Comments: Accepted to UAI 2022

  14. arXiv:2207.10992  [pdf, other]

    cs.CV cs.AI

    Taguchi based Design of Sequential Convolution Neural Network for Classification of Defective Fasteners

    Authors: Manjeet Kaur, Krishan Kumar Chauhan, Tanya Aggarwal, Pushkar Bharadwaj, Renu Vig, Isibor Kennedy Ihianle, Garima Joshi, Kayode Owa

    Abstract: Fasteners play a critical role in securing various parts of machinery. Deformations such as dents, cracks, and scratches on the surface of fasteners are caused by material properties and incorrect handling of equipment during production processes. As a result, quality control is required to ensure safe and reliable operations. The existing defect inspection method relies on manual examination, whi…

    Submitted 22 July, 2022; originally announced July 2022.

    Comments: 13 pages, 6 figures

  15. arXiv:2207.04330  [pdf, ps, other]

    cs.LG cs.DC math.OC stat.ML

    Multi-Model Federated Learning with Provable Guarantees

    Authors: Neelkamal Bhuyan, Sharayu Moharir, Gauri Joshi

    Abstract: Federated Learning (FL) is a variant of distributed learning where edge devices collaborate to learn a model without sharing their data with the central server or each other. We refer to the process of training multiple independent models simultaneously in a federated setting using a common pool of clients as multi-model FL. In this work, we propose two variants of the popular FedAvg algorithm for…

    Submitted 20 September, 2022; v1 submitted 9 July, 2022; originally announced July 2022.

  16. arXiv:2207.03983  [pdf, other]

    cs.PF cs.DC cs.IT

    Tackling Heterogeneous Traffic in Multi-access Systems via Erasure Coded Servers

    Authors: Tuhinangshu Choudhury, Weina Wang, Gauri Joshi

    Abstract: Most data generated by modern applications is stored in the cloud, and there is an exponential growth in the volume of jobs to access these data and perform computations using them. The volume of data access or computing jobs can be heterogeneous across different job types and can unpredictably change over time. Cloud service providers cope with this demand heterogeneity and unpredictability by ov…

    Submitted 14 August, 2022; v1 submitted 8 July, 2022; originally announced July 2022.

  17. arXiv:2206.10185  [pdf, other]

    cs.LG

    Federated Reinforcement Learning: Linear Speedup Under Markovian Sampling

    Authors: Sajad Khodadadian, Pranay Sharma, Gauri Joshi, Siva Theja Maguluri

    Abstract: Since reinforcement learning algorithms are notoriously data-intensive, the task of sampling observations from the environment is usually split across multiple agents. However, transferring these observations from the agents to a central location can be prohibitively expensive in terms of the communication cost, and it can also compromise the privacy of each agent's local behavior policy. In this…

    Submitted 21 June, 2022; originally announced June 2022.

    Comments: 69 pages, 1 figure, accepted to ICML 2022 for long presentation

  18. arXiv:2206.04723  [pdf, other]

    cs.LG

    On the Unreasonable Effectiveness of Federated Averaging with Heterogeneous Data

    Authors: Jianyu Wang, Rudrajit Das, Gauri Joshi, Satyen Kale, Zheng Xu, Tong Zhang

    Abstract: Existing theory predicts that data heterogeneity will degrade the performance of the Federated Averaging (FedAvg) algorithm in federated learning. However, in practice, the simple FedAvg algorithm converges very well. This paper explains the seemingly unreasonable effectiveness of FedAvg that contradicts the previous theoretical predictions. We find that the key assumption of bounded gradient diss…

    Submitted 9 June, 2022; originally announced June 2022.

  19. arXiv:2206.00799  [pdf, other]

    cs.LG

    Federated Learning under Distributed Concept Drift

    Authors: Ellango Jothimurugesan, Kevin Hsieh, Jianyu Wang, Gauri Joshi, Phillip B. Gibbons

    Abstract: Federated Learning (FL) under distributed concept drift is a largely unexplored area. Although concept drift is itself a well-studied phenomenon, it poses particular challenges for FL, because drifts arise staggered in time and space (across clients). To the best of our knowledge, this work is the first to explicitly study data heterogeneity in both dimensions. We first demonstrate that prior solu…

    Submitted 27 February, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: 20 pages. Published in AISTATS 2023

    ACM Class: I.2.6

  20. arXiv:2205.14840  [pdf, other]

    cs.LG

    Maximizing Global Model Appeal in Federated Learning

    Authors: Yae Jee Cho, Divyansh Jhunjhunwala, Tian Li, Virginia Smith, Gauri Joshi

    Abstract: Federated learning typically considers collaboratively training a global model using local data at edge clients. Clients may have their own individual requirements, such as having a minimal training loss threshold, which they expect to be met by the global model. However, due to client heterogeneity, the global model may not meet each client's requirements, and only a small subset may find the glo…

    Submitted 4 February, 2023; v1 submitted 30 May, 2022; originally announced May 2022.

  21. arXiv:2204.12703  [pdf, other]

    cs.LG

    Heterogeneous Ensemble Knowledge Transfer for Training Large Models in Federated Learning

    Authors: Yae Jee Cho, Andre Manoel, Gauri Joshi, Robert Sim, Dimitrios Dimitriadis

    Abstract: Federated learning (FL) enables edge-devices to collaboratively learn a model without disclosing their private data to a central aggregating server. Most existing FL algorithms require models of identical architecture to be deployed across the clients and server, making it infeasible to train large models due to clients' limited system resources. In this work, we propose a novel ensemble knowledge…

    Submitted 27 April, 2022; originally announced April 2022.

    Comments: To appear in the proceedings of the 31st International Joint Conference on Artificial Intelligence (IJCAI 2022)

  22. arXiv:2203.11724  [pdf]

    cs.LG cs.AI cs.SI

    Explainable Misinformation Detection Across Multiple Social Media Platforms

    Authors: Gargi Joshi, Ananya Srivastava, Bhargav Yagnik, Mohammed Hasan, Zainuddin Saiyed, Lubna A Gabralla, Ajith Abraham, Rahee Walambe, Ketan Kotecha

    Abstract: In this work, the integration of two machine learning approaches, namely domain adaptation and explainable AI, is proposed to address these two issues of generalized detection and explainability. Firstly, the Domain Adversarial Neural Network (DANN) develops a generalized misinformation detector across multiple social media platforms. DANN is employed to generate the classification results for test…

    Submitted 19 September, 2022; v1 submitted 20 March, 2022; originally announced March 2022.

    Comments: 28 pages, 4 figures

  23. arXiv:2203.04850  [pdf, other]

    math.OC cs.DC cs.LG

    Federated Minimax Optimization: Improved Convergence Analyses and Algorithms

    Authors: Pranay Sharma, Rohan Panda, Gauri Joshi, Pramod K. Varshney

    Abstract: In this paper, we consider nonconvex minimax optimization, which is gaining prominence in many modern machine learning applications such as GANs. Large-scale edge-based collection of training data in these applications calls for communication-efficient distributed optimization algorithms, such as those used in federated learning, to process the data. In this paper, we analyze Local stochastic grad…

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: 52 pages, 4 figures

  24. arXiv:2201.11865  [pdf, other]

    cs.LG cs.DC

    FedLite: A Scalable Approach for Federated Learning on Resource-constrained Clients

    Authors: Jianyu Wang, Hang Qi, Ankit Singh Rawat, Sashank Reddi, Sagar Waghmare, Felix X. Yu, Gauri Joshi

    Abstract: In classical federated learning, the clients contribute to the overall training by communicating local updates for the underlying model on their private data to a coordinating server. However, updating and communicating the entire model becomes prohibitively expensive when resource-constrained clients collectively aim to train a large machine learning model. Split learning provides a natural solut…

    Submitted 16 February, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

  25. arXiv:2110.07751  [pdf, other]

    cs.LG stat.ML

    Leveraging Spatial and Temporal Correlations in Sparsified Mean Estimation

    Authors: Divyansh Jhunjhunwala, Ankur Mallick, Advait Gadhikar, Swanand Kadhe, Gauri Joshi

    Abstract: We study the problem of estimating at a central server the mean of a set of vectors distributed across several nodes (one vector per node). When the vectors are high-dimensional, the communication cost of sending entire vectors may be prohibitive, and it may be imperative for them to use sparsification techniques. While most existing work on sparsified mean estimation is agnostic to the characteri…

    Submitted 14 October, 2021; originally announced October 2021.

    Comments: Accepted to NeurIPS 2021

  26. arXiv:2109.08119  [pdf, other]

    cs.LG

    Personalized Federated Learning for Heterogeneous Clients with Clustered Knowledge Transfer

    Authors: Yae Jee Cho, Jianyu Wang, Tarun Chiruvolu, Gauri Joshi

    Abstract: Personalized federated learning (FL) aims to train model(s) that can perform well for individual clients that are highly data and system heterogeneous. Most work in personalized FL, however, assumes using the same model architecture at all clients and increases the communication cost by sending/receiving models. This may not be feasible for realistic scenarios of FL. In practice, clients have high…

    Submitted 16 September, 2021; originally announced September 2021.

  27. Best-Arm Identification in Correlated Multi-Armed Bandits

    Authors: Samarth Gupta, Gauri Joshi, Osman Yağan

    Abstract: In this paper we consider the problem of best-arm identification in multi-armed bandits in the fixed confidence setting, where the goal is to identify, with probability $1-δ$ for some $δ>0$, the arm with the highest mean reward in minimum possible samples from the set of arms $\mathcal{K}$. Most existing best-arm identification algorithms and analyses operate under the assumption that the rewards…

    Submitted 10 September, 2021; originally announced September 2021.

    Journal ref: IEEE Journal on Selected Areas in Information Theory (Volume: 2, Issue: 2, June 2021)

  28. arXiv:2108.03120  [pdf, other]

    eess.SY cs.AI cs.LG cs.RO

    Stochastic Deep Model Reference Adaptive Control

    Authors: Girish Joshi, Girish Chowdhary

    Abstract: In this paper, we present a Stochastic Deep Neural Network-based Model Reference Adaptive Control. Building on our work "Deep Model Reference Adaptive Control", we extend the controller capability by using Bayesian deep neural networks (DNN) to represent uncertainties and model non-linearities. Stochastic Deep Model Reference Adaptive Control uses a Lyapunov-based method to adapt the output-layer…

    Submitted 4 August, 2021; originally announced August 2021.

    Comments: Accepted in IEEE CDC 2021

  29. arXiv:2107.06917  [pdf, other]

    cs.LG

    A Field Guide to Federated Optimization

    Authors: Jianyu Wang, Zachary Charles, Zheng Xu, Gauri Joshi, H. Brendan McMahan, Blaise Aguera y Arcas, Maruan Al-Shedivat, Galen Andrew, Salman Avestimehr, Katharine Daly, Deepesh Data, Suhas Diggavi, Hubert Eichner, Advait Gadhikar, Zachary Garrett, Antonious M. Girgis, Filip Hanzely, Andrew Hard, Chaoyang He, Samuel Horvath, Zhouyuan Huo, Alex Ingerman, Martin Jaggi, Tara Javidi, Peter Kairouz, et al. (28 additional authors not shown)

    Abstract: Federated learning and analytics are a distributed approach for collaboratively learning models (or statistics) from decentralized data, motivated by and designed for privacy protection. The distributed learning process can be formulated as solving federated optimization problems, which emphasize communication efficiency, data heterogeneity, compatibility with privacy and system requirements, and…

    Submitted 14 July, 2021; originally announced July 2021.

  30. Job Dispatching Policies for Queueing Systems with Unknown Service Rates

    Authors: Tuhinangshu Choudhury, Gauri Joshi, Weina Wang, Sanjay Shakkottai

    Abstract: In multi-server queueing systems where there is no central queue holding all incoming jobs, job dispatching policies are used to assign incoming jobs to the queue at one of the servers. Classic job dispatching policies such as join-the-shortest-queue and shortest expected delay assume that the service rates and queue lengths of the servers are known to the dispatcher. In this work, we tackle the p…

    Submitted 10 June, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

  31. arXiv:2106.02305  [pdf, other]

    cs.LG cs.DC stat.ML

    Local Adaptivity in Federated Learning: Convergence and Consistency

    Authors: Jianyu Wang, Zheng Xu, Zachary Garrett, Zachary Charles, Luyang Liu, Gauri Joshi

    Abstract: The federated learning (FL) framework trains a machine learning model using decentralized data stored at edge client devices by periodically aggregating locally trained models. Popular optimization algorithms of FL use vanilla (stochastic) gradient descent for both local updates at clients and global updates at the aggregating server. Recently, adaptive optimization methods such as AdaGrad have be…

    Submitted 4 June, 2021; originally announced June 2021.

  32. A Review on Explainability in Multimodal Deep Neural Nets

    Authors: Gargi Joshi, Rahee Walambe, Ketan Kotecha

    Abstract: Artificial Intelligence techniques powered by deep neural nets have achieved much success in several application domains, most significantly and notably in the Computer Vision applications and Natural Language Processing tasks. Surpassing human-level performance propelled the research in the applications where different modalities amongst language, vision, sensory, text play an important role in a…

    Submitted 18 May, 2021; v1 submitted 17 May, 2021; originally announced May 2021.

    Comments: 24 pages, 6 figures

    Journal ref: in IEEE Access, vol. 9, pp. 59800-59821, 2021

  33. arXiv:2105.04699  [pdf, other]

    cs.LG cs.AI

    Adaptive Policy Transfer in Reinforcement Learning

    Authors: Girish Joshi, Girish Chowdhary

    Abstract: Efficient and robust policy transfer remains a key challenge for reinforcement learning to become viable for real-world robotics. Policy transfer through warm initialization, imitation, or interacting over a large set of agents with randomized instances has been commonly applied to solve a variety of Reinforcement Learning tasks. However, this seems far from how skill transfer happens in the biol…

    Submitted 10 May, 2021; originally announced May 2021.

  34. arXiv:2102.04487  [pdf, other]

    cs.LG cs.DC stat.ML

    Adaptive Quantization of Model Updates for Communication-Efficient Federated Learning

    Authors: Divyansh Jhunjhunwala, Advait Gadhikar, Gauri Joshi, Yonina C. Eldar

    Abstract: Communication of model updates between client nodes and the central aggregating server is a major bottleneck in federated learning, especially in bandwidth-limited settings and high-dimensional models. Gradient quantization is an effective way of reducing the number of bits required to communicate each model update, albeit at the cost of having a higher error floor due to the higher variance of th…

    Submitted 8 February, 2021; originally announced February 2021.

    Comments: Accepted to ICASSP 2021

  35. arXiv:2012.13608  [pdf, other]

    cs.DC cs.IT

    Synergy via Redundancy: Adaptive Replication Strategies and Fundamental Limits

    Authors: Gauri Joshi, Dhruva Kaushal

    Abstract: The maximum possible throughput (or the rate of job completion) of a multi-server system is typically the sum of the service rates of individual servers. Recent work shows that launching multiple replicas of a job and canceling them as soon as one copy finishes can boost the throughput, especially when the service time distribution has high variability. This means that redundancy can, in fact, cre…

    Submitted 25 December, 2020; originally announced December 2020.

    Comments: Accepted for publication in the IEEE/ACM Transactions on Networking

  36. arXiv:2012.08009  [pdf, other]

    cs.LG cs.AI

    Bandit-based Communication-Efficient Client Selection Strategies for Federated Learning

    Authors: Yae Jee Cho, Samarth Gupta, Gauri Joshi, Osman Yağan

    Abstract: Due to communication constraints and intermittent client availability in federated learning, only a subset of clients can participate in each training round. While most prior works assume uniform and unbiased client selection, recent work on biased client selection has shown that selecting clients with higher local losses can improve error convergence speed. However, previously proposed biased sel…

    Submitted 14 December, 2020; originally announced December 2020.

  37. arXiv:2011.02920  [pdf, other]

    cs.RO eess.SY

    Asynchronous Deep Model Reference Adaptive Control

    Authors: Girish Joshi, Jasvir Virdi, Girish Chowdhary

    Abstract: In this paper, we present an asynchronous implementation of Deep Neural Network-based Model Reference Adaptive Control (DMRAC). We evaluate this new neuro-adaptive control architecture through flight tests on a small quadcopter. We demonstrate that a single DMRAC controller can handle significant nonlinearities due to severe system faults and deliberate wind disturbances while executing high-bandwidt…

    Submitted 4 November, 2020; originally announced November 2020.

    Comments: Accepted in CORL2020. arXiv admin note: text overlap with arXiv:1909.08602

  38. arXiv:2010.01243  [pdf, other]

    cs.LG cs.DC stat.ML

    Client Selection in Federated Learning: Convergence Analysis and Power-of-Choice Selection Strategies

    Authors: Yae Jee Cho, Jianyu Wang, Gauri Joshi

    Abstract: Federated learning is a distributed optimization paradigm that enables a large number of resource-limited client nodes to cooperatively train a model without data sharing. Several works have analyzed the convergence of federated learning by accounting for data heterogeneity, communication and computation limitations, and partial client participation. However, they assume unbiased client participati…

    Submitted 2 October, 2020; originally announced October 2020.

  39. arXiv:2009.03988  [pdf, other]

    cs.HC eess.SP

    A new architecture for hand-worn Sign language to Speech translator

    Authors: Sai Charan Bodda, Palki Gupta, Gaurav Joshi, Ayush Chaturvedi

    Abstract: People with speech and hearing impairments often rely on sign language to communicate with others but most of the general population cannot understand sign language and sign language itself is a difficult language to learn, so there is a definite need for technologies to translate sign language to speech. In this paper, we describe the design and implementation of Smart glove, a hand-worn hardware…

    Submitted 8 September, 2020; originally announced September 2020.

    Comments: 9 pages, 5 figures, research done as part of Samsung electronics

  40. arXiv:2009.01598  [pdf, other]

    cs.IT cs.DM cs.PF

    Service Rate Region: A New Aspect of Coded Distributed System Design

    Authors: Mehmet Aktas, Gauri Joshi, Swanand Kadhe, Fatemeh Kazemi, Emina Soljanin

    Abstract: Erasure coding has been recognized as a powerful method to mitigate delays due to slow or straggling nodes in distributed systems. This work shows that erasure coding of data objects can flexibly handle skews in the request rates. Coding can help boost the service rate region, that is, increase the overall volume of data access requests that the system can handle. This paper aims to postula…

    Submitted 27 June, 2021; v1 submitted 3 September, 2020; originally announced September 2020.

  41. arXiv:2007.10800  [pdf, other]

    cs.LG cs.CV eess.IV stat.ML

    Probabilistic Neighbourhood Component Analysis: Sample Efficient Uncertainty Estimation in Deep Learning

    Authors: Ankur Mallick, Chaitanya Dwivedi, Bhavya Kailkhura, Gauri Joshi, T. Yong-Jin Han

    Abstract: While Deep Neural Networks (DNNs) achieve state-of-the-art accuracy in various applications, they often fall short in accurately estimating their predictive uncertainty and, in turn, fail to recognize when these predictions may be wrong. Several uncertainty-aware models, such as Bayesian Neural Networks (BNNs) and Deep Ensembles, have been proposed in the literature for quantifying predictive uncert…

    Submitted 18 July, 2020; originally announced July 2020.

  42. arXiv:2007.07481  [pdf, other]

    cs.LG cs.DC stat.ML

    Tackling the Objective Inconsistency Problem in Heterogeneous Federated Optimization

    Authors: Jianyu Wang, Qinghua Liu, Hao Liang, Gauri Joshi, H. Vincent Poor

    Abstract: In federated optimization, heterogeneity in the clients' local datasets and computation speeds results in large variations in the number of local updates performed by each client in each communication round. Naive weighted aggregation of such models causes objective inconsistency, that is, the global model converges to a stationary point of a mismatched objective function which can be arbitrarily…

    Submitted 15 July, 2020; originally announced July 2020.

  43. arXiv:2003.10579  [pdf, other]

    stat.ML cs.DC cs.LG cs.PF

    Slow and Stale Gradients Can Win the Race

    Authors: Sanghamitra Dutta, Jianyu Wang, Gauri Joshi

    Abstract: Distributed Stochastic Gradient Descent (SGD), when run in a synchronous manner, suffers from delays in runtime as it waits for the slowest workers (stragglers). Asynchronous methods can alleviate stragglers, but cause gradient staleness that can adversely affect the convergence error. In this work, we present a novel theoretical characterization of the speedup offered by asynchronous methods by an…

    Submitted 23 March, 2020; originally announced March 2020.

    Comments: Some of the results have appeared in AISTATS 2018. This is an extended version with additional results, in particular, an adaptive synchronicity strategy called AdaSync. arXiv admin note: substantial text overlap with arXiv:1803.01113

  44. arXiv:2003.05649  [pdf, other]

    cs.LG cs.DC stat.ML

    Machine Learning on Volatile Instances

    Authors: Xiaoxi Zhang, Jianyu Wang, Gauri Joshi, Carlee Joe-Wong

    Abstract: Due to the massive size of the neural network models and training datasets used in machine learning today, it is imperative to distribute stochastic gradient descent (SGD) by splitting up tasks such as gradient evaluation across multiple worker nodes. However, running distributed SGD can be prohibitively expensive because it may require specialized computing resources such as GPUs for extended per…

    Submitted 12 March, 2020; originally announced March 2020.

  45. arXiv:2002.09539  [pdf, other]

    cs.LG cs.DC math.OC stat.ML

    Overlap Local-SGD: An Algorithmic Approach to Hide Communication Delays in Distributed SGD

    Authors: Jianyu Wang, Hao Liang, Gauri Joshi

    Abstract: Distributed stochastic gradient descent (SGD) is essential for scaling machine learning algorithms to a large number of computing nodes. However, infrastructure variability, such as high communication delay or random node slowdown, greatly impedes the performance of distributed SGD algorithms, especially in wireless systems or sensor networks. In this paper, we propose an algorithmic approa…

    Submitted 21 February, 2020; originally announced February 2020.

    Comments: Accepted to ICASSP 2020

  46. arXiv:1912.04977  [pdf, other]

    cs.LG cs.CR stat.ML

    Advances and Open Problems in Federated Learning

    Authors: Peter Kairouz, H. Brendan McMahan, Brendan Avent, Aurélien Bellet, Mehdi Bennis, Arjun Nitin Bhagoji, Kallista Bonawitz, Zachary Charles, Graham Cormode, Rachel Cummings, Rafael G. L. D'Oliveira, Hubert Eichner, Salim El Rouayheb, David Evans, Josh Gardner, Zachary Garrett, Adrià Gascón, Badih Ghazi, Phillip B. Gibbons, Marco Gruteser, Zaid Harchaoui, Chaoyang He, Lie He, Zhouyuan Huo, Ben Hutchinson, et al. (34 additional authors not shown)

    Abstract: Federated learning (FL) is a machine learning setting where many clients (e.g. mobile devices or whole organizations) collaboratively train a model under the orchestration of a central server (e.g. service provider), while keeping the training data decentralized. FL embodies the principles of focused data collection and minimization, and can mitigate many of the systemic privacy risks and costs re…

    Submitted 8 March, 2021; v1 submitted 10 December, 2019; originally announced December 2019.

    Comments: Published in Foundations and Trends in Machine Learning Vol 4 Issue 1. See: https://www.nowpublishers.com/article/Details/MAL-083

  47. Multi-Armed Bandits with Correlated Arms

    Authors: Samarth Gupta, Shreyas Chaudhari, Gauri Joshi, Osman Yağan

    Abstract: We consider a multi-armed bandit framework where the rewards obtained by pulling different arms are correlated. We develop a unified approach to leverage these reward correlations and present fundamental generalizations of classic bandit algorithms to the correlated setting. We present a unified proof technique to analyze the proposed algorithms. Rigorous analysis of C-UCB (the correlated bandit v…

    Submitted 10 September, 2021; v1 submitted 6 November, 2019; originally announced November 2019.

    Comments: A special case of the model studied in this paper is presented in arXiv:1808.05904

    Journal ref: IEEE Transactions on Information Theory, 2021

  48. arXiv:1910.05858  [pdf, other]

    cs.LG stat.ML

    Deep Kernels with Probabilistic Embeddings for Small-Data Learning

    Authors: Ankur Mallick, Chaitanya Dwivedi, Bhavya Kailkhura, Gauri Joshi, T. Yong-Jin Han

    Abstract: Gaussian Processes (GPs) are known to provide accurate predictions and uncertainty estimates even with small amounts of labeled data by capturing similarity between data points through their kernel function. However, traditional GP kernels are not very effective at capturing similarity between high-dimensional data points. Neural networks can be used to learn good representations that encode intric…

    Submitted 13 November, 2021; v1 submitted 13 October, 2019; originally announced October 2019.

  49. arXiv:1910.00762  [pdf, other]

    cs.LG stat.ML

    Accelerating Deep Learning by Focusing on the Biggest Losers

    Authors: Angela H. Jiang, Daniel L.-K. Wong, Giulio Zhou, David G. Andersen, Jeffrey Dean, Gregory R. Ganger, Gauri Joshi, Michael Kaminsky, Michael Kozuch, Zachary C. Lipton, Padmanabhan Pillai

    Abstract: This paper introduces Selective-Backprop, a technique that accelerates the training of deep neural networks (DNNs) by prioritizing examples with high loss at each iteration. Selective-Backprop uses the output of a training example's forward pass to decide whether to use that example to compute gradients and update parameters, or to skip immediately to the next example. By reducing the number of co…

    Submitted 1 October, 2019; originally announced October 2019.

  50. arXiv:1909.08602  [pdf, other]

    cs.LG eess.SY

    Deep Model Reference Adaptive Control

    Authors: Girish Joshi, Girish Chowdhary

    Abstract: We present a new neuroadaptive architecture: Deep Neural Network-based Model Reference Adaptive Control (DMRAC). Our architecture utilizes the power of deep neural network representations for modeling significant nonlinearities while marrying it with the boundedness guarantees that characterize MRAC-based controllers. We demonstrate through simulations and analysis that DMRAC can subsume previousl…

    Submitted 18 September, 2019; originally announced September 2019.

    Comments: Accepted in IEEE CDC-2019