Zum Hauptinhalt springen

Showing 1–18 of 18 results for author: Ying, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.15861  [pdf, other

    cs.LG cs.DC

    Achieving Dimension-Free Communication in Federated Learning via Zeroth-Order Optimization

    Authors: Zhe Li, Bicheng Ying, Zidong Liu, Haibo Yang

    Abstract: Federated Learning (FL) offers a promising framework for collaborative and privacy-preserving machine learning across distributed data sources. However, the substantial communication costs associated with FL pose a significant challenge to its efficiency. Specifically, in each communication round, the communication costs scale linearly with the model's dimension, which presents a formidable obstac… ▽ More

    Submitted 24 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2306.00256  [pdf, other

    cs.LG

    DSGD-CECA: Decentralized SGD with Communication-Optimal Exact Consensus Algorithm

    Authors: Lisang Ding, Kexin Jin, Bicheng Ying, Kun Yuan, Wotao Yin

    Abstract: Decentralized Stochastic Gradient Descent (SGD) is an emerging neural network training approach that enables multiple agents to train a model collaboratively and simultaneously. Rather than using a central parameter server to collect gradients from all the agents, each agent keeps a copy of the model parameters and communicates with a small number of other agents to exchange model updates. Their c… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

  3. arXiv:2111.04287  [pdf, other

    cs.DC cs.LG

    BlueFog: Make Decentralized Algorithms Practical for Optimization and Deep Learning

    Authors: Bicheng Ying, Kun Yuan, Hanbin Hu, Yiming Chen, Wotao Yin

    Abstract: Decentralized algorithm is a form of computation that achieves a global goal through local dynamics that relies on low-cost communication between directly-connected agents. On large-scale optimization tasks involving distributed datasets, decentralized algorithms have shown strong, sometimes superior, performance over distributed algorithms with a central node. Recently, developing decentralized a… ▽ More

    Submitted 8 November, 2021; originally announced November 2021.

  4. arXiv:2110.13363  [pdf, other

    cs.LG math.OC

    Exponential Graph is Provably Efficient for Decentralized Deep Training

    Authors: Bicheng Ying, Kun Yuan, Yiming Chen, Hanbin Hu, Pan Pan, Wotao Yin

    Abstract: Decentralized SGD is an emerging training method for deep learning known for its much less (thus faster) communication per iteration, which relaxes the averaging step in parallel SGD to inexact averaging. The less exact the averaging is, however, the more the total iterations the training needs to take. Therefore, the key to making decentralized SGD efficient is to realize nearly-exact averaging u… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

  5. arXiv:1903.10956  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    On the Influence of Bias-Correction on Distributed Stochastic Optimization

    Authors: Kun Yuan, Sulaiman A. Alghunaim, Bicheng Ying, Ali H. Sayed

    Abstract: Various bias-correction methods such as EXTRA, gradient tracking methods, and exact diffusion have been proposed recently to solve distributed {\em deterministic} optimization problems. These methods employ constant step-sizes and converge linearly to the {\em exact} solution under proper conditions. However, their performance under stochastic and adaptive settings is less explored. It is still un… ▽ More

    Submitted 11 July, 2019; v1 submitted 26 March, 2019; originally announced March 2019.

    Comments: 17 pages, 9 figure, submitted for publication

  6. arXiv:1810.08901  [pdf, other

    cs.SI cs.MA

    Dynamic Average Diffusion with randomized Coordinate Updates

    Authors: Bicheng Ying, Kun Yuan, Ali H. Sayed

    Abstract: This work derives and analyzes an online learning strategy for tracking the average of time-varying distributed signals by relying on randomized coordinate-descent updates. During each iteration, each agent selects or observes a random entry of the observation vector, and different agents may select different entries of their observations before engaging in a consultation step. Careful coordinatio… ▽ More

    Submitted 30 July, 2019; v1 submitted 21 October, 2018; originally announced October 2018.

  7. arXiv:1805.11384  [pdf, other

    cs.MA cs.LG math.OC

    Supervised Learning Under Distributed Features

    Authors: Bicheng Ying, Kun Yuan, Ali H. Sayed

    Abstract: This work studies the problem of learning under both large datasets and large-dimensional feature space scenarios. The feature information is assumed to be spread across agents in a network, where each agent observes some of the features. Through local cooperation, the agents are supposed to interact with each other to solve an inference problem and converge towards the global minimizer of an empi… ▽ More

    Submitted 22 May, 2020; v1 submitted 29 May, 2018; originally announced May 2018.

  8. arXiv:1803.07964  [pdf, other

    cs.LG math.OC stat.ML

    Stochastic Learning under Random Reshuffling with Constant Step-sizes

    Authors: Bicheng Ying, Kun Yuan, Stefan Vlaski, Ali H. Sayed

    Abstract: In empirical risk optimization, it has been observed that stochastic gradient implementations that rely on random reshuffling of the data achieve better performance than implementations that rely on sampling the data uniformly. Recent works have pursued justifications for this behavior by examining the convergence rate of the learning process under diminishing step-sizes. This work focuses on the… ▽ More

    Submitted 9 October, 2018; v1 submitted 21 March, 2018; originally announced March 2018.

  9. arXiv:1801.05479  [pdf, other

    cs.MA cs.SI

    Belief Control Strategies for Interactions over Weakly-Connected Graphs

    Authors: Hawraa Salami, Bicheng Ying, Ali H. Sayed

    Abstract: In diffusion social learning over weakly-connected graphs, it has been shown recently that influential agents shape the beliefs of non-influential agents. This paper analyzes this mechanism more closely and addresses two main questions. First, the article examines how much freedom influential agents have in controlling the beliefs of the receiving agents, namely, whether receiving agents can be dr… ▽ More

    Submitted 5 November, 2018; v1 submitted 16 January, 2018; originally announced January 2018.

    Comments: Submitted for publication

  10. arXiv:1708.01384  [pdf, other

    cs.LG math.OC stat.ML

    Variance-Reduced Stochastic Learning by Networked Agents under Random Reshuffling

    Authors: Kun Yuan, Bicheng Ying, Jiageng Liu, Ali H. Sayed

    Abstract: A new amortized variance-reduced gradient (AVRG) algorithm was developed in \cite{ying2017convergence}, which has constant storage requirement in comparison to SAGA and balanced gradient computations in comparison to SVRG. One key advantage of the AVRG strategy is its amenability to decentralized implementations. In this work, we show how AVRG can be extended to the network case where multiple lea… ▽ More

    Submitted 29 May, 2018; v1 submitted 4 August, 2017; originally announced August 2017.

    Comments: 23 pages, 12 figures, submitted for publication

  11. arXiv:1708.01383  [pdf, other

    cs.LG math.OC stat.ML

    Variance-Reduced Stochastic Learning under Random Reshuffling

    Authors: Bicheng Ying, Kun Yuan, Ali H. Sayed

    Abstract: Several useful variance-reduced stochastic gradient algorithms, such as SVRG, SAGA, Finito, and SAG, have been proposed to minimize empirical risks with linear convergence properties to the exact minimizer. The existing convergence results assume uniform data sampling with replacement. However, it has been observed in related works that random reshuffling can deliver superior performance over unif… ▽ More

    Submitted 16 February, 2018; v1 submitted 4 August, 2017; originally announced August 2017.

  12. arXiv:1704.06025  [pdf, other

    math.OC cs.MA stat.ML

    Performance Limits of Stochastic Sub-Gradient Learning, Part II: Multi-Agent Case

    Authors: Bicheng Ying, Ali H. Sayed

    Abstract: The analysis in Part I revealed interesting properties for subgradient learning algorithms in the context of stochastic optimization when gradient noise is present. These algorithms are used when the risk functions are non-smooth and involve non-differentiable components. They have been long recognized as being slow converging methods. However, it was revealed in Part I that the rate of convergenc… ▽ More

    Submitted 20 April, 2017; originally announced April 2017.

  13. arXiv:1609.03703  [pdf, other

    cs.SI cs.MA

    Social Learning over Weakly-Connected Graphs

    Authors: Hawraa Salami, Bicheng Ying, Ali H. Sayed

    Abstract: In this paper, we study diffusion social learning over weakly-connected graphs. We show that the asymmetric flow of information hinders the learning abilities of certain agents regardless of their local observations. Under some circumstances that we clarify in this work, a scenario of total influence (or "mind-control") arises where a set of influential agents ends up shaping the beliefs of non-in… ▽ More

    Submitted 6 January, 2017; v1 submitted 13 September, 2016; originally announced September 2016.

    Comments: To appear in 2017 in the IEEE Transactions on Signal and Information Processing over Networks

  14. arXiv:1607.01838  [pdf, other

    cs.MA cs.DC eess.SY

    Coordinate-Descent Diffusion Learning by Networked Agents

    Authors: Chengcheng Wang, Yonggang Zhang, Bicheng Ying, Ali H. Sayed

    Abstract: This work examines the mean-square error performance of diffusion stochastic algorithms under a generalized coordinate-descent scheme. In this setting, the adaptation step by each agent is limited to a random subset of the coordinates of its stochastic gradient vector. The selection of coordinates varies randomly from iteration to iteration and from agent to agent across the network. Such schemes… ▽ More

    Submitted 10 October, 2017; v1 submitted 6 July, 2016; originally announced July 2016.

    Comments: Accepted for publication

  15. arXiv:1603.04136  [pdf, ps, other

    math.OC cs.LG stat.ML

    On the Influence of Momentum Acceleration on Online Learning

    Authors: Kun Yuan, Bicheng Ying, Ali H. Sayed

    Abstract: The article examines in some detail the convergence rate and mean-square-error performance of momentum stochastic gradient methods in the constant step-size and slow adaptation regime. The results establish that momentum methods are equivalent to the standard stochastic gradient method with a re-scaled (larger) step-size value. The size of the re-scaling is determined by the value of the momentum… ▽ More

    Submitted 12 October, 2016; v1 submitted 14 March, 2016; originally announced March 2016.

    Comments: 66 pages, 9 figures, to appear in Journal of Machine Learning Research, 2016

  16. arXiv:1602.07630  [pdf, ps, other

    math.OC cs.LG stat.ML

    Online Dual Coordinate Ascent Learning

    Authors: Bicheng Ying, Kun Yuan, Ali H. Sayed

    Abstract: The stochastic dual coordinate-ascent (S-DCA) technique is a useful alternative to the traditional stochastic gradient-descent algorithm for solving large-scale optimization problems due to its scalability to large data sets and strong theoretical guarantees. However, the available S-DCA formulation is limited to finite sample sizes and relies on performing multiple passes over the same data. This… ▽ More

    Submitted 24 February, 2016; originally announced February 2016.

  17. arXiv:1511.07902  [pdf, other

    stat.ML cs.LG cs.MA

    Performance Limits of Stochastic Sub-Gradient Learning, Part I: Single Agent Case

    Authors: Bicheng Ying, Ali H. Sayed

    Abstract: In this work and the supporting Part II, we examine the performance of stochastic sub-gradient learning strategies under weaker conditions than usually considered in the literature. The new conditions are shown to be automatically satisfied by several important cases of interest including SVM, LASSO, and Total-Variation denoising formulations. In comparison, these problems do not satisfy the tradi… ▽ More

    Submitted 21 April, 2017; v1 submitted 24 November, 2015; originally announced November 2015.

    Comments: Part II is available on http://arxiv.org/abs/1704.06025

  18. arXiv:1412.1523  [pdf, ps, other

    cs.MA cs.IT cs.LG

    Information Exchange and Learning Dynamics over Weakly-Connected Adaptive Networks

    Authors: Bicheng Ying, Ali H. Sayed

    Abstract: The paper examines the learning mechanism of adaptive agents over weakly-connected graphs and reveals an interesting behavior on how information flows through such topologies. The results clarify how asymmetries in the exchange of data can mask local information at certain agents and make them totally dependent on other agents. A leader-follower relationship develops with the performance of some a… ▽ More

    Submitted 6 December, 2015; v1 submitted 3 December, 2014; originally announced December 2014.