Zum Hauptinhalt springen

Showing 1–50 of 156 results for author: Chi, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.16981  [pdf, ps, other

    cs.LG math.OC stat.ML

    The Sample-Communication Complexity Trade-off in Federated Q-Learning

    Authors: Sudeep Salgia, Yuejie Chi

    Abstract: We consider the problem of federated Q-learning, where $M$ agents aim to collaboratively learn the optimal Q-function of an unknown infinite-horizon Markov decision process with finite state and action spaces. We investigate the trade-off between sample and communication complexities for the widely used class of intermittent communication algorithms. We first establish the converse result, where i… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

  2. arXiv:2408.10147  [pdf, other

    cs.LG cs.CL cs.IT math.OC stat.ML

    In-Context Learning with Representations: Contextual Generalization of Trained Transformers

    Authors: Tong Yang, Yu Huang, Yingbin Liang, Yuejie Chi

    Abstract: In-context learning (ICL) refers to a remarkable capability of pretrained large language models, which can learn a new task given a few examples during inference. However, theoretical understanding of ICL is largely under-explored, particularly whether transformers can be trained to generalize to unseen examples in a prompt, which will require the model to acquire contextual knowledge of the promp… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  3. arXiv:2408.02320  [pdf, ps, other

    cs.LG eess.SP math.NA math.ST stat.ML

    A Sharp Convergence Theory for The Probability Flow ODEs of Diffusion Models

    Authors: Gen Li, Yuting Wei, Yuejie Chi, Yuxin Chen

    Abstract: Diffusion models, which convert noise into new data instances by learning to reverse a diffusion process, have become a cornerstone in contemporary generative modeling. In this work, we develop non-asymptotic convergence theory for a popular diffusion-based sampler (i.e., the probability flow ODE sampler) in discrete time, assuming access to $\ell_2$-accurate estimates of the (Stein) score functio… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

    Comments: This manuscript presents improved theory for probability flow ODEs compared to its earlier version arXiv:2306.09251

  4. arXiv:2407.16521  [pdf, other

    cs.CL

    AMONGAGENTS: Evaluating Large Language Models in the Interactive Text-Based Social Deduction Game

    Authors: Yizhou Chi, Lingjun Mao, Zineng Tang

    Abstract: Strategic social deduction games serve as valuable testbeds for evaluating the understanding and inference skills of language models, offering crucial insights into social science, artificial intelligence, and strategic gaming. This paper focuses on creating proxies of human behavior in simulated environments, with Among Us utilized as a tool for studying simulated human behavior. The study introd… ▽ More

    Submitted 24 July, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

    Comments: Wordplay @ ACL 2024

  5. arXiv:2407.13255  [pdf, other

    cs.IT eess.SP

    Interleaved Block-Sparse Transform

    Authors: Lei Liu, Ming Wang, Shufeng Li, Yuhao Chi, Ning Wei, ZhaoYang Zhang

    Abstract: Low-complexity Bayes-optimal memory approximate message passing (MAMP) is an efficient signal estimation algorithm in compressed sensing and multicarrier modulation. However, achieving replica Bayes optimality with MAMP necessitates a large-scale right-unitarily invariant transformation, which is prohibitive in practical systems due to its high computational complexity and hardware costs. To solve… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: Submitted to the IEEE Journal

  6. arXiv:2406.14420  [pdf, other

    cs.LG cs.DC cs.DS math.OC

    Communication-efficient Vertical Federated Learning via Compressed Error Feedback

    Authors: Pedro Valdeira, João Xavier, Cláudia Soares, Yuejie Chi

    Abstract: Communication overhead is a known bottleneck in federated learning (FL). To address this, lossy compression is commonly used on the information communicated between the server and clients during training. In horizontal FL, where each client holds a subset of the samples, such communication-compressed training methods have recently seen significant progress. However, in their vertical FL counterpar… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  7. arXiv:2406.01219  [pdf, other

    cs.CR cs.SE

    Constraint-based Adversarial Example Synthesis

    Authors: Fang Yu, Ya-Yu Chi, Yu-Fang Chen

    Abstract: In the era of rapid advancements in artificial intelligence (AI), neural network models have achieved notable breakthroughs. However, concerns arise regarding their vulnerability to adversarial attacks. This study focuses on enhancing Concolic Testing, a specialized technique for testing Python programs implementing neural networks. The extended tool, PyCT, now accommodates a broader range of neur… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  8. arXiv:2406.00519  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Discrete Concepts in Latent Hierarchical Models

    Authors: Lingjing Kong, Guangyi Chen, Biwei Huang, Eric P. Xing, Yuejie Chi, Kun Zhang

    Abstract: Learning concepts from natural high-dimensional data (e.g., images) holds potential in building human-aligned and interpretable machine learning models. Despite its encouraging prospect, formalization and theoretical insights into this crucial task are still lacking. In this work, we formalize concepts as discrete latent causal variables that are related via a hierarchical causal model that encode… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  9. arXiv:2405.19320  [pdf, other

    cs.LG cs.AI stat.ML

    Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF

    Authors: Shicong Cen, Jincheng Mei, Katayoon Goshvadi, Hanjun Dai, Tong Yang, Sherry Yang, Dale Schuurmans, Yuejie Chi, Bo Dai

    Abstract: Reinforcement learning from human feedback (RLHF) has demonstrated great promise in aligning large language models (LLMs) with human preference. Depending on the availability of preference data, both online and offline RLHF are active areas of investigation. A key bottleneck is understanding how to incorporate uncertainty estimation in the reward function learned from the preference data for RLHF,… ▽ More

    Submitted 5 July, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  10. arXiv:2405.15784  [pdf, other

    cs.IR cs.AI cs.CL

    CLARINET: Augmenting Language Models to Ask Clarification Questions for Retrieval

    Authors: Yizhou Chi, Jessy Lin, Kevin Lin, Dan Klein

    Abstract: Users often make ambiguous requests that require clarification. We study the problem of asking clarification questions in an information retrieval setting, where systems often face ambiguous search queries and it is challenging to turn the uncertainty in the retrieval model into a natural language question. We present CLARINET, a system that asks informative clarification questions by choosing que… ▽ More

    Submitted 28 April, 2024; originally announced May 2024.

  11. arXiv:2405.14766  [pdf, other

    cs.CL cs.LG

    Evaluating Large Language Models for Public Health Classification and Extraction Tasks

    Authors: Joshua Harris, Timothy Laurence, Leo Loman, Fan Grayson, Toby Nonnenmacher, Harry Long, Loes WalsGriffith, Amy Douglas, Holly Fountain, Stelios Georgiou, Jo Hardstaff, Kathryn Hopkins, Y-Ling Chi, Galena Kuyumdzhieva, Lesley Larkin, Samuel Collins, Hamish Mohammed, Thomas Finnie, Luke Hounsome, Steven Riley

    Abstract: Advances in Large Language Models (LLMs) have led to significant interest in their potential to support human experts across a range of domains, including public health. In this work we present automated evaluations of LLMs for public health tasks involving the classification and extraction of free text. We combine six externally annotated datasets with seven new internally annotated datasets to e… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 33 pages. Feedback and comments are highly appreciated

    MSC Class: 68T50

  12. arXiv:2405.02604  [pdf, ps, other

    cs.IT eess.SP

    Interleave Frequency Division Multiplexing

    Authors: Yuhao Chi, Lei Liu, Yao Ge, Xuehui Chen, Ying Li, Zhaoyang Zhang

    Abstract: In this letter, we study interleave frequency division multiplexing (IFDM) for multicarrier modulation in static multipath and mobile time-varying channels, which outperforms orthogonal frequency division multiplexing (OFDM), orthogonal time frequency space (OTFS), and affine frequency division multiplexing (AFDM) by considering practical advanced detectors. The fundamental principle underlying ex… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: Accepted by IEEE Wireless Communications Letters

  13. arXiv:2404.19264  [pdf, other

    cs.RO

    DiffuseLoco: Real-Time Legged Locomotion Control with Diffusion from Offline Datasets

    Authors: Xiaoyu Huang, Yufeng Chi, Ruofeng Wang, Zhongyu Li, Xue Bin Peng, Sophia Shao, Borivoje Nikolic, Koushil Sreenath

    Abstract: This work introduces DiffuseLoco, a framework for training multi-skill diffusion-based policies for dynamic legged locomotion from offline datasets, enabling real-time control of diverse skills on robots in the real world. Offline learning at scale has led to breakthroughs in computer vision, natural language processing, and robotic manipulation domains. However, scaling up learning for legged rob… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  14. arXiv:2404.18909  [pdf, other

    cs.LG cs.MA stat.ML

    Sample-Efficient Robust Multi-Agent Reinforcement Learning in the Face of Environmental Uncertainty

    Authors: Laixi Shi, Eric Mazumdar, Yuejie Chi, Adam Wierman

    Abstract: To overcome the sim-to-real gap in reinforcement learning (RL), learned policies must maintain robustness against environmental uncertainties. While robust RL has been widely studied in single-agent regimes, in multi-agent environments, the problem remains understudied -- despite the fact that the problems posed by environmental uncertainties are often exacerbated by strategic interactions. This w… ▽ More

    Submitted 8 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted by International Conference on Machine Learning, 2024

  15. arXiv:2404.16066  [pdf, other

    cs.HC cs.LG cs.SI

    Social Media Use is Predictable from App Sequences: Using LSTM and Transformer Neural Networks to Model Habitual Behavior

    Authors: Heinrich Peters, Joseph B. Bayer, Sandra C. Matz, Yikun Chi, Sumer S. Vaid, Gabriella M. Harari

    Abstract: The present paper introduces a novel approach to studying social media habits through predictive modeling of sequential smartphone user behaviors. While much of the literature on media and technology habits has relied on self-report questionnaires and simple behavioral frequency measures, we examine an important yet understudied aspect of media and technology habits: their embeddedness in repetiti… ▽ More

    Submitted 23 June, 2024; v1 submitted 20 April, 2024; originally announced April 2024.

  16. arXiv:2404.05966  [pdf, other

    cs.CL cs.AI

    THOUGHTSCULPT: Reasoning with Intermediate Revision and Search

    Authors: Yizhou Chi, Kevin Yang, Dan Klein

    Abstract: We present THOUGHTSCULPT, a general reasoning and search method for tasks with outputs that can be decomposed into components. THOUGHTSCULPT explores a search tree of potential solutions using Monte Carlo Tree Search (MCTS), building solutions one action at a time and evaluating according to any domain-specific heuristic, which in practice is often simply an LLM evaluator. Critically, our action s… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Code and data available at https://github.com/cyzus/thoughtsculpt

  17. arXiv:2404.01365  [pdf, other

    cs.LG cs.AI cs.CL

    Prompt-prompted Adaptive Structured Pruning for Efficient LLM Generation

    Authors: Harry Dong, Beidi Chen, Yuejie Chi

    Abstract: With the development of transformer-based large language models (LLMs), they have been applied to many fields due to their remarkable utility, but this comes at a considerable computational cost at deployment. Fortunately, some methods such as pruning or constructing a mixture of experts (MoE) aim at exploiting sparsity in transformer feedforward (FF) blocks to gain boosts in speed and reduction i… ▽ More

    Submitted 11 August, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: Revision 1: Updated abstract with code link; re-ran top-k + sampling rows in Table 4, conclusions unchanged Revision 2: Reframing and new experiments, conclusions unchanged

  18. arXiv:2403.19066  [pdf, other

    cs.CV cs.AI

    Generative Quanta Color Imaging

    Authors: Vishal Purohit, Junjie Luo, Yiheng Chi, Qi Guo, Stanley H. Chan, Qiang Qiu

    Abstract: The astonishing development of single-photon cameras has created an unprecedented opportunity for scientific and industrial imaging. However, the high data throughput generated by these 1-bit sensors creates a significant bottleneck for low-power applications. In this paper, we explore the possibility of generating a color image from a single binary frame of a single-photon camera. We evidently fi… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted at IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024

  19. arXiv:2403.17042  [pdf, other

    eess.IV cs.CV cs.LG eess.SP math.OC stat.ML

    Provably Robust Score-Based Diffusion Posterior Sampling for Plug-and-Play Image Reconstruction

    Authors: Xingyu Xu, Yuejie Chi

    Abstract: In a great number of tasks in science and engineering, the goal is to infer an unknown image from a small number of measurements collected from a known forward model describing certain sensing or imaging modality. Due to resource constraints, this task is often extremely ill-posed, which necessitates the adoption of expressive prior information to regularize the solution space. Score-based diffusi… ▽ More

    Submitted 11 June, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  20. arXiv:2403.12946  [pdf, ps, other

    cs.LG math.ST

    Sample Complexity of Offline Distributionally Robust Linear Markov Decision Processes

    Authors: He Wang, Laixi Shi, Yuejie Chi

    Abstract: In offline reinforcement learning (RL), the absence of active exploration calls for attention on the model robustness to tackle the sim-to-real gap, where the discrepancy between the simulated and deployed environments can significantly undermine the performance of the learned policy. To endow the learned policy with robustness in a sample-efficient manner in the presence of high-dimensional state… ▽ More

    Submitted 26 June, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: accepted by Reinforcement Learning Conference (RLC)

  21. arXiv:2403.03852  [pdf, other

    cs.LG cs.AI cs.IT math.OC stat.ML

    Accelerating Convergence of Score-Based Diffusion Models, Provably

    Authors: Gen Li, Yu Huang, Timofey Efimov, Yuting Wei, Yuejie Chi, Yuxin Chen

    Abstract: Score-based diffusion models, while achieving remarkable empirical performance, often suffer from low sampling speed, due to extensive function evaluations needed during the sampling phase. Despite a flurry of recent activities towards speeding up diffusion generative modeling in practice, theoretical underpinnings for acceleration techniques remain severely limited. In this paper, we design novel… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: The first two authors contributed equally

  22. arXiv:2403.02233  [pdf, other

    cs.LG math.OC stat.ML

    How Transformers Learn Diverse Attention Correlations in Masked Vision Pretraining

    Authors: Yu Huang, Zixin Wen, Yuejie Chi, Yingbin Liang

    Abstract: Masked reconstruction, which predicts randomly masked patches from unmasked ones, has emerged as an important approach in self-supervised pretraining. However, the theoretical understanding of masked pretraining is rather limited, especially for the foundational architecture of transformers. In this paper, to the best of our knowledge, we provide the first end-to-end theoretical guarantee of learn… ▽ More

    Submitted 4 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: v2 polishes writing

  23. arXiv:2402.15309  [pdf, other

    cs.LG cs.CL

    Counterfactual Generation with Identifiability Guarantees

    Authors: Hanqi Yan, Lingjing Kong, Lin Gui, Yuejie Chi, Eric Xing, Yulan He, Kun Zhang

    Abstract: Counterfactual generation lies at the core of various machine learning tasks, including image translation and controllable text generation. This generation process usually requires the identification of the disentangled latent representations, such as content and style, that underlie the observed data. However, it becomes more challenging when faced with a scarcity of paired data and labeling info… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: Neurips23. Controllable generation in causal perspective with a case study of ChatGPT, sheds light on theory-guaranteed alignment in language models

  24. arXiv:2402.09398  [pdf, other

    cs.LG cs.AI

    Get More with LESS: Synthesizing Recurrence with KV Cache Compression for Efficient LLM Inference

    Authors: Harry Dong, Xinyu Yang, Zhenyu Zhang, Zhangyang Wang, Yuejie Chi, Beidi Chen

    Abstract: Many computational factors limit broader deployment of large language models. In this paper, we focus on a memory bottleneck imposed by the key-value (KV) cache, a computational shortcut that requires storing previous KV pairs during decoding. While existing KV cache methods approach this problem by pruning or evicting large swaths of relatively less important KV pairs to dramatically reduce the m… ▽ More

    Submitted 12 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  25. arXiv:2402.05876  [pdf, other

    cs.LG cs.MA stat.ML

    Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices

    Authors: Jiin Woo, Laixi Shi, Gauri Joshi, Yuejie Chi

    Abstract: Offline reinforcement learning (RL), which seeks to learn an optimal policy using offline data, has garnered significant interest due to its potential in critical applications where online data collection is infeasible or expensive. This work explores the benefit of federated learning for offline RL, aiming at collaboratively leveraging offline datasets at multiple agents. Focusing on finite-horiz… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  26. arXiv:2402.02698  [pdf, other

    cs.LG cs.AI math.OC

    Beyond Expectations: Learning with Stochastic Dominance Made Practical

    Authors: Shicong Cen, Jincheng Mei, Hanjun Dai, Dale Schuurmans, Yuejie Chi, Bo Dai

    Abstract: Stochastic dominance models risk-averse preferences for decision making with uncertain outcomes, which naturally captures the intrinsic structure of the underlying uncertainty, in contrast to simply resorting to the expectations. Despite theoretically appealing, the application of stochastic dominance in machine learning has been scarce, due to the following challenges: $\textbf{i)}$, the original… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  27. arXiv:2401.17094  [pdf, ps, other

    math.CO cs.IT

    Constructing rotatable permutations of $\mathbb{F}_{2^m}^3$ with $3$-homogeneous functions

    Authors: Yunwen Chi, Kangquan Li, Longjiang Qu

    Abstract: In the literature, there are many results about permutation polynomials over finite fields. However, very few permutations of vector spaces are constructed although it has been shown that permutations of vector spaces have many applications in cryptography, especially in constructing permutations with low differential and boomerang uniformities. In this paper, motivated by the butterfly structur… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  28. arXiv:2401.04244  [pdf, other

    eess.IV cs.CV

    Spatio-Temporal Turbulence Mitigation: A Translational Perspective

    Authors: Xingguang Zhang, Nicholas Chimitt, Yiheng Chi, Zhiyuan Mao, Stanley H. Chan

    Abstract: Recovering images distorted by atmospheric turbulence is a challenging inverse problem due to the stochastic nature of turbulence. Although numerous turbulence mitigation (TM) algorithms have been proposed, their efficiency and generalization to real-world dynamic scenarios remain severely limited. Building upon the intuitions of classical TM algorithms, we present the Deep Atmospheric TUrbulence… ▽ More

    Submitted 7 April, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Comments: Accepted by CVPR 2024, project page https://xg416.github.io/DATUM/

  29. arXiv:2311.18787  [pdf, other

    cs.LG cs.DC math.OC

    Communication-Efficient Federated Optimization over Semi-Decentralized Networks

    Authors: He Wang, Yuejie Chi

    Abstract: In large-scale federated and decentralized learning, communication efficiency is one of the most challenging bottlenecks. While gossip communication -- where agents can exchange information with their connected neighbors -- is more cost-effective than communicating with the remote server, it often requires a greater number of communication rounds, especially for large and sparse networks. To tackl… ▽ More

    Submitted 11 January, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

  30. arXiv:2311.18055  [pdf

    cs.RO math.GT physics.app-ph

    Adaptive Hierarchical Origami Metastructures

    Authors: Yanbin Li, Antonio Di Lallo, Junxi Zhu, Yinding Chi, Hao Su, Jie Yin

    Abstract: Shape-morphing capabilities are crucial for enabling multifunctionality in both biological and artificial systems. Various strategies for shape morphing have been proposed for applications in metamaterials and robotics. However, few of these approaches have achieved the ability to seamlessly transform into a multitude of volumetric shapes post-fabrication using a relatively simple actuation and co… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  31. arXiv:2311.12063  [pdf, other

    cs.CV

    DatasetNeRF: Efficient 3D-aware Data Factory with Generative Radiance Fields

    Authors: Yu Chi, Fangneng Zhan, Sibo Wu, Christian Theobalt, Adam Kortylewski

    Abstract: Progress in 3D computer vision tasks demands a huge amount of data, yet annotating multi-view images with 3D-consistent annotations, or point clouds with part segmentation is both time-consuming and challenging. This paper introduces DatasetNeRF, a novel approach capable of generating infinite, high-quality 3D-consistent 2D annotations alongside 3D point cloud segmentations, while utilizing minima… ▽ More

    Submitted 19 August, 2024; v1 submitted 18 November, 2023; originally announced November 2023.

    Comments: Accepted by ECCV 2024. Project page: https://ychgoaround.github.io/projects/DatasetNeRF/

  32. arXiv:2311.10189  [pdf, other

    cs.DC cs.AR

    TAPA-CS: Enabling Scalable Accelerator Design on Distributed HBM-FPGAs

    Authors: Neha Prakriya, Yuze Chi, Suhail Basalama, Linghao Song, Jason Cong

    Abstract: Despite the increasing adoption of Field-Programmable Gate Arrays (FPGAs) in compute clouds, there remains a significant gap in programming tools and abstractions which can leverage network-connected, cloud-scale, multi-die FPGAs to generate accelerators with high frequency and throughput. To this end, we propose TAPA-CS, a task-parallel dataflow programming framework which automatically partition… ▽ More

    Submitted 1 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  33. arXiv:2311.04012  [pdf, other

    cs.IT eess.SP

    Memory AMP for Generalized MIMO: Coding Principle and Information-Theoretic Optimality

    Authors: Yufei Chen, Lei Liu, Yuhao Chi, Ying Li, Zhaoyang Zhang

    Abstract: To support complex communication scenarios in next-generation wireless communications, this paper focuses on a generalized MIMO (GMIMO) with practical assumptions, such as massive antennas, practical channel coding, arbitrary input distributions, and general right-unitarily-invariant channel matrices (covering Rayleigh fading, certain ill-conditioned and correlated channel matrices). The orthogona… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 16 pages, 13 figures, accepted by IEEE TWC. arXiv admin note: substantial text overlap with arXiv:2310.17943

  34. arXiv:2311.00201  [pdf, ps, other

    cs.LG cs.AI

    Federated Natural Policy Gradient and Actor Critic Methods for Multi-task Reinforcement Learning

    Authors: Tong Yang, Shicong Cen, Yuting Wei, Yuxin Chen, Yuejie Chi

    Abstract: Federated reinforcement learning (RL) enables collaborative decision making of multiple distributed agents without sharing local data trajectories. In this work, we consider a multi-task setting, in which each agent has its own private reward function corresponding to different tasks, while sharing the same transition kernel of the environment. Focusing on infinite-horizon Markov decision processe… ▽ More

    Submitted 16 August, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

  35. arXiv:2310.19059  [pdf, other

    cs.LG cs.DC math.OC

    Escaping Saddle Points in Heterogeneous Federated Learning via Distributed SGD with Communication Compression

    Authors: Sijin Chen, Zhize Li, Yuejie Chi

    Abstract: We consider the problem of finding second-order stationary points of heterogeneous federated learning (FL). Previous works in FL mostly focus on first-order convergence guarantees, which do not rule out the scenario of unstable saddle points. Meanwhile, it is a key bottleneck of FL to achieve communication efficiency without compensating the learning accuracy, especially when local data are highly… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: 27 pages

  36. arXiv:2310.17943  [pdf, other

    cs.IT

    Low-Complexity and Information-Theoretic Optimal Memory AMP for Coded Generalized MIMO

    Authors: Yufei Chen, Lei Liu, Yuhao Chi, Ying Li, Zhaoyang Zhang

    Abstract: This paper considers a generalized multiple-input multiple-output (GMIMO) with practical assumptions, such as massive antennas, practical channel coding, arbitrary input distributions, and general right-unitarily-invariant channel matrices (covering Rayleigh fading, certain ill-conditioned and correlated channel matrices). Orthogonal/vector approximate message passing (OAMP/VAMP) has been proved t… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 6 pages, 6 figures, accepted at GLOBECOM 2023

  37. arXiv:2310.06159  [pdf, other

    cs.LG math.OC stat.ML

    Provably Accelerating Ill-Conditioned Low-rank Estimation via Scaled Gradient Descent, Even with Overparameterization

    Authors: Cong Ma, Xingyu Xu, Tian Tong, Yuejie Chi

    Abstract: Many problems encountered in science and engineering can be formulated as estimating a low-rank object (e.g., matrices and tensors) from incomplete, and possibly corrupted, linear measurements. Through the lens of matrix and tensor factorization, one of the most popular approaches is to employ simple iterative algorithms such as gradient descent (GD) to recover the low-rank factors directly, which… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: Book chapter for "Explorations in the Mathematics of Data Science - The Inaugural Volume of the Center for Approximation and Mathematical Data Analytics". arXiv admin note: text overlap with arXiv:2104.14526

  38. arXiv:2310.05230  [pdf, other

    math.OC cs.GT cs.IT cs.LG

    Global Convergence of Policy Gradient Methods in Reinforcement Learning, Games and Control

    Authors: Shicong Cen, Yuejie Chi

    Abstract: Policy gradient methods, where one searches for the policy of interest by maximizing the value functions using first-order information, become increasingly popular for sequential decision making in reinforcement learning, games, and control. Guaranteeing the global optimality of policy gradient methods, however, is highly nontrivial due to nonconcavity of the value functions. In this exposition, w… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: SIAG/OPT Views and News

  39. arXiv:2309.09977  [pdf, other

    cs.LG cs.DC cs.DS math.OC

    A Multi-Token Coordinate Descent Method for Semi-Decentralized Vertical Federated Learning

    Authors: Pedro Valdeira, Yuejie Chi, Cláudia Soares, João Xavier

    Abstract: Communication efficiency is a major challenge in federated learning (FL). In client-server schemes, the server constitutes a bottleneck, and while decentralized setups spread communications, they do not necessarily reduce them due to slower convergence. We propose Multi-Token Coordinate Descent (MTCD), a communication-efficient algorithm for semi-decentralized vertical federated learning, exploiti… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

  40. arXiv:2308.09693  [pdf, other

    cs.CV cs.LG eess.IV

    A Lightweight Transformer for Faster and Robust EBSD Data Collection

    Authors: Harry Dong, Sean Donegan, Megna Shah, Yuejie Chi

    Abstract: Three dimensional electron back-scattered diffraction (EBSD) microscopy is a critical tool in many applications in materials science, yet its data quality can fluctuate greatly during the arduous collection process, particularly via serial-sectioning. Fortunately, 3D EBSD data is inherently sequential, opening up the opportunity to use transformers, state-of-the-art deep learning architectures tha… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

  41. arXiv:2307.13824  [pdf, other

    cs.LG cs.AI

    Offline Reinforcement Learning with On-Policy Q-Function Regularization

    Authors: Laixi Shi, Robert Dadashi, Yuejie Chi, Pablo Samuel Castro, Matthieu Geist

    Abstract: The core challenge of offline reinforcement learning (RL) is dealing with the (potentially catastrophic) extrapolation error induced by the distribution shift between the history dataset and the desired policy. A large portion of prior work tackles this challenge by implicitly/explicitly regularizing the learning policy towards the behavior policy, which is hard to estimate reliably in practice. I… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: Published at European Conference on Machine Learning (ECML), 2023

  42. arXiv:2307.07907  [pdf, other

    cs.LG

    Seeing is not Believing: Robust Reinforcement Learning against Spurious Correlation

    Authors: Wenhao Ding, Laixi Shi, Yuejie Chi, Ding Zhao

    Abstract: Robustness has been extensively studied in reinforcement learning (RL) to handle various forms of uncertainty such as random perturbations, rare events, and malicious attacks. In this work, we consider one critical type of robustness against spurious correlation, where different portions of the state do not have correlations induced by unobserved confounders. These spurious correlations are ubiqui… ▽ More

    Submitted 25 October, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

    Comments: Accepted to NeurIPS 2023

  43. arXiv:2307.02334  [pdf, ps, other

    eess.IV cs.CV

    Dual Arbitrary Scale Super-Resolution for Multi-Contrast MRI

    Authors: Jiamiao Zhang, Yichen Chi, Jun Lyu, Wenming Yang, Yapeng Tian

    Abstract: Limited by imaging systems, the reconstruction of Magnetic Resonance Imaging (MRI) images from partial measurement is essential to medical imaging research. Benefiting from the diverse and complementary information of multi-contrast MR images in different imaging modalities, multi-contrast Super-Resolution (SR) reconstruction is promising to yield SR images with higher quality. In the medical scen… ▽ More

    Submitted 10 July, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: Accepted by MICCAI2023

  44. arXiv:2306.17367  [pdf, other

    eess.IV cs.CV

    Spatially Varying Exposure with 2-by-2 Multiplexing: Optimality and Universality

    Authors: Xiangyu Qu, Yiheng Chi, Stanley H. Chan

    Abstract: The advancement of new digital image sensors has enabled the design of exposure multiplexing schemes where a single image capture can have multiple exposures and conversion gains in an interlaced format, similar to that of a Bayer color filter array. In this paper, we ask the question of how to design such multiplexing schemes for adaptive high-dynamic range (HDR) imaging where the multiplexing sc… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

  45. arXiv:2306.09251  [pdf, ps, other

    stat.ML cs.IT cs.LG math.ST

    Towards Faster Non-Asymptotic Convergence for Diffusion-Based Generative Models

    Authors: Gen Li, Yuting Wei, Yuxin Chen, Yuejie Chi

    Abstract: Diffusion models, which convert noise into new data instances by learning to reverse a Markov diffusion process, have become a cornerstone in contemporary generative modeling. While their practical power has now been widely recognized, the theoretical underpinnings remain far from mature. In this work, we develop a suite of non-asymptotic theory towards understanding the data generation process of… ▽ More

    Submitted 6 March, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: accepted in part to ICLR 2024

  46. arXiv:2306.07916  [pdf, other

    cs.LG cs.AI stat.ML

    Identification of Nonlinear Latent Hierarchical Models

    Authors: Lingjing Kong, Biwei Huang, Feng Xie, Eric Xing, Yuejie Chi, Kun Zhang

    Abstract: Identifying latent variables and causal structures from observational data is essential to many real-world applications involving biological data, medical data, and unstructured data such as images and languages. However, this task can be highly challenging, especially when observed variables are generated by causally related latent variables and the relationships are nonlinear. In this work, we i… ▽ More

    Submitted 31 October, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023

  47. arXiv:2306.04898  [pdf, other

    cs.LG cs.CV

    Understanding Masked Autoencoders via Hierarchical Latent Variable Models

    Authors: Lingjing Kong, Martin Q. Ma, Guangyi Chen, Eric P. Xing, Yuejie Chi, Louis-Philippe Morency, Kun Zhang

    Abstract: Masked autoencoder (MAE), a simple and effective self-supervised learning framework based on the reconstruction of masked image regions, has recently achieved prominent success in a variety of vision tasks. Despite the emergence of intriguing empirical observations on MAE, a theoretically principled understanding is still lacking. In this work, we formally characterize and justify existing empiric… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: CVPR 2023 Highlight

  48. arXiv:2305.19001  [pdf, other

    stat.ML cs.IT cs.LG math.OC math.ST

    High-probability sample complexities for policy evaluation with linear function approximation

    Authors: Gen Li, Weichen Wu, Yuejie Chi, Cong Ma, Alessandro Rinaldo, Yuting Wei

    Abstract: This paper is concerned with the problem of policy evaluation with linear function approximation in discounted infinite horizon Markov decision processes. We investigate the sample complexities required to guarantee a predefined estimation error of the best linear coefficients for two widely-used policy evaluation algorithms: the temporal difference (TD) learning algorithm and the two-timescale li… ▽ More

    Submitted 2 May, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: The first two authors contributed equally; paper accepted to IEEE Transactions on Information Theory

  49. arXiv:2305.16589  [pdf, other

    cs.LG cs.IT math.ST

    The Curious Price of Distributional Robustness in Reinforcement Learning with a Generative Model

    Authors: Laixi Shi, Gen Li, Yuting Wei, Yuxin Chen, Matthieu Geist, Yuejie Chi

    Abstract: This paper investigates model robustness in reinforcement learning (RL) to reduce the sim-to-real gap in practice. We adopt the framework of distributionally robust Markov decision processes (RMDPs), aimed at learning a policy that optimizes the worst-case performance when the deployed environment falls within a prescribed uncertainty set around the nominal MDP. Despite recent efforts, the sample… ▽ More

    Submitted 12 April, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Neural Information Processing Systems (2023)

  50. arXiv:2305.14708  [pdf, other

    cs.CV

    EgoVSR: Towards High-Quality Egocentric Video Super-Resolution

    Authors: Yichen Chi, Junhao Gu, Jiamiao Zhang, Wenming Yang, Yapeng Tian

    Abstract: Due to the limitations of capture devices and scenarios, egocentric videos frequently have low visual quality, mainly caused by high compression and severe motion blur. With the increasing application of egocentric videos, there is an urgent need to enhance the quality of these videos through super-resolution. However, existing Video Super-Resolution (VSR) works, focusing on third-person view vide… ▽ More

    Submitted 26 July, 2023; v1 submitted 24 May, 2023; originally announced May 2023.