Showing 1–50 of 177 results for author: Zhang, Q

Searching in archive stat. Search in all archives.
  1. arXiv:2409.02397  [pdf, other]

    stat.ME stat.AP

    High-dimensional Bayesian Model for Disease-Specific Gene Detection in Spatial Transcriptomics

    Authors: Qicheng Zhao, Qihuang Zhang

    Abstract: Identifying disease-indicative genes is critical for deciphering disease mechanisms and has attracted significant interest in biomedical research. Spatial transcriptomics offers unprecedented insights for the detection of disease-specific genes by enabling within-tissue contrasts. However, this new technology poses challenges for conventional statistical models developed for RNA-sequencing, as the…

    Submitted 3 September, 2024; originally announced September 2024.

    Comments: 23 Pages

  2. arXiv:2408.16068  [pdf, other]

    q-bio.GN cs.AI stat.ML

    Identification of Prognostic Biomarkers for Stage III Non-Small Cell Lung Carcinoma in Female Nonsmokers Using Machine Learning

    Authors: Huili Zheng, Qimin Zhang, Yiru Gong, Zheyan Liu, Shaohan Chen

    Abstract: Lung cancer remains a leading cause of cancer-related deaths globally, with non-small cell lung cancer (NSCLC) being the most common subtype. This study aimed to identify key biomarkers associated with stage III NSCLC in non-smoking females using gene expression profiling from the GDS3837 dataset. Utilizing XGBoost, a machine learning algorithm, the analysis achieved a strong predictive performanc…

    Submitted 29 August, 2024; v1 submitted 28 August, 2024; originally announced August 2024.

    Comments: This paper has been accepted for publication in the IEEE ICBASE 2024 conference

  3. arXiv:2407.13980  [pdf, other]

    stat.ME cs.LG stat.ML

    Byzantine-tolerant distributed learning of finite mixture models

    Authors: Qiong Zhang, Jiahua Chen

    Abstract: This paper proposes two split-and-conquer (SC) learning estimators for finite mixture models that are tolerant to Byzantine failures. In SC learning, individual machines obtain local estimates, which are then transmitted to a central server for aggregation. During this communication, the server may receive malicious or incorrect information from some local machines, a scenario known as Byzantine f…

    Submitted 18 July, 2024; originally announced July 2024.

    ACM Class: G.3; I.5.3

  4. arXiv:2407.06935  [pdf, other]

    cs.LG stat.CO stat.ML

    Bayesian Federated Learning with Hamiltonian Monte Carlo: Algorithm and Theory

    Authors: Jiajun Liang, Qian Zhang, Wei Deng, Qifan Song, Guang Lin

    Abstract: This work introduces a novel and efficient Bayesian federated learning algorithm, namely, the Federated Averaging stochastic Hamiltonian Monte Carlo (FA-HMC), for parameter estimation and uncertainty quantification. We establish rigorous convergence guarantees of FA-HMC on non-iid distributed data sets, under the strong convexity and Hessian smoothness assumptions. Our analysis investigates the ef…

    Submitted 9 July, 2024; originally announced July 2024.

  5. arXiv:2407.05400  [pdf, other]

    stat.ME

    Collaborative Analysis for Paired A/B Testing Experiments

    Authors: Qiong Zhang, Lulu Kang, Xinwei Deng

    Abstract: With the extensive use of digital devices, online experimental platforms are commonly used to conduct experiments to collect data for evaluating different variations of products, algorithms, and interface designs, a.k.a., A/B tests. In practice, multiple A/B testing experiments are often carried out based on a common user population on the same platform. The same user's responses to different expe…

    Submitted 7 July, 2024; originally announced July 2024.

  6. arXiv:2407.00028  [pdf, other]

    q-bio.NC cs.LG stat.AP

    Harnessing XGBoost for Robust Biomarker Selection of Obsessive-Compulsive Disorder (OCD) from Adolescent Brain Cognitive Development (ABCD) data

    Authors: Xinyu Shen, Qimin Zhang, Huili Zheng, Weiwei Qi

    Abstract: This study evaluates the performance of various supervised machine learning models in analyzing highly correlated neural signaling data from the Adolescent Brain Cognitive Development (ABCD) Study, with a focus on predicting obsessive-compulsive disorder scales. We simulated a dataset to mimic the correlation structures commonly found in imaging data and evaluated logistic regression, elastic netw…

    Submitted 14 May, 2024; originally announced July 2024.

  7. arXiv:2406.16859  [pdf, other]

    stat.ME

    On the extensions of the Chatterjee-Spearman test

    Authors: Qingyang Zhang

    Abstract: Chatterjee (2021) introduced a novel independence test that is rank-based, asymptotically normal and consistent against all alternatives. One limitation of Chatterjee's test is its low statistical power for detecting monotonic relationships. To address this limitation, in our previous work (Zhang, 2024, Commun. Stat. - Theory Methods), we proposed to combine Chatterjee's and Spearman's correlation…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 35 pages, 2 figures

  8. arXiv:2406.07455  [pdf, other]

    cs.LG stat.ML

    Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis

    Authors: Qining Zhang, Honghao Wei, Lei Ying

    Abstract: In this paper, we study reinforcement learning from human feedback (RLHF) under an episodic Markov decision process with a general trajectory-wise reward model. We developed a model-free RLHF best policy identification algorithm, called $\mathsf{BSAD}$, without explicit reward model inference, which is a critical intermediate step in the contemporary RLHF paradigms for training large language mode…

    Submitted 11 June, 2024; originally announced June 2024.

  9. arXiv:2406.01378  [pdf, ps, other]

    cs.LG stat.ML

    A Theory of Learnability for Offline Decision Making

    Authors: Chenjie Mao, Qiaosheng Zhang

    Abstract: We study the problem of offline decision making, which focuses on learning decisions from datasets only partially correlated with the learning objective. While previous research has extensively studied specific offline decision making problems like offline reinforcement learning (RL) and off-policy evaluation (OPE), a unified framework and theory remain absent. To address this gap, we introduce a…

    Submitted 3 June, 2024; originally announced June 2024.

  10. arXiv:2405.19440  [pdf, other]

    cs.LG math.OC stat.ML

    On the Convergence of Multi-objective Optimization under Generalized Smoothness

    Authors: Qi Zhang, Peiyao Xiao, Kaiyi Ji, Shaofeng Zou

    Abstract: Multi-objective optimization (MOO) is receiving more attention in various fields such as multi-task learning. Recent works provide some effective algorithms with theoretical analysis but they are limited by the standard $L$-smooth or bounded-gradient assumptions, which are typically unsatisfactory for neural networks, such as recurrent neural networks (RNNs) and transformers. In this paper, we stu…

    Submitted 1 July, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  11. arXiv:2405.16413  [pdf, other]

    cs.AI cs.CL cs.LG stat.AP

    Augmented Risk Prediction for the Onset of Alzheimer's Disease from Electronic Health Records with Large Language Models

    Authors: Jiankun Wang, Sumyeong Ahn, Taykhoom Dalal, Xiaodan Zhang, Weishen Pan, Qiannan Zhang, Bin Chen, Hiroko H. Dodge, Fei Wang, Jiayu Zhou

    Abstract: Alzheimer's disease (AD) is the fifth-leading cause of death among Americans aged 65 and older. Screening and early detection of AD and related dementias (ADRD) are critical for timely intervention and for identifying clinical trial participants. The widespread adoption of electronic health records (EHRs) offers an important resource for developing ADRD screening tools such as machine learning bas…

    Submitted 25 May, 2024; originally announced May 2024.

  12. arXiv:2405.08699  [pdf]

    stat.ML cs.LG

    Weakly-supervised causal discovery based on fuzzy knowledge and complex data complementarity

    Authors: Wenrui Li, Wei Zhang, Qinghao Zhang, Xuegong Zhang, Xiaowo Wang

    Abstract: Causal discovery based on observational data is important for deciphering the causal mechanism behind complex systems. However, the effectiveness of existing causal discovery methods is limited due to inferior prior knowledge, domain inconsistencies, and the challenges of high-dimensional datasets with small sample sizes. To address this gap, we propose a novel weakly-supervised fuzzy knowledge an…

    Submitted 14 May, 2024; originally announced May 2024.

  13. arXiv:2405.06813  [pdf, other]

    stat.ME math.ST

    A note on distance variance for categorical variables

    Authors: Qingyang Zhang

    Abstract: This study investigates the extension of distance variance, a validated spread metric for continuous and binary variables [Edelmann et al., 2020, Ann. Stat., 48(6)], to quantify the spread of general categorical variables. We provide both geometric and algebraic characterizations of distance variance, revealing its connections to some commonly used entropy measures, and the variance-covariance mat…

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 3 figures

  14. arXiv:2404.19292  [pdf, other]

    cs.IT cs.LG cs.MA stat.ML

    Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning

    Authors: Qiaosheng Zhang, Chenjia Bai, Shuyue Hu, Zhen Wang, Xuelong Li

    Abstract: This work designs and analyzes a novel set of algorithms for multi-agent reinforcement learning (MARL) based on the principle of information-directed sampling (IDS). These algorithms draw inspiration from foundational concepts in information theory, and are proven to be sample efficient in MARL settings such as two-player zero-sum Markov games (MGs) and multi-player general-sum MGs. For episodic t…

    Submitted 30 April, 2024; originally announced April 2024.

  15. arXiv:2404.17181  [pdf, other]

    stat.ME

    Consistent information criteria for regularized regression and loss-based learning problems

    Authors: Qingyuan Zhang, Hien Duy Nguyen

    Abstract: Many problems in statistics and machine learning can be formulated as model selection problems, where the goal is to choose an optimal parsimonious model among a set of candidate models. It is typical to conduct model selection by penalizing the objective function via information criteria (IC), as with the pioneering work by Akaike and Schwarz. Via recent work, we propose a generalized IC framewor…

    Submitted 26 April, 2024; originally announced April 2024.

  16. arXiv:2404.06735  [pdf, other]

    stat.ML cs.LG math.ST stat.AP stat.ME

    A Copula Graphical Model for Multi-Attribute Data using Optimal Transport

    Authors: Qi Zhang, Bing Li, Lingzhou Xue

    Abstract: Motivated by modern data forms such as images and multi-view data, the multi-attribute graphical model aims to explore the conditional independence structure among vectors. Under the Gaussian assumption, the conditional independence between vectors is characterized by blockwise zeros in the precision matrix. To relax the restrictive Gaussian assumption, in this paper, we introduce a novel semipara…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 37 pages

  17. arXiv:2404.01436  [pdf, ps, other]

    stat.ML cs.LG math.OC

    Convergence Guarantees for RMSProp and Adam in Generalized-smooth Non-convex Optimization with Affine Noise Variance

    Authors: Qi Zhang, Yi Zhou, Shaofeng Zou

    Abstract: This paper provides the first tight convergence analyses for RMSProp and Adam in non-convex optimization under the most relaxed assumptions of coordinate-wise generalized smoothness and affine noise variance. We first analyze RMSProp, which is a special case of Adam with adaptive learning rates but without first-order momentum. Specifically, to solve the challenges due to dependence among adaptive…

    Submitted 3 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

  18. arXiv:2404.01200  [pdf, other]

    stat.ML cs.LG

    Large-Scale Non-convex Stochastic Constrained Distributionally Robust Optimization

    Authors: Qi Zhang, Yi Zhou, Ashley Prater-Bennette, Lixin Shen, Shaofeng Zou

    Abstract: Distributionally robust optimization (DRO) is a powerful framework for training robust models against data distribution shifts. This paper focuses on constrained DRO, which has an explicit characterization of the robustness level. Existing studies on constrained DRO mostly focus on convex loss function, and exclude the practical and challenging case with non-convex loss function, e.g., neural netw…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: We have corrected Theorem 1 in Sec 4 for AAAI 2024 version, where the order of $n_z$ changes from $ε^{-k_*}$ to $ε^{-2k_*-2}$

  19. arXiv:2403.17882  [pdf, other]

    stat.ME

    On the properties of distance covariance for categorical data: Robustness, sure screening, and approximate null distributions

    Authors: Qingyang Zhang

    Abstract: Pearson's Chi-squared test, though widely used for detecting association between categorical variables, exhibits low statistical power in large sparse contingency tables. To address this limitation, two novel permutation tests have been recently developed: the distance covariance permutation test and the U-statistic permutation test. Both leverage the distance covariance functional but employ diff…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 34 pages, 8 figures

  20. arXiv:2403.12459  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Non-negative Contrastive Learning

    Authors: Yifei Wang, Qi Zhang, Yaoyu Guo, Yisen Wang

    Abstract: Deep representations have shown promising performance when transferred to downstream tasks in a black-box manner. Yet, their inherent lack of interpretability remains a significant challenge, as these features are often opaque to human understanding. In this paper, we propose Non-negative Contrastive Learning (NCL), a renaissance of Non-negative Matrix Factorization (NMF) aimed at deriving interpr…

    Submitted 22 April, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 22 pages. Accepted by ICLR 2024

  21. arXiv:2402.16710  [pdf, other]

    cs.LG stat.ML

    Cost Aware Best Arm Identification

    Authors: Kellen Kanarios, Qining Zhang, Lei Ying

    Abstract: In this paper, we study a best arm identification problem with dual objects. In addition to the classic reward, each arm is associated with a cost distribution and the goal is to identify the largest reward arm using the minimum expected cost. We call it \emph{Cost Aware Best Arm Identification} (CABAI), which captures the separation of testing and implementation phases in product development pipe…

    Submitted 30 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  22. arXiv:2401.05281  [pdf, other]

    stat.ME

    Asymptotic expected sensitivity function and its applications to nonparametric correlation estimators

    Authors: Qingyang Zhang

    Abstract: We introduce a new type of influence function, the asymptotic expected sensitivity function, which is often equivalent to but mathematically more tractable than the traditional one based on the Gateaux derivative. To illustrate, we study the robustness of some important rank correlations, including Spearman's and Kendall's correlations, and the recently developed Chatterjee's correlation.

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: 24 pages, 4 figures

  23. arXiv:2401.03341  [pdf, other]

    cs.LG stat.ML

    Weakly Augmented Variational Autoencoder in Time Series Anomaly Detection

    Authors: Zhangkai Wu, Longbing Cao, Qi Zhang, Junxian Zhou, Hui Chen

    Abstract: Due to their unsupervised training and uncertainty estimation, deep Variational Autoencoders (VAEs) have become powerful tools for reconstruction-based Time Series Anomaly Detection (TSAD). Existing VAE-based TSAD methods, either statistical or deep, tune meta-priors to estimate the likelihood probability for effectively capturing spatiotemporal dependencies in the data. However, these methods con…

    Submitted 6 January, 2024; originally announced January 2024.

  24. arXiv:2311.17797  [pdf, other]

    cs.LG stat.ME

    Learning to Simulate: Generative Metamodeling via Quantile Regression

    Authors: L. Jeff Hong, Yanxi Hou, Qingkai Zhang, Xiaowei Zhang

    Abstract: Stochastic simulation models, while effective in capturing the dynamics of complex systems, are often too slow to run for real-time decision-making. Metamodeling techniques are widely used to learn the relationship between a summary statistic of the outputs (e.g., the mean or quantile) and the inputs of the simulator, so that it can be used in real time. However, this methodology requires the know…

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Main body: 36 pages, 7 figures; supplemental material: 12 pages

  25. arXiv:2311.13767  [pdf, other]

    stat.ME

    Hierarchical False Discovery Rate Control for High-dimensional Survival Analysis with Interactions

    Authors: Weijuan Liang, Qingzhao Zhang, Shuangge Ma

    Abstract: With the development of data collection techniques, analysis with a survival response and high-dimensional covariates has become routine. Here we consider an interaction model, which includes a set of low-dimensional covariates, a set of high-dimensional covariates, and their interactions. This model has been motivated by gene-environment (G-E) interaction analysis, where the E variables have a lo…

    Submitted 22 November, 2023; originally announced November 2023.

  26. arXiv:2311.04158  [pdf, other]

    cs.LG cs.DS stat.ML

    Computing Approximate $\ell_p$ Sensitivities

    Authors: Swati Padmanabhan, David P. Woodruff, Qiuyi Zhang

    Abstract: Recent works in dimensionality reduction for regression tasks have introduced the notion of sensitivity, an estimate of the importance of a specific datapoint in a dataset, offering provable guarantees on the quality of the approximation after removing low-sensitivity datapoints via subsampling. However, fast algorithms for approximating $\ell_p$ sensitivities, which we show is equivalent to appro…

    Submitted 21 November, 2023; v1 submitted 7 November, 2023; originally announced November 2023.

  27. arXiv:2309.08797  [pdf, other]

    stat.ME

    On the Asymptotics of Graph Cut Objectives for Experimental Designs of Network A/B Testing

    Authors: Qiong Zhang

    Abstract: A/B testing is an effective way to assess the potential impacts of two treatments. For A/B tests conducted by IT companies, the test users of A/B testing are often connected and form a social network. The responses of A/B testing can be related to the network connection of test users. This paper discusses the relationship between the design criteria of network A/B testing and graph cut objectives.…

    Submitted 15 September, 2023; originally announced September 2023.

  28. arXiv:2309.00591  [pdf, other]

    cs.LG stat.ML

    Fast and Regret Optimal Best Arm Identification: Fundamental Limits and Low-Complexity Algorithms

    Authors: Qining Zhang, Lei Ying

    Abstract: This paper considers a stochastic Multi-Armed Bandit (MAB) problem with dual objectives: (i) quick identification and commitment to the optimal arm, and (ii) reward maximization throughout a sequence of $T$ consecutive rounds. Though each objective has been individually well-studied, i.e., best arm identification for (i) and regret minimization for (ii), the simultaneous realization of both object…

    Submitted 29 May, 2024; v1 submitted 1 September, 2023; originally announced September 2023.

  29. arXiv:2308.03946  [pdf, other]

    stat.ME

    Regulation-incorporated Gene Expression Network-based Heterogeneity Analysis

    Authors: Rong Li, Qingzhao Zhang, Shuangge Ma

    Abstract: Gene expression-based heterogeneity analysis has been extensively conducted. In recent studies, it has been shown that network-based analysis, which takes a system perspective and accommodates the interconnections among genes, can be more informative than that based on simpler statistics. Gene expressions are highly regulated. Incorporating regulations in analysis can better delineate the "sources…

    Submitted 7 August, 2023; originally announced August 2023.

  30. arXiv:2307.02066  [pdf, ps, other]

    cs.LG stat.ML

    Universal Rates for Multiclass Learning

    Authors: Steve Hanneke, Shay Moran, Qian Zhang

    Abstract: We study universal rates for multiclass classification, establishing the optimal rates (up to log factors) for all hypothesis classes. This generalizes previous results on binary classification (Bousquet, Hanneke, Moran, van Handel, and Yehudayoff, 2021), and resolves an open question studied by Kalavasis, Velegkas, and Karbasi (2022) who handled the multiclass setting with a bounded number of cla…

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 67 pages, accepted to the 36th Annual Conference on Learning Theory (COLT 2023)

  31. Automatic Assessment of Divergent Thinking in Chinese Language with TransDis: A Transformer-Based Language Model Approach

    Authors: Tianchen Yang, Qifan Zhang, Zhaoyang Sun, Yubo Hou

    Abstract: Language models have been increasingly popular for automatic creativity assessment, generating semantic distances to objectively measure the quality of creative ideas. However, there is currently a lack of an automatic assessment system for evaluating creative ideas in the Chinese language. To address this gap, we developed TransDis, a scoring system using transformer-based language models, capabl…

    Submitted 24 December, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

  32. arXiv:2306.05857  [pdf, other]

    stat.ML cs.LG

    How Sparse Can We Prune A Deep Network: A Fundamental Limit Viewpoint

    Authors: Qiaozhe Zhang, Ruijie Zhang, Jun Sun, Yingzhuang Liu

    Abstract: Network pruning is an effective measure to alleviate the storage and computational burden of deep neural networks arising from its high overparameterization. Thus raises a fundamental question: How sparse can we prune a deep network without sacrifice on the performance? To address this problem, in this work we'll take a first principles approach, i.e. we directly impose the sparsity constraint on…

    Submitted 21 February, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

  33. arXiv:2303.04435  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    A Message Passing Perspective on Learning Dynamics of Contrastive Learning

    Authors: Yifei Wang, Qi Zhang, Tianqi Du, Jiansheng Yang, Zhouchen Lin, Yisen Wang

    Abstract: In recent years, contrastive learning achieves impressive results on self-supervised visual representation learning, but there still lacks a rigorous understanding of its learning dynamics. In this paper, we show that if we cast a contrastive objective equivalently into the feature space, then its learning dynamics admits an interpretable form. Specifically, we show that its gradient descent corre…

    Submitted 8 March, 2023; originally announced March 2023.

    Comments: ICLR 2023

  34. arXiv:2303.00288   

    stat.AP

    The Race of mRNA therapy: Evidence from Patent Landscape

    Authors: Jianxiong Ren, Xiaoming Zhang, Xingyong Si, Xiangjun Kong, Jinyu Cong, Pingping Wang, Xiang Li, Qianru Zhang, Peifen Yao, Mengyao Li, Yuanqi Cai, Zhaocai Sun, Kunmeng Liu, Benzheng Wei

    Abstract: mRNA therapy is gaining worldwide attention as an emerging therapeutic approach. The widespread use of mRNA vaccines during the COVID-19 outbreak has demonstrated the potential of mRNA therapy. As mRNA-based drugs have expanded and their indications have broadened, more patents for mRNA innovations have emerged. The global patent landscape for mRNA therapy has not yet been analyzed, indicating a r…

    Submitted 15 March, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: I have received requests from co-authors and funding agencies to withdraw the manuscript

  35. arXiv:2302.10131  [pdf, other]

    stat.ME

    On relationships between Chatterjee's and Spearman's correlation coefficients

    Authors: Qingyang Zhang

    Abstract: In his seminal work, Chatterjee (2021) introduced a novel correlation measure which is distribution-free, asymptotically normal, and consistent against all alternatives. In this paper, we study the probabilistic relationships between Chatterjee's correlation and the widely used Spearman's correlation. We show that, under independence, the two sample-based correlations are asymptotically joint norm…

    Submitted 20 February, 2023; originally announced February 2023.

  36. arXiv:2301.03705  [pdf, ps, other]

    stat.ME

    Locally sparse quantile estimation for a partially functional interaction model

    Authors: Weijuan Liang, Qingzhao Zhang, Shuangge Ma

    Abstract: Functional data analysis has been extensively conducted. In this study, we consider a partially functional model, under which some covariates are scalars and have linear effects, while some other variables are functional and have unspecified nonlinear effects. Significantly advancing from the existing literature, we consider a model with interactions between the functional and scalar covariates. T…

    Submitted 9 January, 2023; originally announced January 2023.

    Comments: 24 pages, 5 figures

    MSC Class: 62R10 ACM Class: G.3

  37. arXiv:2211.15355  [pdf, other]

    cs.LG stat.ML

    Causal Deep Reinforcement Learning Using Observational Data

    Authors: Wenxuan Zhu, Chao Yu, Qiang Zhang

    Abstract: Deep reinforcement learning (DRL) requires the collection of interventional data, which is sometimes expensive and even unethical in the real world, such as in the autonomous driving and the medical field. Offline reinforcement learning promises to alleviate this issue by exploiting the vast amount of observational data available in the real world. However, observational data may mislead the learn…

    Submitted 9 June, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

  38. arXiv:2211.12685  [pdf, other]

    stat.ML cs.IT cs.LG math.OC

    Mutual Information Learned Regressor: an Information-theoretic Viewpoint of Training Regression Systems

    Authors: Jirong Yi, Qiaosheng Zhang, Zhen Chen, Qiao Liu, Wei Shao, Yusen He, Yaohua Wang

    Abstract: As one of the central tasks in machine learning, regression finds lots of applications in different fields. An existing common practice for solving regression problems is the mean square error (MSE) minimization approach or its regularized variants which require prior knowledge about the models. Recently, Yi et al., proposed a mutual information based supervised learning framework where they intro…

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: 28 pages, 2 figures, presubmitted to AISTATS2023 for reviewing

  39. arXiv:2211.10837  [pdf, other]

    cs.LG stat.CO

    Non-reversible Parallel Tempering for Deep Posterior Approximation

    Authors: Wei Deng, Qian Zhang, Qi Feng, Faming Liang, Guang Lin

    Abstract: Parallel tempering (PT), also known as replica exchange, is the go-to workhorse for simulations of multi-modal distributions. The key to the success of PT is to adopt efficient swap schemes. The popular deterministic even-odd (DEO) scheme exploits the non-reversibility property and has successfully reduced the communication cost from $O(P^2)$ to $O(P)$ given sufficiently many $P$ chains. However,…

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: Accepted by AAAI 2023

  40. arXiv:2211.02781  [pdf, other]

    stat.ME

    Heterogeneity-aware Clustered Distributed Learning for Multi-source Data Analysis

    Authors: Yuanxing Chen, Qingzhao Zhang, Shuangge Ma, Kuangnan Fang

    Abstract: In diverse fields ranging from finance to omics, it is increasingly common that data is distributed and with multiple individual sources (referred to as ``clients'' in some studies). Integrating raw data, although powerful, is often not feasible, for example, when there are considerations on privacy protection. Distributed learning techniques have been developed to integrate summary statistics as…

    Submitted 4 November, 2022; originally announced November 2022.

  41. arXiv:2210.08495  [pdf, other]

    cs.NE cs.AI cs.LG stat.ML

    Pareto Set Learning for Expensive Multi-Objective Optimization

    Authors: Xi Lin, Zhiyuan Yang, Xiaoyuan Zhang, Qingfu Zhang

    Abstract: Expensive multi-objective optimization problems can be found in many real-world applications, where their objective function evaluations involve expensive computations or physical experiments. It is desirable to obtain an approximate Pareto front with a limited evaluation budget. Multi-objective Bayesian optimization (MOBO) has been widely used for finding a finite set of Pareto optimal solutions.…

    Submitted 16 October, 2022; originally announced October 2022.

    Comments: To appear in 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  42. arXiv:2209.10058  [pdf, other]

    cs.LG cs.CV cs.IT stat.ML

    Mutual Information Learned Classifiers: an Information-theoretic Viewpoint of Training Deep Learning Classification Systems

    Authors: Jirong Yi, Qiaosheng Zhang, Zhen Chen, Qiao Liu, Wei Shao

    Abstract: Deep learning systems have been reported to achieve state-of-the-art performances in many applications, and a key is the existence of well trained classifiers on benchmark datasets. As a main-stream loss function, the cross entropy can easily lead us to find models which demonstrate severe overfitting behavior. In this paper, we show that the existing cross entropy loss minimization problem essent…

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: 22 pages, 17 figures, 3 tables, 5 theorems

  43. arXiv:2208.01713  [pdf, other]

    math.ST stat.ME

    On optimal block resampling for Gaussian-subordinated long-range dependent processes

    Authors: Qihao Zhang, Soumendra N. Lahiri, Daniel J. Nordman

    Abstract: Block-based resampling estimators have been intensively investigated for weakly dependent time processes, which has helped to inform implementation (e.g., best block sizes). However, little is known about resampling performance and block sizes under strong or long-range dependence. To establish guideposts in block selection, we consider a broad class of strongly dependent time processes, formed by…

    Submitted 2 August, 2022; originally announced August 2022.

    MSC Class: Primary 62G09; secondary 62G20; 62M10

  44. arXiv:2207.04613  [pdf, other]

    stat.ME math.ST stat.ML

    Nonlinear Sufficient Dimension Reduction for Distribution-on-Distribution Regression

    Authors: Qi Zhang, Bing Li, Lingzhou Xue

    Abstract: We introduce a new approach to nonlinear sufficient dimension reduction in cases where both the predictor and the response are distributional data, modeled as members of a metric space. Our key step is to build universal kernels (cc-universal) on the metric spaces, which results in reproducing kernel Hilbert spaces for the predictor and response that are rich enough to characterize the conditional…

    Submitted 24 April, 2023; v1 submitted 11 July, 2022; originally announced July 2022.

    Comments: 36 pages

  45. arXiv:2206.04733  [pdf, other]

    stat.AP eess.SY math.ST

    On Low-Complexity Quickest Intervention of Mutated Diffusion Processes Through Local Approximation

    Authors: Qining Zhang, Honghao Wei, Weina Wang, Lei Ying

    Abstract: We consider the problem of controlling a mutated diffusion process with an unknown mutation time. The problem is formulated as the quickest intervention problem with the mutation modeled by a change-point, which is a generalization of the quickest change-point detection (QCD). Our goal is to intervene in the mutated process as soon as possible while maintaining a low intervention cost with optimal…

    Submitted 9 June, 2022; originally announced June 2022.

  46. arXiv:2205.13320  [pdf, other]

    cs.LG cs.AI stat.ML

    Towards Learning Universal Hyperparameter Optimizers with Transformers

    Authors: Yutian Chen, Xingyou Song, Chansoo Lee, Zi Wang, Qiuyi Zhang, David Dohan, Kazuya Kawakami, Greg Kochanski, Arnaud Doucet, Marc'aurelio Ranzato, Sagi Perel, Nando de Freitas

    Abstract: Meta-learning hyperparameter optimization (HPO) algorithms from prior experiments is a promising approach to improve optimization efficiency over objective functions from a similar distribution. However, existing methods are restricted to learning from experiments sharing the same set of hyperparameters. In this paper, we introduce the OptFormer, the first text-based Transformer HPO framework that…

    Submitted 13 October, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: Published as a conference paper in Neural Information Processing Systems (NeurIPS) 2022. Code can be found at https://github.com/google-research/optformer and the Google AI Blog post can be found at https://ai.googleblog.com/2022/08/optformer-towards-universal.html

  47. arXiv:2205.01769  [pdf, other]

    stat.ME

    On the asymptotic distribution of the symmetrized Chatterjee's correlation coefficient

    Authors: Qingyang Zhang

    Abstract: Chatterjee (2021) introduced an asymmetric correlation measure that has attracted much attention over the past year. In this paper, we derive the asymptotic distribution of the symmetric version of Chatterjee's correlation, and suggest a finite sample test for independence.

    Submitted 1 June, 2022; v1 submitted 3 May, 2022; originally announced May 2022.

    Comments: 4 figures

    MSC Class: 62E20

  48. arXiv:2203.13457  [pdf, other]

    cs.LG cs.CV stat.ML

    Chaos is a Ladder: A New Theoretical Understanding of Contrastive Learning via Augmentation Overlap

    Authors: Yifei Wang, Qi Zhang, Yisen Wang, Jiansheng Yang, Zhouchen Lin

    Abstract: Recently, contrastive learning has risen to be a promising approach for large-scale self-supervised learning. However, theoretical understanding of how it works is still unclear. In this paper, we propose a new guarantee on the downstream performance without resorting to the conditional independence assumption that is widely adopted in previous work but hardly holds in practice. Our new theory hin…

    Submitted 27 May, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: Accepted by ICLR 2022

  49. arXiv:2112.05120  [pdf, other]

    stat.ML cs.LG

    On Convergence of Federated Averaging Langevin Dynamics

    Authors: Wei Deng, Qian Zhang, Yi-An Ma, Zhao Song, Guang Lin

    Abstract: We propose a federated averaging Langevin algorithm (FA-LD) for uncertainty quantification and mean predictions with distributed clients. In particular, we generalize beyond normal posterior distributions and consider a general class of models. We develop theoretical guarantees for FA-LD for strongly log-concave distributions with non-i.i.d. data and study how the injected noise and the stochastic-…

    Submitted 5 October, 2023; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: A polished proof without the federated formulation of Langevin diffusion to avoid confusion

  50. arXiv:2110.12490  [pdf, other]

    cs.IR stat.AP

    Paperfetcher: A tool to automate handsearch for systematic reviews

    Authors: Akash Pallath, Qiyang Zhang

    Abstract: Handsearch is an important technique that contributes to thorough literature search in systematic reviews. Traditional handsearch requires reviewers to systematically browse through each issue of a curated list of field-specific journals and conference proceedings to find articles relevant to their review. This manual process is not only time-consuming, laborious, costly, and error-prone, but it a…

    Submitted 6 January, 2022; v1 submitted 24 October, 2021; originally announced October 2021.