Zum Hauptinhalt springen

Showing 1–50 of 63 results for author: Wu, Q

Searching in archive stat. Search in all archives.
.
  1. arXiv:2408.05765  [pdf, other

    cs.LG stat.ML

    Scalable and Adaptive Spectral Embedding for Attributed Graph Clustering

    Authors: Yunhui Liu, Tieke He, Qing Wu, Tao Zheng, Jianhua Zhao

    Abstract: Attributed graph clustering, which aims to group the nodes of an attributed graph into disjoint clusters, has made promising advancements in recent years. However, most existing methods face challenges when applied to large graphs due to the expensive computational cost and high memory usage. In this paper, we introduce Scalable and Adaptive Spectral Embedding (SASE), a simple attributed graph clu… ▽ More

    Submitted 11 August, 2024; originally announced August 2024.

    Comments: Accepted by CIKM 2024 (Short Paper)

  2. arXiv:2402.18392  [pdf, other

    cs.LG cs.AI econ.EM stat.ML

    Unveiling the Potential of Robustness in Evaluating Causal Inference Models

    Authors: Yiyan Huang, Cheuk Hang Leung, Siyi Wang, Yijun Li, Qi Wu

    Abstract: The growing demand for personalized decision-making has led to a surge of interest in estimating the Conditional Average Treatment Effect (CATE). The intersection of machine learning and causal inference has yielded various effective CATE estimators. However, deploying these estimators in practice is often hindered by the absence of counterfactual labels, making it challenging to select the desira… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  3. arXiv:2402.14368  [pdf, other

    stat.AP

    Parsimonious Generative Machine Learning for Non-Gaussian Tail Modeling and Risk-Neutral Distribution Extraction

    Authors: Qi Wu, Zhonghao Xian, Xing Yan, Nan Yang

    Abstract: In financial modeling problems, non-Gaussian tails exist widely in many circumstances. Among them, the accurate estimation of risk-neutral distribution (RND) from option prices is of great importance for researchers and practitioners. A precise RND can provide valuable information regarding the market's expectations, and can further help empirical asset pricing studies. This paper presents a parsi… ▽ More

    Submitted 4 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  4. arXiv:2312.10388  [pdf, other

    stat.ME cs.AI q-fin.GN

    The Causal Impact of Credit Lines on Spending Distributions

    Authors: Yijun Li, Cheuk Hang Leung, Xiangqian Sun, Chaoqun Wang, Yiyan Huang, Xing Yan, Qi Wu, Dongdong Wang, Zhixiang Huang

    Abstract: Consumer credit services offered by e-commerce platforms provide customers with convenient loan access during shopping and have the potential to stimulate sales. To understand the causal impact of credit lines on spending, previous studies have employed causal estimators, based on direct regression (DR), inverse propensity weighting (IPW), and double machine learning (DML) to estimate the treatmen… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

  5. arXiv:2310.05308  [pdf, other

    cs.LG cs.DS stat.ML

    Adversarial Attacks on Combinatorial Multi-Armed Bandits

    Authors: Rishab Balasubramanian, Jiawei Li, Prasad Tadepalli, Huazheng Wang, Qingyun Wu, Haoyu Zhao

    Abstract: We study reward poisoning attacks on Combinatorial Multi-armed Bandits (CMAB). We first provide a sufficient and necessary condition for the attackability of CMAB, a notion to capture the vulnerability and robustness of CMAB. The attackability condition depends on the intrinsic properties of the corresponding CMAB instance such as the reward distributions of super arms and outcome distributions of… ▽ More

    Submitted 3 June, 2024; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: 28 pages, Accepted to ICML 2024

  6. arXiv:2306.07761  [pdf, other

    cs.LG stat.ML

    Multi-Fidelity Multi-Armed Bandits Revisited

    Authors: Xuchuang Wang, Qingyun Wu, Wei Chen, John C. S. Lui

    Abstract: We study the multi-fidelity multi-armed bandit (MF-MAB), an extension of the canonical multi-armed bandit (MAB) problem. MF-MAB allows each arm to be pulled with different costs (fidelities) and observation accuracy. We study both the best arm identification with fixed confidence (BAI) and the regret minimization objectives. For BAI, we present (a) a cost complexity lower bound, (b) an algorithmic… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

  7. arXiv:2306.01337  [pdf, other

    cs.CL stat.ML

    MathChat: Converse to Tackle Challenging Math Problems with LLM Agents

    Authors: Yiran Wu, Feiran Jia, Shaokun Zhang, Hangyu Li, Erkang Zhu, Yue Wang, Yin Tat Lee, Richard Peng, Qingyun Wu, Chi Wang

    Abstract: Employing Large Language Models (LLMs) to address mathematical problems is an intriguing research endeavor, considering the abundance of math problems expressed in natural language across numerous science and engineering fields. LLMs, with their generalized ability, are used as a foundation model to build AI agents for different tasks. In this paper, we study the effectiveness of utilizing LLM age… ▽ More

    Submitted 28 June, 2024; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: Update version

  8. arXiv:2304.00739  [pdf, other

    stat.AP

    Two-sample test of sparse stochastic block models

    Authors: Qianyong Wu, Jiang Hu

    Abstract: The paper discusses a statistical problem related to testing for differences between two sparse networks with community structures. The community-wise edge probability matrices have entries of order $O(n^{-1}/\log n)$, where $n$ represents the size of the network. The authors propose a test statistic that combines a method proposed by Wu et al. \cite{WuTwoSampleSBM2022} and a resampling process. T… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  9. arXiv:2303.14508  [pdf, other

    stat.ME

    A spectral based goodness-of-fit test for stochastic block models

    Authors: Qianyong Wu, Jiang Hu

    Abstract: Community detection is a fundamental problem in complex network data analysis. Though many methods have been proposed, most existing methods require the number of communities to be the known parameter, which is not in practice. In this paper, we propose a novel goodness-of-fit test for the stochastic block model. The test statistic is based on the linear spectral of the adjacency matrix. Under the… ▽ More

    Submitted 20 May, 2024; v1 submitted 25 March, 2023; originally announced March 2023.

  10. A Contextual Bandit Approach for Value-oriented Prediction Interval Forecasting

    Authors: Yufan Zhang, Honglin Wen, Qiuwei Wu

    Abstract: Prediction interval (PI) is an effective tool to quantify uncertainty and usually serves as an input to downstream robust optimization. Traditional approaches focus on improving the quality of PI in the view of statistical scores and assume the improvement in quality will lead to a higher value in the power systems operation. However, such an assumption cannot always hold in practice. In this pape… ▽ More

    Submitted 12 February, 2023; v1 submitted 8 October, 2022; originally announced October 2022.

    Comments: the revision to IEEE Transactions on Smart Grid

  11. arXiv:2209.01956  [pdf, other

    cs.LG cs.AI stat.ME

    Moderately-Balanced Representation Learning for Treatment Effects with Orthogonality Information

    Authors: Yiyan Huang, Cheuk Hang Leung, Shumin Ma, Qi Wu, Dongdong Wang, Zhixiang Huang

    Abstract: Estimating the average treatment effect (ATE) from observational data is challenging due to selection bias. Existing works mainly tackle this challenge in two ways. Some researchers propose constructing a score function that satisfies the orthogonal condition, which guarantees that the established ATE estimator is "orthogonal" to be more robust. The others explore representation learning models to… ▽ More

    Submitted 5 September, 2022; originally announced September 2022.

    Comments: This paper was accepted and will be published at the 19th Pacific Rim International Conference on Artificial Intelligence (PRICAI2022)

  12. arXiv:2209.01805  [pdf, other

    econ.EM q-fin.RM stat.ME stat.ML

    Robust Causal Learning for the Estimation of Average Treatment Effects

    Authors: Yiyan Huang, Cheuk Hang Leung, Xing Yan, Qi Wu, Shumin Ma, Zhiri Yuan, Dongdong Wang, Zhixiang Huang

    Abstract: Many practical decision-making problems in economics and healthcare seek to estimate the average treatment effect (ATE) from observational data. The Double/Debiased Machine Learning (DML) is one of the prevalent methods to estimate ATE in the observational study. However, the DML estimators can suffer an error-compounding issue and even give an extreme estimate when the propensity scores are missp… ▽ More

    Submitted 5 September, 2022; originally announced September 2022.

    Comments: This paper was accepted and will be published at The 2022 International Joint Conference on Neural Networks (IJCNN2022). arXiv admin note: substantial text overlap with arXiv:2103.11869

  13. arXiv:2208.07573  [pdf, other

    stat.ME math.ST stat.ML

    Higher-order accurate two-sample network inference and network hashing

    Authors: Meijia Shao, Dong Xia, Yuan Zhang, Qiong Wu, Shuo Chen

    Abstract: Two-sample hypothesis testing for network comparison presents many significant challenges, including: leveraging repeated network observations and known node registration, but without requiring them to operate; relaxing strong structural assumptions; achieving finite-sample higher-order accuracy; handling different network sizes and sparsity levels; fast computation and memory parsimony; controlli… ▽ More

    Submitted 2 February, 2024; v1 submitted 16 August, 2022; originally announced August 2022.

  14. arXiv:2206.14846  [pdf, other

    cs.LG cs.SI stat.ML

    Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization

    Authors: Kaixuan Huang, Yu Wu, Xuezhou Zhang, Shenyinying Tu, Qingyun Wu, Mengdi Wang, Huazheng Wang

    Abstract: Online influence maximization aims to maximize the influence spread of a content in a social network with unknown network model by selecting a few seed nodes. Recent studies followed a non-adaptive setting, where the seed nodes are selected before the start of the diffusion process and network parameters are updated when the diffusion stops. We consider an adaptive version of content-dependent onl… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

  15. arXiv:2205.08698  [pdf, other

    stat.AP cs.AI cs.LG eess.SY

    Optimal Adaptive Prediction Intervals for Electricity Load Forecasting in Distribution Systems via Reinforcement Learning

    Authors: Yufan Zhang, Honglin Wen, Qiuwei Wu, Qian Ai

    Abstract: Prediction intervals offer an effective tool for quantifying the uncertainty of loads in distribution systems. The traditional central PIs cannot adapt well to skewed distributions, and their offline training fashion is vulnerable to unforeseen changes in future load patterns. Therefore, we propose an optimal PI estimation approach, which is online and adaptive to different data distributions by a… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: revision to IEEE Transactions on Smart Grid

  16. arXiv:2205.02936  [pdf, other

    stat.AP stat.ME

    Station-wise statistical joint assessment of wind speed and direction under future climates across the United States

    Authors: Qiuyi Wu, Julie Bessac, Whitney Huang, Jiali Wang

    Abstract: This study develops a statistical conditional approach to evaluate climate model performance in wind speed and direction and to project their future changes under the representative concentration pathway 8.5 scenario over inland and offshore locations across the Continental United States. The proposed conditional approach extends the scope of existing studies by characterizing the changes of the f… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

  17. arXiv:2108.06201  [pdf, other

    stat.ML cs.LG stat.CO

    Data-driven advice for interpreting local and global model predictions in bioinformatics problems

    Authors: Markus Loecher, Qi Wu

    Abstract: Tree-based algorithms such as random forests and gradient boosted trees continue to be among the most popular and powerful machine learning models used across multiple disciplines. The conventional wisdom of estimating the impact of a feature in tree based models is to measure the \textit{node-wise reduction of a loss function}, which (i) yields only global importance measures and (ii) is known to… ▽ More

    Submitted 30 December, 2021; v1 submitted 13 August, 2021; originally announced August 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2003.12043. text overlap with arXiv:1905.04610 by other authors

  18. arXiv:2107.09629  [pdf, other

    q-fin.TR stat.ME

    Order Book Queue Hawkes-Markovian Modeling

    Authors: Philip Protter, Qianfan Wu, Shihao Yang

    Abstract: This article presents a Hawkes process model with Markovian baseline intensities for high-frequency order book data modeling. We classify intraday order book trading events into a range of categories based on their order types and the price changes after their arrivals. To capture the stimulating effects between multiple types of order book events, we use the multivariate Hawkes process to model t… ▽ More

    Submitted 5 January, 2022; v1 submitted 20 July, 2021; originally announced July 2021.

    Comments: 71 pages, 80 figures

    MSC Class: 62P05 (Primary) 62G05 (Secondary)

  19. arXiv:2103.11869  [pdf, other

    stat.ML cs.LG econ.EM q-fin.ST

    Robust Orthogonal Machine Learning of Treatment Effects

    Authors: Yiyan Huang, Cheuk Hang Leung, Qi Wu, Xing Yan

    Abstract: Causal learning is the key to obtaining stable predictions and answering \textit{what if} problems in decision-makings. In causal learning, it is central to seek methods to estimate the average treatment effect (ATE) from observational data. The Double/Debiased Machine Learning (DML) is one of the prevalent methods to estimate ATE. However, the DML estimators can suffer from an \textit{error-compo… ▽ More

    Submitted 5 December, 2022; v1 submitted 22 March, 2021; originally announced March 2021.

  20. arXiv:2012.09448  [pdf, other

    q-fin.RM stat.ME stat.ML

    The Causal Learning of Retail Delinquency

    Authors: Yiyan Huang, Cheuk Hang Leung, Xing Yan, Qi Wu, Nanbo Peng, Dongdong Wang, Zhixiang Huang

    Abstract: This paper focuses on the expected difference in borrower's repayment when there is a change in the lender's credit decisions. Classical estimators overlook the confounding effects and hence the estimation error can be magnificent. As such, we propose another approach to construct the estimators such that the error can be greatly reduced. The proposed estimators are shown to be unbiased, consisten… ▽ More

    Submitted 17 December, 2020; originally announced December 2020.

    Comments: This paper was accepted and will be published in the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI-21)

  21. arXiv:2009.14250  [pdf, ps, other

    cs.LG stat.ML

    A Framework of Learning Through Empirical Gain Maximization

    Authors: Yunlong Feng, Qiang Wu

    Abstract: We develop in this paper a framework of empirical gain maximization (EGM) to address the robust regression problem where heavy-tailed noise or outliers may present in the response variable. The idea of EGM is to approximate the density function of the noise distribution instead of approximating the truth function directly as usual. Unlike the classical maximum likelihood estimation that encourages… ▽ More

    Submitted 11 January, 2021; v1 submitted 29 September, 2020; originally announced September 2020.

  22. arXiv:2009.13249  [pdf, other

    cs.IR cs.LG stat.ML

    Interest-Behaviour Multiplicative Network for Resource-limited Recommendation

    Authors: Qianliang Wu, Tong Zhang, Zhen Cui, Jian Yang

    Abstract: Resource constraints, e.g. limited product inventory or financial strength, may affect consumers' choices or preferences in some recommendation tasks but are usually ignored in previous recommendation methods. In this paper, we aim to mine the cue of user preferences in resource-limited recommendation tasks, for which purpose we specifically build a large used car transaction dataset possessing re… ▽ More

    Submitted 11 November, 2020; v1 submitted 24 September, 2020; originally announced September 2020.

  23. arXiv:2009.12755  [pdf, ps, other

    math.ST cs.LG stat.ML

    A Statistical Learning Assessment of Huber Regression

    Authors: Yunlong Feng, Qiang Wu

    Abstract: As one of the triumphs and milestones of robust statistics, Huber regression plays an important role in robust inference and estimation. It has also been finding a great variety of applications in machine learning. In a parametric setup, it has been extensively studied. However, in the statistical learning context where a function is typically learned in a nonparametric way, there is still a lack… ▽ More

    Submitted 27 September, 2020; originally announced September 2020.

  24. arXiv:2009.02463  [pdf, other

    cs.LG stat.ML

    Unifying Clustered and Non-stationary Bandits

    Authors: Chuanhao Li, Qingyun Wu, Hongning Wang

    Abstract: Non-stationary bandits and online clustering of bandits lift the restrictive assumptions in contextual bandits and provide solutions to many important real-world scenarios. Though the essence in solving these two problems overlaps considerably, they have been studied independently. In this paper, we connect these two strands of bandit research under the notion of test of homogeneity, which seamles… ▽ More

    Submitted 5 September, 2020; originally announced September 2020.

    Comments: 26 pages, 3 figures

  25. arXiv:2008.00942  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    Improving Generative Adversarial Networks with Local Coordinate Coding

    Authors: Jiezhang Cao, Yong Guo, Qingyao Wu, Chunhua Shen, Junzhou Huang, Mingkui Tan

    Abstract: Generative adversarial networks (GANs) have shown remarkable success in generating realistic data from some predefined prior distribution (e.g., Gaussian noises). However, such prior distribution is often independent of real data and thus may lose semantic information (e.g., geometric structure or content in images) of data. In practice, the semantic information might be represented by some latent… ▽ More

    Submitted 28 July, 2020; originally announced August 2020.

    Comments: 20 pages, 5 figures

  26. arXiv:2008.00123  [pdf, other

    cs.LG stat.ML

    Noise-Response Analysis of Deep Neural Networks Quantifies Robustness and Fingerprints Structural Malware

    Authors: N. Benjamin Erichson, Dane Taylor, Qixuan Wu, Michael W. Mahoney

    Abstract: The ubiquity of deep neural networks (DNNs), cloud-based training, and transfer learning is giving rise to a new cybersecurity frontier in which unsecure DNNs have `structural malware' (i.e., compromised weights and activation pathways). In particular, DNNs can be designed to have backdoors that allow an adversary to easily and reliably fool an image classifier by adding a pattern of pixels called… ▽ More

    Submitted 3 February, 2021; v1 submitted 31 July, 2020; originally announced August 2020.

    Comments: 9 pages, 7 figures, accepted to the SIAM International Conference on Data Mining (SDM 21)

  27. arXiv:2006.16744  [pdf, other

    cs.LG cs.DC math.ST stat.ML

    Optimal Rates of Distributed Regression with Imperfect Kernels

    Authors: Hongwei Sun, Qiang Wu

    Abstract: Distributed machine learning systems have been receiving increasing attentions for their efficiency to process large scale data. Many distributed frameworks have been proposed for different machine learning tasks. In this paper, we study the distributed kernel regression via the divide and conquer approach. This approach has been proved asymptotically minimax optimal if the kernel is perfectly sel… ▽ More

    Submitted 30 June, 2020; originally announced June 2020.

    Comments: 2 figures

    MSC Class: 68T05; 68Q32; 68W15

  28. arXiv:2005.12979  [pdf, other

    cs.IR cs.LG cs.SI stat.ML

    Seamlessly Unifying Attributes and Items: Conversational Recommendation for Cold-Start Users

    Authors: Shijun Li, Wenqiang Lei, Qingyun Wu, Xiangnan He, Peng Jiang, Tat-Seng Chua

    Abstract: Static recommendation methods like collaborative filtering suffer from the inherent limitation of performing real-time personalization for cold-start users. Online recommendation, e.g., multi-armed bandit approach, addresses this limitation by interactively exploring user preference online and pursuing the exploration-exploitation (EE) trade-off. However, existing bandit-based methods model recomm… ▽ More

    Submitted 5 October, 2022; v1 submitted 23 May, 2020; originally announced May 2020.

    Comments: TOIS 2021

    ACM Class: I.2.6

  29. arXiv:2005.01571  [pdf, other

    cs.LG stat.ML

    Frugal Optimization for Cost-related Hyperparameters

    Authors: Qingyun Wu, Chi Wang, Silu Huang

    Abstract: The increasing demand for democratizing machine learning algorithms calls for hyperparameter optimization (HPO) solutions at low cost. Many machine learning algorithms have hyperparameters which can cause a large variation in the training cost. But this effect is largely ignored in existing HPO methods, which are incapable to properly control cost during the optimization process. To address this p… ▽ More

    Submitted 22 December, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: 29 pages (including supplementary appendix)

  30. arXiv:2003.03477  [pdf, other

    cs.LG cs.DC stat.ML

    ShadowSync: Performing Synchronization in the Background for Highly Scalable Distributed Training

    Authors: Qinqing Zheng, Bor-Yiing Su, Jiyan Yang, Alisson Azzolini, Qiang Wu, Ou Jin, Shri Karandikar, Hagay Lupesko, Liang Xiong, Eric Zhou

    Abstract: Recommendation systems are often trained with a tremendous amount of data, and distributed training is the workhorse to shorten the training time. While the training throughput can be increased by simply adding more workers, it is also increasingly challenging to preserve the model quality. In this paper, we present \shadowsync, a distributed framework specifically tailored to modern scale recomme… ▽ More

    Submitted 23 February, 2021; v1 submitted 6 March, 2020; originally announced March 2020.

  31. arXiv:2003.03051  [pdf, other

    cs.LG stat.ML

    Cost-Sensitive Portfolio Selection via Deep Reinforcement Learning

    Authors: Yifan Zhang, Peilin Zhao, Qingyao Wu, Bin Li, Junzhou Huang, Mingkui Tan

    Abstract: Portfolio Selection is an important real-world financial task and has attracted extensive attention in artificial intelligence communities. This task, however, has two main difficulties: (i) the non-stationary price series and complex asset correlations make the learning of feature representation very hard; (ii) the practicality principle in financial markets requires controlling both transaction… ▽ More

    Submitted 6 March, 2020; originally announced March 2020.

    Journal ref: IEEE Transactions on Knowledge and Data Engineering (TKDE), 2020

  32. Nonparametric Bayesian Two-Level Clustering for Subject-Level Single-Cell Expression Data

    Authors: Qiuyu Wu, Xiangyu Luo

    Abstract: The advent of single-cell sequencing opens new avenues for personalized treatment. In this paper, we address a two-level clustering problem of simultaneous subject subgroup discovery (subject level) and cell type detection (cell level) for single-cell expression data from multiple subjects. However, current statistical approaches either cluster cells without considering the subject heterogeneity o… ▽ More

    Submitted 18 February, 2021; v1 submitted 17 December, 2019; originally announced December 2019.

  33. arXiv:1911.07498  [pdf, other

    cs.LG stat.ML

    Online Adaptive Asymmetric Active Learning with Limited Budgets

    Authors: Yifan Zhang, Peilin Zhao, Shuaicheng Niu, Qingyao Wu, Jiezhang Cao, Junzhou Huang, Mingkui Tan

    Abstract: Online Active Learning (OAL) aims to manage unlabeled datastream by selectively querying the label of data. OAL is applicable to many real-world problems, such as anomaly detection in health-care and finance. In these problems, there are two key challenges: the query budget is often limited; the ratio between classes is highly imbalanced. In practice, it is quite difficult to handle imbalanced unl… ▽ More

    Submitted 18 November, 2019; originally announced November 2019.

    Journal ref: IEEE Transactions on Knowledge and Data Engineering (TKDE), 2019

  34. arXiv:1911.07293  [pdf, other

    cs.LG stat.ML

    Collaborative Unsupervised Domain Adaptation for Medical Image Diagnosis

    Authors: Yifan Zhang, Ying Wei, Peilin Zhao, Shuaicheng Niu, Qingyao Wu, Mingkui Tan, Junzhou Huang

    Abstract: Deep learning based medical image diagnosis has shown great potential in clinical medicine. However, it often suffers two major difficulties in practice: 1) only limited labeled samples are available due to expensive annotation costs over medical images; 2) labeled images may contain considerable label noises (e.g., mislabeling labels) due to diagnostic difficulties. In this paper, we seek to expl… ▽ More

    Submitted 17 November, 2019; originally announced November 2019.

    Comments: Medical Imaging meets NeurIPS, 2019

  35. arXiv:1911.04706  [pdf, other

    cs.LG stat.ML

    FLAML: A Fast and Lightweight AutoML Library

    Authors: Chi Wang, Qingyun Wu, Markus Weimer, Erkang Zhu

    Abstract: We study the problem of using low computational cost to automate the choices of learners and hyperparameters for an ad-hoc training dataset and error metric, by conducting trials of different configurations on the given training data. We investigate the joint impact of multiple factors on both trial cost and model error, and propose several design guidelines. Following them, we build a fast and li… ▽ More

    Submitted 18 May, 2021; v1 submitted 12 November, 2019; originally announced November 2019.

    Comments: 14 pages, published in Fourth Conference on Machine Learning and Systems (MLSys 2021)

  36. arXiv:1910.12469  [pdf, other

    cs.LG stat.ML

    Learning Latent Process from High-Dimensional Event Sequences via Efficient Sampling

    Authors: Qitian Wu, Zixuan Zhang, Xiaofeng Gao, Junchi Yan, Guihai Chen

    Abstract: We target modeling latent dynamics in high-dimension marked event sequences without any prior knowledge about marker relations. Such problem has been rarely studied by previous works which would have fundamental difficulty to handle the arisen challenges: 1) the high-dimensional markers and unknown relation network among them pose intractable obstacles for modeling the latent dynamic process; 2) o… ▽ More

    Submitted 28 October, 2019; originally announced October 2019.

  37. arXiv:1909.13035  [pdf, other

    cs.LG stat.ML

    Bridging Explicit and Implicit Deep Generative Models via Neural Stein Estimators

    Authors: Qitian Wu, Rui Gao, Hongyuan Zha

    Abstract: There are two types of deep generative models: explicit and implicit. The former defines an explicit density form that allows likelihood inference; while the latter targets a flexible transformation from random noise to generated samples. While the two classes of generative models have shown great power in many applications, both of them, when used alone, suffer from respective limitations and dra… ▽ More

    Submitted 26 October, 2021; v1 submitted 28 September, 2019; originally announced September 2019.

    Comments: Accepted by NeurIPS2021 main conference

  38. arXiv:1909.10101  [pdf, other

    stat.AP

    IFAA: Robust association identification and Inference For Absolute Abundance in microbiome analyses

    Authors: Zhigang Li, Lu Tian, A. James O'Malley, Margaret R. Karagas, Anne G. Hoen, Brock C. Christensen, Juliette C. Madan, Quran Wu, Raad Z. Gharaibeh, Christian Jobin, Hongzhe Li

    Abstract: The target of inference in microbiome analyses is usually relative abundance (RA) because RA in a sample (e.g., stool) can be considered as an approximation of RA in an entire ecosystem (e.g., gut). However, inference on RA suffers from the fact that RA are calculated by dividing absolute abundances (AA) over the common denominator (CD), the summation of all AA (i.e., library size). Because of tha… ▽ More

    Submitted 10 October, 2019; v1 submitted 22 September, 2019; originally announced September 2019.

    Comments: Corresponding email: [email protected]

  39. arXiv:1908.09207  [pdf, ps, other

    cs.LG stat.ML

    Demystifying the MLPerf Benchmark Suite

    Authors: Snehil Verma, Qinzhe Wu, Bagus Hanindhito, Gunjan Jha, Eugene B. John, Ramesh Radhakrishnan, Lizy K. John

    Abstract: MLPerf, an emerging machine learning benchmark suite strives to cover a broad range of applications of machine learning. We present a study on its characteristics and how the MLPerf benchmarks differ from some of the previous deep learning benchmarks like DAWNBench and DeepBench. We find that application benchmarks such as MLPerf (although rich in kernels) exhibit different features compared to ke… ▽ More

    Submitted 24 August, 2019; originally announced August 2019.

  40. Model Adaptation via Model Interpolation and Boosting for Web Search Ranking

    Authors: Jianfeng Gao, Qiang Wu, Chris Burges, Krysta Svore, Yi Su, Nazan Khan, Shalin Shah, Hongyan Zhou

    Abstract: This paper explores two classes of model adaptation methods for Web search ranking: Model Interpolation and error-driven learning approaches based on a boosting algorithm. The results show that model interpolation, though simple, achieves the best results on all the open test sets where the test data is very different from the training data. The tree-based boosting algorithm achieves the best perf… ▽ More

    Submitted 21 July, 2019; originally announced July 2019.

  41. arXiv:1907.01647  [pdf, other

    cs.IR cs.LG stat.ML

    Bandit Learning for Diversified Interactive Recommendation

    Authors: Yong Liu, Yingtai Xiao, Qiong Wu, Chunyan Miao, Juyong Zhang

    Abstract: Interactive recommender systems that enable the interactions between users and the recommender system have attracted increasing research attentions. Previous methods mainly focus on optimizing recommendation accuracy. However, they usually ignore the diversity of the recommendation results, thus usually results in unsatisfying user experiences. In this paper, we propose a novel diversified recomme… ▽ More

    Submitted 30 June, 2019; originally announced July 2019.

  42. arXiv:1907.01162  [pdf, other

    cs.LG stat.ML

    Sample Adaptive Multiple Kernel Learning for Failure Prediction of Railway Points

    Authors: Zhibin Li, Jian Zhang, Qiang Wu, Yongshun Gong, Jinfeng Yi, Christina Kirsch

    Abstract: Railway points are among the key components of railway infrastructure. As a part of signal equipment, points control the routes of trains at railway junctions, having a significant impact on the reliability, capacity, and punctuality of rail transport. Traditionally, maintenance of points is based on a fixed time interval or raised after the equipment failures. Instead, it would be of great value… ▽ More

    Submitted 2 July, 2019; originally announced July 2019.

    Comments: Accepted by KDD2019 Applied Data Science track

  43. arXiv:1906.09175  [pdf, other

    stat.ME

    MarZIC: A Marginal mediation model for Zero-Inflated Compositional mediators with applications to microbiome data

    Authors: Quran Wu, A. James O'Malley, Janaka S. S. Liyanage, Susmita Datta, Raad Z. Gharaibeh, Christian Jobin, Margaret R. Karagas, Modupe O. Coker, Anne G. Hoen, Brock C. Christensen, Juliette C. Madan, Zhigang Li

    Abstract: The human microbiome can contribute to pathogeneses of many complex diseases by mediating disease-leading causal pathways. However, standard mediation analysis methods are not adequate to analyze the microbiome as a mediator due to the excessive number of zero-valued sequencing reads in the data that is compounded by its compositional structure. The two main challenges raised by the zero-inflated… ▽ More

    Submitted 29 April, 2022; v1 submitted 21 June, 2019; originally announced June 2019.

    Comments: Corresponding: Zhigang Li

  44. arXiv:1906.04450  [pdf, other

    cs.LG stat.ML

    Quantifying Intrinsic Uncertainty in Classification via Deep Dirichlet Mixture Networks

    Authors: Qingyang Wu, He Li, Lexin Li, Zhou Yu

    Abstract: With the widespread success of deep neural networks in science and technology, it is becoming increasingly important to quantify the uncertainty of the predictions produced by deep learning. In this paper, we introduce a new method that attaches an explicit uncertainty statement to the probabilities of classification using deep neural networks. Precisely, we view that the classification probabilit… ▽ More

    Submitted 14 August, 2019; v1 submitted 11 June, 2019; originally announced June 2019.

  45. arXiv:1906.03737  [pdf, other

    cs.LG cs.SI stat.ML

    Factorization Bandits for Online Influence Maximization

    Authors: Qingyun Wu, Zhige Li, Huazheng Wang, Wei Chen, Hongning Wang

    Abstract: We study the problem of online influence maximization in social networks. In this problem, a learner aims to identify the set of "best influencers" in a network by interacting with it, i.e., repeatedly selecting seed nodes and observing activation feedback in the network. We capitalize on an important property of the influence maximization problem named network assortativity, which is ignored by m… ▽ More

    Submitted 15 July, 2019; v1 submitted 9 June, 2019; originally announced June 2019.

    Comments: 11 pages (including SUPPLEMENT)

  46. arXiv:1906.01981  [pdf, ps, other

    math.OC cs.LG q-fin.PM q-fin.RM stat.ML

    Understanding Distributional Ambiguity via Non-robust Chance Constraint

    Authors: Qi Wu, Shumin Ma, Cheuk Hang Leung, Wei Liu, Nanbo Peng

    Abstract: This paper provides a non-robust interpretation of the distributionally robust optimization (DRO) problem by relating the distributional uncertainties to the chance probabilities. Our analysis allows a decision-maker to interpret the size of the ambiguity set, which is often lack of business meaning, through the chance parameters constraining the objective function. We first show that, for general… ▽ More

    Submitted 21 September, 2020; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: 8 pages, 3 figures, Accepted for publication in ICAIF 2020

  47. arXiv:1812.06398  [pdf, other

    cs.LG stat.ML

    Gold Seeker: Information Gain from Policy Distributions for Goal-oriented Vision-and-Langauge Reasoning

    Authors: Ehsan Abbasnejad, Iman Abbasnejad, Qi Wu, Javen Shi, Anton van den Hengel

    Abstract: As Computer Vision moves from a passive analysis of pixels to active analysis of semantics, the breadth of information algorithms need to reason over has expanded significantly. One of the key challenges in this vein is the ability to identify the information required to make a decision, and select an action that will recover it. We propose a reinforcement-learning approach that maintains a distri… ▽ More

    Submitted 29 March, 2020; v1 submitted 16 December, 2018; originally announced December 2018.

  48. arXiv:1811.12802  [pdf, other

    cs.IR cs.LG cs.SD eess.AS stat.ML

    Naive Dictionary On Musical Corpora: From Knowledge Representation To Pattern Recognition

    Authors: Qiuyi Wu, Ernest Fokoue

    Abstract: In this paper, we propose and develop the novel idea of treating musical sheets as literary documents in the traditional text analytics parlance, to fully benefit from the vast amount of research already existing in statistical text mining and topic modelling. We specifically introduce the idea of representing any given piece of music as a collection of "musical words" that we codenamed "muselets"… ▽ More

    Submitted 28 November, 2018; originally announced November 2018.

    Comments: 25 pages

    MSC Class: 62P15; 62P25; 62P99; 68W40; 68W01; 91E10; 91E45; 82-08; 62-07 ACM Class: E.2; F.1.1; F.2.0; I.1.3; I.1.4; I.2.4; I.2.1; I.2.6; I.5.5; I.7.0

  49. arXiv:1811.07342  [pdf, other

    cs.LG cs.AI stat.ML

    Transform-Based Multilinear Dynamical System for Tensor Time Series Analysis

    Authors: Weijun Lu, Xiao-Yang Liu, Qingwei Wu, Yue Sun, Anwar Walid

    Abstract: We propose a novel multilinear dynamical system (MLDS) in a transform domain, named $\mathcal{L}$-MLDS, to model tensor time series. With transformations applied to a tensor data, the latent multidimensional correlations among the frontal slices are built, and thus resulting in the computational independence in the transform domain. This allows the exact separability of the multi-dimensional probl… ▽ More

    Submitted 18 November, 2018; originally announced November 2018.

  50. arXiv:1811.02629  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge

    Authors: Spyridon Bakas, Mauricio Reyes, Andras Jakab, Stefan Bauer, Markus Rempfler, Alessandro Crimi, Russell Takeshi Shinohara, Christoph Berger, Sung Min Ha, Martin Rozycki, Marcel Prastawa, Esther Alberts, Jana Lipkova, John Freymann, Justin Kirby, Michel Bilello, Hassan Fathallah-Shaykh, Roland Wiest, Jan Kirschke, Benedikt Wiestler, Rivka Colen, Aikaterini Kotrotsou, Pamela Lamontagne, Daniel Marcus, Mikhail Milchenko , et al. (402 additional authors not shown)

    Abstract: Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles dissem… ▽ More

    Submitted 23 April, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: The International Multimodal Brain Tumor Segmentation (BraTS) Challenge