Skip to main content

Showing 1–50 of 52 results for author: Low, B K H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.04981  [pdf, other

    cs.CL cs.LG

    TRACE: TRansformer-based Attribution using Contrastive Embeddings in LLMs

    Authors: Cheng Wang, Xinyang Lu, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: The rapid evolution of large language models (LLMs) represents a substantial leap forward in natural language understanding and generation. However, alongside these advancements come significant challenges related to the accountability and transparency of LLM responses. Reliable source attribution is essential to adhering to stringent legal and regulatory standards, including those set forth by th… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

  2. arXiv:2407.04411  [pdf, other

    cs.CR cs.AI cs.CL

    Waterfall: Framework for Robust and Scalable Text Watermarking

    Authors: Gregory Kang Ruey Lau, Xinyuan Niu, Hieu Dao, Jiangwei Chen, Chuan-Sheng Foo, Bryan Kian Hsiang Low

    Abstract: Protecting intellectual property (IP) of text such as articles and code is increasingly important, especially as sophisticated attacks become possible, such as paraphrasing by large language models (LLMs) or even unauthorized training of LLMs on copyrighted text to infringe such IP. However, existing text watermarking methods are not robust enough against such attacks nor scalable to millions of u… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  3. arXiv:2406.14507  [pdf, other

    cs.LG cs.AI

    On Newton's Method to Unlearn Neural Networks

    Authors: Nhung Bui, Xinyang Lu, See-Kiong Ng, Bryan Kian Hsian Low

    Abstract: Machine unlearning facilitates personal data ownership, including the ``right to be forgotten''. The proliferation of applications of \emph{neural networks} (NNs) trained on users' personal data calls for the need to develop algorithms to unlearn an NN. Since retraining is costly, efficiency is often achieved through approximate unlearning which aims to unlearn a trained NN to be close to the retr… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  4. arXiv:2406.14473  [pdf, other

    cs.LG cs.CL

    Data-Centric AI in the Age of Large Language Models

    Authors: Xinyi Xu, Zhaoxuan Wu, Rui Qiao, Arun Verma, Yao Shu, Jingtan Wang, Xinyuan Niu, Zhenfeng He, Jiangwei Chen, Zijian Zhou, Gregory Kang Ruey Lau, Hieu Dao, Lucas Agussurja, Rachael Hwee Ling Sim, Xiaoqiang Lin, Wenyang Hu, Zhongxiang Dai, Pang Wei Koh, Bryan Kian Hsiang Low

    Abstract: This position paper proposes a data-centric viewpoint of AI research, focusing on large language models (LLMs). We start by making the key observation that data is instrumental in the developmental (e.g., pretraining and fine-tuning) and inferential stages (e.g., in-context learning) of LLMs, and yet it receives disproportionally low attention from the research community. We identify four specific… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Preprint

  5. arXiv:2406.04606  [pdf, other

    cs.LG cs.AI

    Helpful or Harmful Data? Fine-tuning-free Shapley Attribution for Explaining Language Model Predictions

    Authors: Jingtan Wang, Xiaoqiang Lin, Rui Qiao, Chuan-Sheng Foo, Bryan Kian Hsiang Low

    Abstract: The increasing complexity of foundational models underscores the necessity for explainability, particularly for fine-tuning, the most widely used training method for adapting models to downstream tasks. Instance attribution, one type of explanation, attributes the model prediction to each training example by an instance score. However, the robustness of instance scores, specifically towards datase… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted to ICML 2024

  6. arXiv:2405.17346  [pdf, other

    cs.LG cs.AI

    Prompt Optimization with Human Feedback

    Authors: Xiaoqiang Lin, Zhongxiang Dai, Arun Verma, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low

    Abstract: Large language models (LLMs) have demonstrated remarkable performances in various tasks. However, the performance of LLMs heavily depends on the input prompt, which has given rise to a number of recent works on prompt optimization. However, previous works often require the availability of a numeric score to assess the quality of every prompt. Unfortunately, when a human user interacts with a black… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Preprint, 18 pages

  7. arXiv:2405.16122  [pdf, other

    cs.AI cs.CL cs.LG stat.ML

    Prompt Optimization with EASE? Efficient Ordering-aware Automated Selection of Exemplars

    Authors: Zhaoxuan Wu, Xiaoqiang Lin, Zhongxiang Dai, Wenyang Hu, Yao Shu, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low

    Abstract: Large language models (LLMs) have shown impressive capabilities in real-world applications. The capability of in-context learning (ICL) allows us to adapt an LLM to downstream tasks by including input-label exemplars in the prompt without model fine-tuning. However, the quality of these exemplars in the prompt greatly impacts performance, highlighting the need for an effective automated exemplar s… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 23 pages, 1 figure, 23 tables

  8. arXiv:2405.14899  [pdf, other

    cs.CL cs.AI cs.LG

    DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning

    Authors: Zijian Zhou, Xiaoqiang Lin, Xinyi Xu, Alok Prakash, Daniela Rus, Bryan Kian Hsiang Low

    Abstract: In-context learning (ICL) allows transformer-based language models that are pre-trained on general text to quickly learn a specific task with a few "task demonstrations" without updating their parameters, significantly boosting their flexibility and generality. ICL possesses many distinct characteristics from conventional machine learning, thereby requiring new approaches to interpret this learnin… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  9. arXiv:2404.07662  [pdf, other

    cs.LG cs.AI physics.comp-ph physics.data-an stat.ML

    PINNACLE: PINN Adaptive ColLocation and Experimental points selection

    Authors: Gregory Kang Ruey Lau, Apivich Hemachandra, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: Physics-Informed Neural Networks (PINNs), which incorporate PDEs as soft constraints, train with a composite loss function that contains multiple training point types: different types of collocation points chosen during training to enforce each PDE and initial/boundary conditions, and experimental points which are usually costly to obtain via experiments or simulations. Training PINNs using this l… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: Accepted to 12th International Conference on Learning Representations (ICLR 2024), 36 pages

  10. arXiv:2404.01676  [pdf, other

    cs.LG

    Incentives in Private Collaborative Machine Learning

    Authors: Rachael Hwee Ling Sim, Yehong Zhang, Trong Nghia Hoang, Xinyi Xu, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Collaborative machine learning involves training models on data from multiple parties but must incentivize their participation. Existing data valuation methods fairly value and reward each party based on shared data or model parameters but neglect the privacy risks involved. To address this, we introduce differential privacy (DP) as an incentive. Each party can select its required DP guarantee and… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted to NeurIPS 2023

  11. arXiv:2403.07591  [pdf, other

    cs.LG

    Robustifying and Boosting Training-Free Neural Architecture Search

    Authors: Zhenfeng He, Yao Shu, Zhongxiang Dai, Bryan Kian Hsiang Low

    Abstract: Neural architecture search (NAS) has become a key component of AutoML and a standard tool to automate the design of deep neural networks. Recently, training-free NAS as an emerging paradigm has successfully reduced the search costs of standard training-based NAS by estimating the true architecture performance with only training-free metrics. Nevertheless, the estimation ability of these metrics ty… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: Accepted by ICLR 2024. Code available at https://github.com/hzf1174/RoBoT

  12. arXiv:2403.02993  [pdf, other

    cs.AI

    Localized Zeroth-Order Prompt Optimization

    Authors: Wenyang Hu, Yao Shu, Zongmin Yu, Zhaoxuan Wu, Xiangqiang Lin, Zhongxiang Dai, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: The efficacy of large language models (LLMs) in understanding and generating natural language has aroused a wide interest in developing prompt-based methods to harness the power of black-box LLMs. Existing methodologies usually prioritize a global optimization for finding the global optimum, which however will perform poorly in certain tasks. This thus motivates us to re-think the necessity of fin… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  13. arXiv:2402.02359  [pdf, other

    math.OC cs.LG

    Incremental Quasi-Newton Methods with Faster Superlinear Convergence Rates

    Authors: Zhuanghua Liu, Luo Luo, Bryan Kian Hsiang Low

    Abstract: We consider the finite-sum optimization problem, where each component function is strongly convex and has Lipschitz continuous gradient and Hessian. The recently proposed incremental quasi-Newton method is based on BFGS update and achieves a local superlinear convergence rate that is dependent on the condition number of the problem. This paper proposes a more efficient quasi-Newton method by incor… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  14. arXiv:2402.02356  [pdf, other

    math.OC cs.LG

    Decentralized Sum-of-Nonconvex Optimization

    Authors: Zhuanghua Liu, Bryan Kian Hsiang Low

    Abstract: We consider the optimization problem of minimizing the sum-of-nonconvex function, i.e., a convex function that is the average of nonconvex components. The existing stochastic algorithms for such a problem only focus on a single machine and the centralized scenario. In this paper, we study the sum-of-nonconvex optimization in the decentralized setting. We present a new theoretical analysis of the P… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  15. arXiv:2401.14846  [pdf, other

    cs.LG cs.CV

    Understanding Domain Generalization: A Noise Robustness Perspective

    Authors: Rui Qiao, Bryan Kian Hsiang Low

    Abstract: Despite the rapid development of machine learning algorithms for domain generalization (DG), there is no clear empirical evidence that the existing DG algorithms outperform the classic empirical risk minimization (ERM) across standard benchmarks. To better understand this phenomenon, we investigate whether there are benefits of DG algorithms over ERM through the lens of label noise. Specifically,… ▽ More

    Submitted 17 March, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: Accepted to the 12th International Conference on Learning Representations (ICLR 2024). Code is available at https://github.com/qiaoruiyt/NoiseRobustDG

  16. arXiv:2312.11413  [pdf, other

    cs.LG cs.AI

    DeRDaVa: Deletion-Robust Data Valuation for Machine Learning

    Authors: Xiao Tian, Rachael Hwee Ling Sim, Jue Fan, Bryan Kian Hsiang Low

    Abstract: Data valuation is concerned with determining a fair valuation of data from data sources to compensate them or to identify training examples that are the most or least useful for predictions. With the rising interest in personal data ownership and data protection regulations, model owners will likely have to fulfil more data deletion requests. This raises issues that have not been addressed by exis… ▽ More

    Submitted 21 January, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  17. arXiv:2311.02715  [pdf, other

    cs.LG stat.ML

    Exploiting Correlated Auxiliary Feedback in Parameterized Bandits

    Authors: Arun Verma, Zhongxiang Dai, Yao Shu, Bryan Kian Hsiang Low

    Abstract: We study a novel variant of the parameterized bandits problem in which the learner can observe additional auxiliary feedback that is correlated with the observed reward. The auxiliary feedback is readily available in many real-life applications, e.g., an online platform that wants to recommend the best-rated services to its users can observe the user's rating of service (rewards) and collect addit… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

    Comments: Accepted to NeurIPS 2023

  18. arXiv:2311.01195  [pdf, other

    cs.LG cs.AI

    Batch Bayesian Optimization for Replicable Experimental Design

    Authors: Zhongxiang Dai, Quoc Phong Nguyen, Sebastian Shenghong Tay, Daisuke Urano, Richalynn Leong, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Many real-world experimental design problems (a) evaluate multiple experimental conditions in parallel and (b) replicate each condition multiple times due to large and heteroscedastic observation noise. Given a fixed total budget, this naturally induces a trade-off between evaluating more unique conditions while replicating each of them fewer times vs. evaluating fewer unique conditions and replic… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted to NeurIPS 2023

  19. arXiv:2310.05373  [pdf, other

    cs.LG cs.AI

    Quantum Bayesian Optimization

    Authors: Zhongxiang Dai, Gregory Kang Ruey Lau, Arun Verma, Yao Shu, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Kernelized bandits, also known as Bayesian optimization (BO), has been a prevalent method for optimizing complicated black-box reward functions. Various BO algorithms have been theoretically shown to enjoy upper bounds on their cumulative regret which are sub-linear in the number T of iterations, and a regret lower bound of Omega(sqrt(T)) has been derived which represents the unavoidable regrets f… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  20. arXiv:2310.02905  [pdf, other

    cs.LG cs.AI cs.CL

    Use Your INSTINCT: INSTruction optimization for LLMs usIng Neural bandits Coupled with Transformers

    Authors: Xiaoqiang Lin, Zhaoxuan Wu, Zhongxiang Dai, Wenyang Hu, Yao Shu, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low

    Abstract: Large language models (LLMs) have shown remarkable instruction-following capabilities and achieved impressive performances in various applications. However, the performances of LLMs depend heavily on the instructions given to them, which are typically manually tuned with substantial human efforts. Recent work has used the query-efficient Bayesian optimization (BO) algorithm to automatically optimi… ▽ More

    Submitted 23 June, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

    Comments: Accepted to ICML 2024

  21. arXiv:2310.00646  [pdf, other

    cs.LG cs.AI stat.ML

    WASA: WAtermark-based Source Attribution for Large Language Model-Generated Data

    Authors: Jingtan Wang, Xinyang Lu, Zitong Zhao, Zhongxiang Dai, Chuan-Sheng Foo, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: The impressive performances of large language models (LLMs) and their immense potential for commercialization have given rise to serious concerns over the intellectual property (IP) of their training data. In particular, the synthetic texts generated by LLMs may infringe the IP of the data being used to train the LLMs. To this end, it is imperative to be able to (a) identify the data provider who… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

  22. arXiv:2308.04077  [pdf, other

    cs.LG cs.AI

    Federated Zeroth-Order Optimization using Trajectory-Informed Surrogate Gradients

    Authors: Yao Shu, Xiaoqiang Lin, Zhongxiang Dai, Bryan Kian Hsiang Low

    Abstract: Federated optimization, an emerging paradigm which finds wide real-world applications such as federated learning, enables multiple clients (e.g., edge devices) to collaboratively optimize a global function. The clients do not share their local datasets and typically only share their local gradients. However, the gradient information is not available in many applications of federated optimization,… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

  23. arXiv:2308.00629  [pdf, other

    cs.LG cs.AI

    Hessian-Aware Bayesian Optimization for Decision Making Systems

    Authors: Mohit Rajpal, Lac Gia Tran, Yehong Zhang, Bryan Kian Hsiang Low

    Abstract: Many approaches for optimizing decision making systems rely on gradient based methods requiring informative feedback from the environment. However, in the case where such feedback is sparse or uninformative, such approaches may result in poor performance. Derivative-free approaches such as Bayesian Optimization mitigate the dependency on the quality of gradient feedback, but are known to scale poo… ▽ More

    Submitted 1 December, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

    Comments: Fixed a typo

  24. arXiv:2306.05764  [pdf, other

    cs.LG cs.AI cs.GT cs.MA

    Fair yet Asymptotically Equal Collaborative Learning

    Authors: Xiaoqiang Lin, Xinyi Xu, See-Kiong Ng, Chuan-Sheng Foo, Bryan Kian Hsiang Low

    Abstract: In collaborative learning with streaming data, nodes (e.g., organizations) jointly and continuously learn a machine learning (ML) model by sharing the latest model updates computed from their latest streaming data. For the more resourceful nodes to be willing to share their model updates, they need to be fairly incentivized. This paper explores an incentive design that guarantees fairness so that… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted to 40th International Conference on Machine Learning (ICML 2023), 37 pages

  25. arXiv:2306.04454  [pdf, other

    cs.LG cs.AI

    Training-Free Neural Active Learning with Initialization-Robustness Guarantees

    Authors: Apivich Hemachandra, Zhongxiang Dai, Jasraj Singh, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: Existing neural active learning algorithms have aimed to optimize the predictive performance of neural networks (NNs) by selecting data for labelling. However, other than a good predictive performance, being robust against random parameter initializations is also a crucial requirement in safety-critical applications. To this end, we introduce our expected variance with Gaussian processes (EV-GP) c… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted to 40th International Conference on Machine Learning (ICML 2023), 41 pages

  26. arXiv:2305.14201  [pdf, other

    cs.LG cs.AI cs.CL

    Goat: Fine-tuned LLaMA Outperforms GPT-4 on Arithmetic Tasks

    Authors: Tiedong Liu, Bryan Kian Hsiang Low

    Abstract: We introduce Goat, a fine-tuned LLaMA model that significantly outperforms GPT-4 on a range of arithmetic tasks. Fine-tuned on a synthetically generated dataset, Goat achieves state-of-the-art performance on BIG-bench arithmetic sub-task. In particular, the zero-shot Goat-7B matches or even surpasses the accuracy achieved by the few-shot PaLM-540B. Surprisingly, Goat can achieve near-perfect accur… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  27. arXiv:2305.06176  [pdf

    cs.CL cs.AI cs.LG

    Fine-tuning Language Models with Generative Adversarial Reward Modelling

    Authors: Zhang Ze Yu, Lau Jia Jaw, Zhang Hui, Bryan Kian Hsiang Low

    Abstract: Reinforcement Learning with Human Feedback (RLHF) has been demonstrated to significantly enhance the performance of large language models (LLMs) by aligning their outputs with desired human values through instruction tuning. However, RLHF is constrained by the expertise and productivity limitations of human evaluators. A response to this downside is to fall back to supervised fine-tuning (SFT) wit… ▽ More

    Submitted 5 March, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 22 pages, 9 figures, 12 tables

  28. arXiv:2301.11135  [pdf, other

    cs.LG cs.DC

    FedHQL: Federated Heterogeneous Q-Learning

    Authors: Flint Xiaofeng Fan, Yining Ma, Zhongxiang Dai, Cheston Tan, Bryan Kian Hsiang Low, Roger Wattenhofer

    Abstract: Federated Reinforcement Learning (FedRL) encourages distributed agents to learn collectively from each other's experience to improve their performance without exchanging their raw trajectories. The existing work on FedRL assumes that all participating agents are homogeneous, which requires all agents to share the same policy parameterization (e.g., network architectures and training configurations… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: Preprint. Under review

  29. arXiv:2210.06850  [pdf, other

    cs.LG cs.AI

    Sample-Then-Optimize Batch Neural Thompson Sampling

    Authors: Zhongxiang Dai, Yao Shu, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Bayesian optimization (BO), which uses a Gaussian process (GP) as a surrogate to model its objective function, is popular for black-box optimization. However, due to the limitations of GPs, BO underperforms in some problems such as those with categorical, high-dimensional or image inputs. To this end, recent works have used the highly expressive neural networks (NNs) as the surrogate model and der… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted to 36th Conference on Neural Information Processing Systems (NeurIPS 2022), Extended version with proofs and additional experimental details and results, 30 pages

  30. arXiv:2206.09341  [pdf, other

    cs.LG cs.AI

    Bayesian Optimization under Stochastic Delayed Feedback

    Authors: Arun Verma, Zhongxiang Dai, Bryan Kian Hsiang Low

    Abstract: Bayesian optimization (BO) is a widely-used sequential method for zeroth-order optimization of complex and expensive-to-compute black-box functions. The existing BO methods assume that the function evaluation (feedback) is available to the learner immediately or after a fixed delay. Such assumptions may not be practical in many real-life problems like online recommendations, clinical trials, and h… ▽ More

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: Accepted to ICML 2022

  31. arXiv:2206.06872  [pdf, other

    cs.LG cs.AI

    On Provably Robust Meta-Bayesian Optimization

    Authors: Zhongxiang Dai, Yizhou Chen, Haibin Yu, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Bayesian optimization (BO) has become popular for sequential optimization of black-box functions. When BO is used to optimize a target function, we often have access to previous evaluations of potentially related functions. This begs the question as to whether we can leverage these previous experiences to accelerate the current BO task through meta-learning (meta-BO), while ensuring robustness aga… ▽ More

    Submitted 15 June, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: Accepted to 38th Conference on Uncertainty in Artificial Intelligence (UAI 2022), Extended version with proofs and additional experimental details and results, 31 pages

  32. arXiv:2205.14309  [pdf, other

    cs.LG cs.AI

    Federated Neural Bandits

    Authors: Zhongxiang Dai, Yao Shu, Arun Verma, Flint Xiaofeng Fan, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Recent works on neural contextual bandits have achieved compelling performances due to their ability to leverage the strong representation power of neural networks (NNs) for reward prediction. Many applications of contextual bandits involve multiple agents who collaborate without sharing raw observations, thus giving rise to the setting of federated contextual bandits. Existing works on federated… ▽ More

    Submitted 28 February, 2023; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: ICLR 2023. Code: https://github.com/daizhongxiang/Federated-Neural-Bandits

  33. arXiv:2205.07428  [pdf, other

    cs.LG cs.GT stat.ML

    On the Convergence of the Shapley Value in Parametric Bayesian Learning Games

    Authors: Lucas Agussurja, Xinyi Xu, Bryan Kian Hsiang Low

    Abstract: Measuring contributions is a classical problem in cooperative game theory where the Shapley value is the most well-known solution concept. In this paper, we establish the convergence property of the Shapley value in parametric Bayesian learning games where players perform a Bayesian inference using their combined data, and the posterior-prior KL divergence is used as the characteristic function. W… ▽ More

    Submitted 14 June, 2022; v1 submitted 15 May, 2022; originally announced May 2022.

    Comments: Accepted to the 39th International Conference on Machine Learning (ICML 2022). Extended version with derivations

  34. arXiv:2205.04901  [pdf, other

    cs.LG math.ST

    Adjusted Expected Improvement for Cumulative Regret Minimization in Noisy Bayesian Optimization

    Authors: Shouri Hu, Haowei Wang, Zhongxiang Dai, Bryan Kian Hsiang Low, Szu Hui Ng

    Abstract: The expected improvement (EI) is one of the most popular acquisition functions for Bayesian optimization (BO) and has demonstrated good empirical performances in many applications for the minimization of simple regret. However, under the evaluation metric of cumulative regret, the performance of EI may not be competitive, and its existing theoretical regret upper bound still has room for improveme… ▽ More

    Submitted 24 May, 2022; v1 submitted 10 May, 2022; originally announced May 2022.

  35. arXiv:2202.13597  [pdf, other

    cs.LG stat.ML

    Rectified Max-Value Entropy Search for Bayesian Optimization

    Authors: Quoc Phong Nguyen, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Although the existing max-value entropy search (MES) is based on the widely celebrated notion of mutual information, its empirical performance can suffer due to two misconceptions whose implications on the exploration-exploitation trade-off are investigated in this paper. These issues are essential in the development of future acquisition functions and the improvement of the existing ones as they… ▽ More

    Submitted 28 February, 2022; originally announced February 2022.

  36. Markov Chain Monte Carlo-Based Machine Unlearning: Unlearning What Needs to be Forgotten

    Authors: Quoc Phong Nguyen, Ryutaro Oikawa, Dinil Mon Divakaran, Mun Choon Chan, Bryan Kian Hsiang Low

    Abstract: As the use of machine learning (ML) models is becoming increasingly popular in many real-world applications, there are practical challenges that need to be addressed for model maintenance. One such challenge is to 'undo' the effect of a specific subset of dataset used for training a model. This specific subset may contain malicious or adversarial data injected by an attacker, which affects the mod… ▽ More

    Submitted 28 February, 2022; originally announced February 2022.

    Comments: Proceedings of the 2022 ACM Asia Conference on Computer and Communications Security (ASIA CCS '22), May 30-June 3, 2022, Nagasaki, Japan

  37. arXiv:2201.09785  [pdf, other

    cs.LG cs.AI

    Unifying and Boosting Gradient-Based Training-Free Neural Architecture Search

    Authors: Yao Shu, Zhongxiang Dai, Zhaoxuan Wu, Bryan Kian Hsiang Low

    Abstract: Neural architecture search (NAS) has gained immense popularity owing to its ability to automate neural architecture design. A number of training-free metrics are recently proposed to realize NAS without training, hence making NAS more scalable. Despite their competitive empirical performances, a unified theoretical understanding of these training-free metrics is lacking. As a consequence, (a) the… ▽ More

    Submitted 12 October, 2022; v1 submitted 24 January, 2022; originally announced January 2022.

    Comments: Published as a conference paper at NeurIPS 2022

  38. arXiv:2112.09327  [pdf, other

    cs.LG

    Incentivizing Collaboration in Machine Learning via Synthetic Data Rewards

    Authors: Sebastian Shenghong Tay, Xinyi Xu, Chuan Sheng Foo, Bryan Kian Hsiang Low

    Abstract: This paper presents a novel collaborative generative modeling (CGM) framework that incentivizes collaboration among self-interested parties to contribute data to a pool for training a generative model (e.g., GAN), from which synthetic data are drawn and distributed to the parties as rewards commensurate to their contributions. Distributing synthetic data as rewards (instead of trained models or mo… ▽ More

    Submitted 17 December, 2021; originally announced December 2021.

    Comments: 36th AAAI Conference on Artificial Intelligence (AAAI 2022), Extended version with derivations, 42 pages

  39. arXiv:2110.14153  [pdf, other

    cs.LG cs.CR

    Differentially Private Federated Bayesian Optimization with Distributed Exploration

    Authors: Zhongxiang Dai, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Bayesian optimization (BO) has recently been extended to the federated learning (FL) setting by the federated Thompson sampling (FTS) algorithm, which has promising applications such as federated hyperparameter tuning. However, FTS is not equipped with a rigorous privacy guarantee which is an important consideration in FL. Recent works have incorporated differential privacy (DP) into the training… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: Accepted to 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Extended version with proofs and additional experimental details and results, 29 pages

  40. arXiv:2110.14074  [pdf, other

    cs.LG cs.AI

    Fault-Tolerant Federated Reinforcement Learning with Theoretical Guarantee

    Authors: Flint Xiaofeng Fan, Yining Ma, Zhongxiang Dai, Wei Jing, Cheston Tan, Bryan Kian Hsiang Low

    Abstract: The growing literature of Federated Learning (FL) has recently inspired Federated Reinforcement Learning (FRL) to encourage multiple agents to federatively build a better decision-making policy without sharing raw trajectories. Despite its promising applications, existing works on FRL fail to I) provide theoretical analysis on its convergence, and II) account for random system failures and adversa… ▽ More

    Submitted 3 November, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: Published at NeurIPS 2021. Extended version with proofs and additional experimental details and results. New version changes: reduced file size of figures; added a diagram illustrating the problem setting; added link to code on GitHub; modified proof for Theorem 6 (highlighted in red)

  41. arXiv:2109.02533  [pdf, other

    cs.LG

    Neural Ensemble Search via Bayesian Sampling

    Authors: Yao Shu, Yizhou Chen, Zhongxiang Dai, Bryan Kian Hsiang Low

    Abstract: Recently, neural architecture search (NAS) has been applied to automate the design of neural networks in real-world applications. A large number of algorithms have been developed to improve the search cost or the performance of the final selected architectures in NAS. Unfortunately, these NAS algorithms aim to select only one single well-performing architecture from their search spaces and thus ha… ▽ More

    Submitted 17 June, 2022; v1 submitted 6 September, 2021; originally announced September 2021.

    Comments: Published as a conference paper at UAI 2022

  42. arXiv:2109.00817  [pdf, other

    cs.LG cs.AI

    NASI: Label- and Data-agnostic Neural Architecture Search at Initialization

    Authors: Yao Shu, Shaofeng Cai, Zhongxiang Dai, Beng Chin Ooi, Bryan Kian Hsiang Low

    Abstract: Recent years have witnessed a surging interest in Neural Architecture Search (NAS). Various algorithms have been proposed to improve the search efficiency and effectiveness of NAS, i.e., to reduce the search cost and improve the generalization performance of the selected architectures, respectively. However, the search efficiency of these algorithms is severely limited by the need for model traini… ▽ More

    Submitted 25 April, 2022; v1 submitted 2 September, 2021; originally announced September 2021.

    Comments: Published as a conference paper at ICLR 2022

  43. arXiv:2107.14465  [pdf, other

    cs.LG cs.AI stat.ML

    Trusted-Maximizers Entropy Search for Efficient Bayesian Optimization

    Authors: Quoc Phong Nguyen, Zhaoxuan Wu, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Information-based Bayesian optimization (BO) algorithms have achieved state-of-the-art performance in optimizing a black-box objective function. However, they usually require several approximations or simplifying assumptions (without clearly understanding their effects on the BO performance) and/or their generalization to batch BO is computationally unwieldy, especially with an increasing batch si… ▽ More

    Submitted 30 July, 2021; originally announced July 2021.

    Comments: Published as a conference paper at UAI 2021

  44. arXiv:2105.06126  [pdf, other

    cs.LG

    Value-at-Risk Optimization with Gaussian Processes

    Authors: Quoc Phong Nguyen, Zhongxiang Dai, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Value-at-risk (VaR) is an established measure to assess risks in critical real-world applications with random environmental factors. This paper presents a novel VaR upper confidence bound (V-UCB) algorithm for maximizing the VaR of a black-box objective function with the first no-regret guarantee. To realize this, we first derive a confidence bound of VaR and then prove the existence of values of… ▽ More

    Submitted 13 May, 2021; originally announced May 2021.

  45. arXiv:2104.08472  [pdf, other

    cs.LG

    Convolutional Normalizing Flows for Deep Gaussian Processes

    Authors: Haibin Yu, Dapeng Liu, Yizhou Chen, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Deep Gaussian processes (DGPs), a hierarchical composition of GP models, have successfully boosted the expressive power of their single-layer counterpart. However, it is impossible to perform exact inference in DGPs, which has motivated the recent development of variational inference-based methods. Unfortunately, either these methods yield a biased posterior belief or it is difficult to evaluate t… ▽ More

    Submitted 26 May, 2021; v1 submitted 17 April, 2021; originally announced April 2021.

    Comments: To appear in Proceedings of the International Joint Conference on Neural Networks 2021 (IJCNN'21). arXiv admin note: text overlap with arXiv:1910.11998

  46. arXiv:2012.10695  [pdf, other

    cs.LG stat.ML

    An Information-Theoretic Framework for Unifying Active Learning Problems

    Authors: Quoc Phong Nguyen, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: This paper presents an information-theoretic framework for unifying active learning problems: level set estimation (LSE), Bayesian optimization (BO), and their generalized variant. We first introduce a novel active learning criterion that subsumes an existing LSE algorithm and achieves state-of-the-art performance in LSE problems with a continuous input domain. Then, by exploiting the relationship… ▽ More

    Submitted 19 December, 2020; originally announced December 2020.

    Comments: 35th AAAI Conference on Artificial Intelligence (AAAI 2021), Extended version with derivations, 12 pages

  47. arXiv:2012.10688  [pdf, other

    cs.LG stat.ML

    Top-$k$ Ranking Bayesian Optimization

    Authors: Quoc Phong Nguyen, Sebastian Tay, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: This paper presents a novel approach to top-$k$ ranking Bayesian optimization (top-$k$ ranking BO) which is a practical and significant generalization of preferential BO to handle top-$k$ ranking and tie/indifference observations. We first design a surrogate model that is not only capable of catering to the above observations, but is also supported by a classic random utility model. Another equall… ▽ More

    Submitted 19 December, 2020; originally announced December 2020.

    Comments: 35th AAAI Conference on Artificial Intelligence (AAAI 2021), Extended version with derivations, 13 pages

  48. arXiv:2011.08541  [pdf, other

    cs.LG cs.AI cs.RO

    Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization

    Authors: Sreejith Balakrishnan, Quoc Phong Nguyen, Bryan Kian Hsiang Low, Harold Soh

    Abstract: The problem of inverse reinforcement learning (IRL) is relevant to a variety of tasks including value alignment and robot learning from demonstration. Despite significant algorithmic contributions in recent years, IRL remains an ill-posed problem at its core; multiple reward functions coincide with the observed behavior and the actual reward function is not identifiable without prior knowledge or… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

    Comments: Accepted to 34th Conference on Neural Information Processing Systems (NeurIPS 2020). Includes Appendix. 21 pages

  49. arXiv:2010.12883  [pdf, other

    cs.LG stat.ML

    Variational Bayesian Unlearning

    Authors: Quoc Phong Nguyen, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: This paper studies the problem of approximately unlearning a Bayesian model from a small subset of the training data to be erased. We frame this problem as one of minimizing the Kullback-Leibler divergence between the approximate posterior belief of model parameters after directly unlearning from erased data vs. the exact posterior belief from retraining with remaining data. Using the variational… ▽ More

    Submitted 24 October, 2020; originally announced October 2020.

    Comments: 34th Annual Conference on Neural Information Processing Systems (NeurIPS 2020), Extended version with proofs, 22 pages

  50. arXiv:2010.12799  [pdf, other

    cs.LG cs.CR stat.ML

    Private Outsourced Bayesian Optimization

    Authors: Dmitrii Kharkovskii, Zhongxiang Dai, Bryan Kian Hsiang Low

    Abstract: This paper presents the private-outsourced-Gaussian process-upper confidence bound (PO-GP-UCB) algorithm, which is the first algorithm for privacy-preserving Bayesian optimization (BO) in the outsourced setting with a provable performance guarantee. We consider the outsourced setting where the entity holding the dataset and the entity performing BO are represented by different parties, and the dat… ▽ More

    Submitted 24 October, 2020; originally announced October 2020.

    Comments: 37th International Conference on Machine Learning (ICML 2020), Extended version with proofs, 27 pages