Zum Hauptinhalt springen

Showing 1–20 of 20 results for author: Ton, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.12387  [pdf, other

    cs.LG

    Conformal Counterfactual Inference under Hidden Confounding

    Authors: Zonghao Chen, Ruocheng Guo, Jean-François Ton, Yang Liu

    Abstract: Personalized decision making requires the knowledge of potential outcomes under different treatments, and confidence intervals about the potential outcomes further enrich this decision-making process and improve its reliability in high-stakes scenarios. Predicting potential outcomes along with its uncertainty in a counterfactual world poses the foundamental challenge in causal inference. Existing… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: Published in SIGKDD'24

  2. arXiv:2403.05171  [pdf, other

    cs.LG cs.AI

    Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation

    Authors: Xiaoying Zhang, Jean-Francois Ton, Wei Shen, Hongning Wang, Yang Liu

    Abstract: We introduce Adversarial Policy Optimization (AdvPO), a novel solution to the pervasive issue of reward over-optimization in Reinforcement Learning from Human Feedback (RLHF) for Large Language Models (LLMs). Over-optimization occurs when a reward model serves as an imperfect proxy for human preference, and RL-driven policy optimization erroneously exploits reward inaccuracies. In this paper, we b… ▽ More

    Submitted 9 July, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  3. arXiv:2402.17106  [pdf, other

    stat.ML cs.CY cs.LG

    Achievable Fairness on Your Data With Utility Guarantees

    Authors: Muhammad Faaiz Taufiq, Jean-Francois Ton, Yang Liu

    Abstract: In machine learning fairness, training models that minimize disparity across different sensitive groups often leads to diminished accuracy, a phenomenon known as the fairness-accuracy trade-off. The severity of this trade-off inherently depends on dataset characteristics such as dataset imbalances or biases and therefore, using a uniform fairness requirement across diverse datasets remains questio… ▽ More

    Submitted 30 May, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  4. arXiv:2402.10412  [pdf, other

    cs.CL cs.AI cs.LG

    Measuring and Reducing LLM Hallucination without Gold-Standard Answers

    Authors: Jiaheng Wei, Yuanshun Yao, Jean-Francois Ton, Hongyi Guo, Andrew Estornell, Yang Liu

    Abstract: LLM hallucination, i.e. generating factually incorrect yet seemingly convincing answers, is currently a major threat to the trustworthiness and reliability of LLMs. The first step towards solving this complicated problem is to measure it. However, existing hallucination metrics require having a benchmark dataset with gold-standard answers, i.e. "best" or "correct" answers written by humans. Such r… ▽ More

    Submitted 6 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Paper Under Review

  5. arXiv:2312.01457  [pdf, other

    stat.ML cs.LG stat.ME

    Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits

    Authors: Muhammad Faaiz Taufiq, Arnaud Doucet, Rob Cornish, Jean-Francois Ton

    Abstract: Off-Policy Evaluation (OPE) in contextual bandits is crucial for assessing new policies using existing data without costly experimentation. However, current OPE methods, such as Inverse Probability Weighting (IPW) and Doubly Robust (DR) estimators, suffer from high variance, particularly in cases of low overlap between target and behavior policies or large action and context spaces. In this paper,… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: Conference on Neural Information Processing Systems (NeurIPS 2023)

  6. arXiv:2310.06205  [pdf, other

    cs.LG

    Fair Classifiers that Abstain without Harm

    Authors: Tongxin Yin, Jean-François Ton, Ruocheng Guo, Yuanshun Yao, Mingyan Liu, Yang Liu

    Abstract: In critical applications, it is vital for classifiers to defer decision-making to humans. We propose a post-hoc method that makes existing classifiers selectively abstain from predicting certain samples. Our abstaining classifier is incentivized to maintain the original accuracy for each sub-population (i.e. no harm) while achieving a set of group fairness definitions to a user specified degree. T… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

  7. arXiv:2310.05755  [pdf, other

    cs.LG

    Deep Concept Removal

    Authors: Yegor Klochkov, Jean-Francois Ton, Ruocheng Guo, Yang Liu, Hang Li

    Abstract: We address the problem of concept removal in deep neural networks, aiming to learn representations that do not encode certain specified concepts (e.g., gender etc.) We propose a novel method based on adversarial linear classifiers trained on a concept dataset, which helps to remove the targeted attribute while maintaining model performance. Our approach Deep Concept Removal incorporates adversaria… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 21 pages, 9 figures, 4 tables

  8. arXiv:2309.12559  [pdf, other

    cs.LG cs.AI cs.CV

    Invariant Learning via Probability of Sufficient and Necessary Causes

    Authors: Mengyue Yang, Zhen Fang, Yonggang Zhang, Yali Du, Furui Liu, Jean-Francois Ton, Jianhong Wang, Jun Wang

    Abstract: Out-of-distribution (OOD) generalization is indispensable for learning models in the wild, where testing distribution typically unknown and different from the training. Recent methods derived from causality have shown great potential in achieving OOD generalization. However, existing methods mainly focus on the invariance property of causes, while largely overlooking the property of \textit{suffic… ▽ More

    Submitted 10 May, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

  9. arXiv:2308.05374  [pdf, other

    cs.AI cs.LG

    Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment

    Authors: Yang Liu, Yuanshun Yao, Jean-Francois Ton, Xiaoying Zhang, Ruocheng Guo, Hao Cheng, Yegor Klochkov, Muhammad Faaiz Taufiq, Hang Li

    Abstract: Ensuring alignment, which refers to making models behave in accordance with human intentions [1,2], has become a critical task before deploying large language models (LLMs) in real-world applications. For instance, OpenAI devoted six months to iteratively aligning GPT-4 before its release [3]. However, a major challenge faced by practitioners is the lack of clear guidance on evaluating whether LLM… ▽ More

    Submitted 20 March, 2024; v1 submitted 10 August, 2023; originally announced August 2023.

    Comments: Fixed several typos

  10. arXiv:2306.07188  [pdf, other

    cs.LG cs.CY cs.IR

    Inference-time Stochastic Ranking with Risk Control

    Authors: Ruocheng Guo, Jean-François Ton, Yang Liu, Hang Li

    Abstract: Learning to Rank (LTR) methods are vital in online economies, affecting users and item providers. Fairness in LTR models is crucial to allocate exposure proportionally to item relevance. Widely used deterministic LTR models can lead to unfair exposure distribution, especially when items with the same relevance receive slightly different ranking scores. Stochastic LTR models, incorporating the Plac… ▽ More

    Submitted 18 May, 2024; v1 submitted 12 June, 2023; originally announced June 2023.

  11. arXiv:2206.04405  [pdf, other

    stat.ML cs.LG

    Conformal Off-Policy Prediction in Contextual Bandits

    Authors: Muhammad Faaiz Taufiq, Jean-Francois Ton, Rob Cornish, Yee Whye Teh, Arnaud Doucet

    Abstract: Most off-policy evaluation methods for contextual bandits have focused on the expected outcome of a policy, which is estimated via methods that at best provide only asymptotic guarantees. However, in many applications, the expectation may not be the best measure of performance as it does not capture the variability of the outcome. In addition, particularly in safety-critical settings, stronger gua… ▽ More

    Submitted 26 October, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: Proceedings of 36th Conference on Neural Information Processing System (NeurIPS 2022)

  12. arXiv:2202.03297  [pdf, other

    stat.ML cs.LG stat.ME

    Grassmann Stein Variational Gradient Descent

    Authors: Xing Liu, Harrison Zhu, Jean-François Ton, George Wynne, Andrew Duncan

    Abstract: Stein variational gradient descent (SVGD) is a deterministic particle inference algorithm that provides an efficient alternative to Markov chain Monte Carlo. However, SVGD has been found to suffer from variance underestimation when the dimensionality of the target distribution is high. Recent developments have advocated projecting both the score function and the data onto real lines to sidestep th… ▽ More

    Submitted 11 March, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: 20 pages, 13 figures, to appear in AISTATS 2022

    MSC Class: 62F15 (Bayesian inference)

  13. arXiv:2109.08249  [pdf, other

    cs.CL

    Regularized Training of Nearest Neighbor Language Models

    Authors: Jean-Francois Ton, Walter Talbott, Shuangfei Zhai, Josh Susskind

    Abstract: Including memory banks in a natural language processing architecture increases model capacity by equipping it with additional data at inference time. In this paper, we build upon $k$NN-LM \citep{khandelwal20generalization}, which uses a pre-trained language model together with an exhaustive $k$NN search through the training data (memory bank) to achieve state-of-the-art results. We investigate whe… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

  14. arXiv:2106.03477  [pdf, other

    stat.ML cs.LG

    BayesIMP: Uncertainty Quantification for Causal Data Fusion

    Authors: Siu Lun Chau, Jean-François Ton, Javier González, Yee Whye Teh, Dino Sejdinovic

    Abstract: While causal models are becoming one of the mainstays of machine learning, the problem of uncertainty quantification in causal inference remains challenging. In this paper, we study the causal data fusion problem, where datasets pertaining to multiple causal graphs are combined to estimate the average treatment effect of a target variable. As data arises from multiple sources and can vary in quali… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: 10 pages main text, 10 pages supplementary materials

  15. arXiv:2007.02809  [pdf, other

    stat.ML cs.LG

    Meta Learning for Causal Direction

    Authors: Jean-Francois Ton, Dino Sejdinovic, Kenji Fukumizu

    Abstract: The inaccessibility of controlled randomized trials due to inherent constraints in many fields of science has been a fundamental issue in causal inference. In this paper, we focus on distinguishing the cause from effect in the bivariate setting under limited observational data. Based on recent developments in meta learning as well as in causal inference, we introduce a novel generative model that… ▽ More

    Submitted 21 February, 2021; v1 submitted 6 July, 2020; originally announced July 2020.

  16. arXiv:2002.08797  [pdf, other

    stat.ML cs.CV cs.LG

    Robust Pruning at Initialization

    Authors: Soufiane Hayou, Jean-Francois Ton, Arnaud Doucet, Yee Whye Teh

    Abstract: Overparameterized Neural Networks (NN) display state-of-the-art performance. However, there is a growing need for smaller, energy-efficient, neural networks tobe able to use machine learning applications on devices with limited computational resources. A popular approach consists of using pruning techniques. While these techniques have traditionally focused on pruning pre-trained NN (LeCun et al.,… ▽ More

    Submitted 19 May, 2021; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: 37 pages, 12 figures

  17. arXiv:1912.02738  [pdf, other

    stat.ML cs.LG

    MetaFun: Meta-Learning with Iterative Functional Updates

    Authors: Jin Xu, Jean-Francois Ton, Hyunjik Kim, Adam R. Kosiorek, Yee Whye Teh

    Abstract: We develop a functional encoder-decoder approach to supervised meta-learning, where labeled data is encoded into an infinite-dimensional functional representation rather than a finite-dimensional one. Furthermore, rather than directly producing the representation, we learn a neural update rule resembling functional gradient descent which iteratively improves the representation. The final represent… ▽ More

    Submitted 16 August, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

  18. arXiv:1906.02236  [pdf, other

    stat.ML cs.LG

    Noise Contrastive Meta-Learning for Conditional Density Estimation using Kernel Mean Embeddings

    Authors: Jean-Francois Ton, Lucian Chan, Yee Whye Teh, Dino Sejdinovic

    Abstract: Current meta-learning approaches focus on learning functional representations of relationships between variables, i.e. on estimating conditional expectations in regression. In many applications, however, we are faced with conditional distributions which cannot be meaningfully summarized using expectation only (due to e.g. multimodality). Hence, we consider the problem of conditional density estima… ▽ More

    Submitted 23 February, 2021; v1 submitted 5 June, 2019; originally announced June 2019.

  19. arXiv:1902.09724  [pdf, other

    cs.LG stat.ML

    Automated Model Selection with Bayesian Quadrature

    Authors: Henry Chai, Jean-Francois Ton, Roman Garnett, Michael A. Osborne

    Abstract: We present a novel technique for tailoring Bayesian quadrature (BQ) to model selection. The state-of-the-art for comparing the evidence of multiple models relies on Monte Carlo methods, which converge slowly and are unreliable for computationally expensive models. Previous research has shown that BQ offers sample efficiency superior to Monte Carlo in computing the evidence of an individual model.… ▽ More

    Submitted 1 March, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

    Comments: 10 pages, 5 figures. Currently in submission to ICML 2019

  20. arXiv:1806.09178  [pdf, other

    stat.ML cs.LG

    Towards A Unified Analysis of Random Fourier Features

    Authors: Zhu Li, Jean-Francois Ton, Dino Oglic, Dino Sejdinovic

    Abstract: Random Fourier features is a widely used, simple, and effective technique for scaling up kernel methods. The existing theoretical analysis of the approach, however, remains focused on specific learning tasks and typically gives pessimistic bounds which are at odds with the empirical results. We tackle these problems and provide the first unified risk analysis of learning with random Fourier featur… ▽ More

    Submitted 4 February, 2021; v1 submitted 24 June, 2018; originally announced June 2018.