Zum Hauptinhalt springen

Showing 1–22 of 22 results for author: Charpentier, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.03425  [pdf, other

    cs.LG stat.ME

    Sequential Conditional Transport on Probabilistic Graphs for Interpretable Counterfactual Fairness

    Authors: Agathe Fernandes Machado, Arthur Charpentier, Ewen Gallic

    Abstract: In this paper, we link two existing approaches to derive counterfactuals: adaptations based on a causal graph, as suggested in Plečko and Meinshausen (2020) and optimal transport, as in De Lara et al. (2024). We extend "Knothe's rearrangement" Bonnotte (2013) and "triangular transport" Zech and Marzouk (2022a) to probabilistic graphical models, and use this counterfactual approach, referred to as… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

  2. arXiv:2408.03421  [pdf, other

    cs.LG stat.ML

    Probabilistic Scores of Classifiers, Calibration is not Enough

    Authors: Agathe Fernandes Machado, Arthur Charpentier, Emmanuel Flachaire, Ewen Gallic, François Hu

    Abstract: In binary classification tasks, accurate representation of probabilistic predictions is essential for various real-world applications such as predicting payment defaults or assessing medical risks. The model must then be well-calibrated to ensure alignment between predicted probabilities and actual outcomes. However, when score heterogeneity deviates from the underlying data probability distributi… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

  3. arXiv:2403.15790  [pdf, other

    cs.LG stat.ML

    Boarding for ISS: Imbalanced Self-Supervised: Discovery of a Scaled Autoencoder for Mixed Tabular Datasets

    Authors: Samuel Stocksieker, Denys Pommeret, Arthur Charpentier

    Abstract: The field of imbalanced self-supervised learning, especially in the context of tabular data, has not been extensively studied. Existing research has predominantly focused on image datasets. This paper aims to fill this gap by examining the specific challenges posed by data imbalance in self-supervised learning in the domain of tabular data, with a primary focus on autoencoders. Autoencoders are wi… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  4. arXiv:2402.07790  [pdf, other

    cs.LG

    From Uncertainty to Precision: Enhancing Binary Classifier Performance through Calibration

    Authors: Agathe Fernandes Machado, Arthur Charpentier, Emmanuel Flachaire, Ewen Gallic, François Hu

    Abstract: The assessment of binary classifier performance traditionally centers on discriminative ability using metrics, such as accuracy. However, these metrics often disregard the model's inherent uncertainty, especially when dealing with sensitive decision-making domains, such as finance or healthcare. Given that model-predicted scores are commonly seen as event probabilities, calibration is crucial for… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  5. arXiv:2401.16197  [pdf, other

    cs.LG cs.CY

    Geospatial Disparities: A Case Study on Real Estate Prices in Paris

    Authors: Agathe Fernandes Machado, François Hu, Philipp Ratz, Ewen Gallic, Arthur Charpentier

    Abstract: Driven by an increasing prevalence of trackers, ever more IoT sensors, and the declining cost of computing power, geospatial information has come to play a pivotal role in contemporary predictive models. While enhancing prognostic performance, geospatial data also has the potential to perpetuate many historical socio-economic patterns, raising concerns about a resurgence of biases and exclusionary… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  6. arXiv:2311.11900  [pdf, other

    stat.ML cs.CY cs.LG

    Measuring and Mitigating Biases in Motor Insurance Pricing

    Authors: Mulah Moriah, Franck Vermet, Arthur Charpentier

    Abstract: The non-life insurance sector operates within a highly competitive and tightly regulated framework, confronting a pivotal juncture in the formulation of pricing strategies. Insurers are compelled to harness a range of statistical methodologies and available data to construct optimal pricing structures that align with the overarching corporate strategy while accommodating the dynamics of market com… ▽ More

    Submitted 20 June, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

  7. arXiv:2310.20508  [pdf, other

    stat.ML cs.CY cs.LG

    Parametric Fairness with Statistical Guarantees

    Authors: François HU, Philipp Ratz, Arthur Charpentier

    Abstract: Algorithmic fairness has gained prominence due to societal and regulatory concerns about biases in Machine Learning models. Common group fairness metrics like Equalized Odds for classification or Demographic Parity for both classification and regression are widely used and a host of computationally advantageous post-processing methods have been developed around them. However, these metrics often l… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  8. arXiv:2309.06627  [pdf, other

    stat.ML cs.CY cs.LG

    A Sequentially Fair Mechanism for Multiple Sensitive Attributes

    Authors: François Hu, Philipp Ratz, Arthur Charpentier

    Abstract: In the standard use case of Algorithmic Fairness, the goal is to eliminate the relationship between a sensitive variable and a corresponding score. Throughout recent years, the scientific community has developed a host of definitions and tools to solve this task, which work well in many practical applications. However, the applicability and effectivity of these tools and definitions becomes less s… ▽ More

    Submitted 14 January, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

  9. arXiv:2308.11090  [pdf, other

    cs.CV cs.LG stat.AP

    Fairness Explainability using Optimal Transport with Applications in Image Classification

    Authors: Philipp Ratz, François Hu, Arthur Charpentier

    Abstract: Ensuring trust and accountability in Artificial Intelligence systems demands explainability of its outcomes. Despite significant progress in Explainable AI, human biases still taint a substantial portion of its training data, raising concerns about unfairness or discriminatory tendencies. Current approaches in the field of Algorithmic Fairness focus on mitigating such biases in the outcomes of a m… ▽ More

    Submitted 31 October, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

  10. arXiv:2308.02966  [pdf, other

    stat.ML cs.LG

    Generalized Oversampling for Learning from Imbalanced datasets and Associated Theory

    Authors: Samuel Stocksieker, Denys Pommeret, Arthur Charpentier

    Abstract: In supervised learning, it is quite frequent to be confronted with real imbalanced datasets. This situation leads to a learning difficulty for standard algorithms. Research and solutions in imbalanced learning have mainly focused on classification tasks. Despite its importance, very few solutions exist for imbalanced regression. In this paper, we propose a data augmentation procedure, the GOLIATH… ▽ More

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: This paper focuses specifically on the Imbalanced Regression issues but could be used for Imbalanced classification tasks

  11. arXiv:2306.12912  [pdf, other

    stat.ML cs.CY cs.LG

    Mitigating Discrimination in Insurance with Wasserstein Barycenters

    Authors: Arthur Charpentier, François Hu, Philipp Ratz

    Abstract: The insurance industry is heavily reliant on predictions of risks based on characteristics of potential customers. Although the use of said models is common, researchers have long pointed out that such practices perpetuate discrimination based on sensitive features such as gender or race. Given that such discrimination can often be attributed to historical data biases, an elimination or at least m… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

  12. Fairness in Multi-Task Learning via Wasserstein Barycenters

    Authors: François Hu, Philipp Ratz, Arthur Charpentier

    Abstract: Algorithmic Fairness is an established field in machine learning that aims to reduce biases in data. Recent advances have proposed various methods to ensure fairness in a univariate environment, where the goal is to de-bias a single task. However, extending fairness to a multi-task setting, where more than one objective is optimised using a shared representation, remains underexplored. To bridge t… ▽ More

    Submitted 6 July, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

  13. arXiv:2302.09288  [pdf, other

    stat.ML cs.LG stat.ME

    Data Augmentation for Imbalanced Regression

    Authors: Samuel Stocksieker, Denys Pommeret, Arthur Charpentier

    Abstract: In this work, we consider the problem of imbalanced data in a regression framework when the imbalanced phenomenon concerns continuous or discrete covariates. Such a situation can lead to biases in the estimates. In this case, we propose a data augmentation algorithm that combines a weighted resampling (WR) and a data augmentation (DA) procedure. In a first step, the DA procedure permits exploring… ▽ More

    Submitted 18 February, 2023; originally announced February 2023.

    Comments: paper accepted at the AISTATS 2023 conference, to be published in PMLR (Proceedings of Machine Learning Research)

  14. arXiv:2207.01010  [pdf, other

    cs.MA cs.LG econ.GN

    Government Intervention in Catastrophe Insurance Markets: A Reinforcement Learning Approach

    Authors: Menna Hassan, Nourhan Sakr, Arthur Charpentier

    Abstract: This paper designs a sequential repeated game of a micro-founded society with three types of agents: individuals, insurers, and a government. Nascent to economics literature, we use Reinforcement Learning (RL), closely related to multi-armed bandit problems, to learn the welfare impact of a set of proposed policy interventions per $1 spent on them. The paper rigorously discusses the desirability o… ▽ More

    Submitted 3 July, 2022; originally announced July 2022.

  15. arXiv:2205.08112  [pdf, ps, other

    econ.GN cs.CY

    The Fairness of Machine Learning in Insurance: New Rags for an Old Man?

    Authors: Laurence Barry, Arthur Charpentier

    Abstract: Since the beginning of their history, insurers have been known to use data to classify and price risks. As such, they were confronted early on with the problem of fairness and discrimination associated with data. This issue is becoming increasingly important with access to more granular and behavioural data, and is evolving to reflect current technologies and societal concerns. By looking into ear… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

  16. arXiv:2202.12008  [pdf, other

    stat.ML cs.AI cs.CY cs.LG stat.AP

    A Fair Pricing Model via Adversarial Learning

    Authors: Vincent Grari, Arthur Charpentier, Marcin Detyniecki

    Abstract: At the core of insurance business lies classification between risky and non-risky insureds, actuarial fairness meaning that risky insureds should contribute more and pay a higher premium than non-risky or less-risky ones. Actuaries, therefore, use econometric or machine learning techniques to classify, but the distinction between a fair actuarial classification and "discrimination" is subtle. For… ▽ More

    Submitted 26 December, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: 20 pages, 12 figures

  17. arXiv:2107.02764  [pdf, other

    q-fin.RM cs.SI econ.GN q-fin.CP

    Collaborative Insurance Sustainability and Network Structure

    Authors: Arthur Charpentier, Lariosse Kouakou, Matthias Löwe, Philipp Ratz, Franck Vermet

    Abstract: The peer-to-peer (P2P) economy has been growing with the advent of the Internet, with well known brands such as Uber or Airbnb being examples thereof. In the insurance sector the approach is still in its infancy, but some companies have started to explore P2P-based collaborative insurance products (eg. Lemonade in the U.S. or Inspeer in France). The actuarial literature only recently started to co… ▽ More

    Submitted 12 September, 2022; v1 submitted 5 July, 2021; originally announced July 2021.

  18. arXiv:2103.03635  [pdf, other

    stat.ML cs.LG econ.EM

    Autocalibration and Tweedie-dominance for Insurance Pricing with Machine Learning

    Authors: Michel Denuit, Arthur Charpentier, Julien Trufin

    Abstract: Boosting techniques and neural networks are particularly effective machine learning methods for insurance pricing. Often in practice, there are nevertheless endless debates about the choice of the right loss function to be used to train the machine learning model, as well as about the appropriate metric to assess the performances of competing models. Also, the sum of fitted values can depart from… ▽ More

    Submitted 9 July, 2021; v1 submitted 5 March, 2021; originally announced March 2021.

  19. arXiv:2003.10014  [pdf, other

    econ.TH cs.LG q-fin.CP

    Reinforcement Learning in Economics and Finance

    Authors: Arthur Charpentier, Romuald Elie, Carl Remlinger

    Abstract: Reinforcement learning algorithms describe how an agent can learn an optimal action policy in a sequential decision process, through repeated experience. In a given environment, the agent policy provides him some running and terminal rewards. As in online learning, the agent learns sequentially. As in multi-armed bandit problems, when an agent picks an action, he can not infer ex-post the rewards… ▽ More

    Submitted 22 March, 2020; originally announced March 2020.

  20. arXiv:1907.02320  [pdf, other

    econ.GN cs.DS econ.EM

    Optimal transport on large networks, a practitioner's guide

    Authors: Arthur Charpentier, Alfred Galichon, Lucas Vernet

    Abstract: This article presents a set of tools for the modeling of a spatial allocation problem in a large geographic market and gives examples of applications. In our settings, the market is described by a network that maps the cost of travel between each pair of adjacent locations. Two types of agents are located at the nodes of this network. The buyers choose the most competitive sellers depending on the… ▽ More

    Submitted 22 August, 2019; v1 submitted 4 July, 2019; originally announced July 2019.

  21. arXiv:1905.10267  [pdf, other

    cs.SI physics.soc-ph stat.ME

    Extended Scale-Free Networks

    Authors: Arthur Charpentier, Emmanuel Flachaire

    Abstract: Recently, Broido & Clauset (2019) mentioned that (strict) Scale-Free networks were rare, in real life. This might be related to the statement of Stumpf, Wiuf & May (2005), that sub-networks of scale-free networks are not scale-free. In the later, those sub-networks are asymptotically scale-free, but one should not forget about second-order deviation (possibly also third order actually). In this ar… ▽ More

    Submitted 28 May, 2019; v1 submitted 24 May, 2019; originally announced May 2019.

  22. arXiv:1010.2621  [pdf, ps, other

    cs.CR

    An Asymmetric Fingerprinting Scheme based on Tardos Codes

    Authors: Ana Charpentier, Caroline Fontaine, Teddy Furon, Ingemar Cox

    Abstract: Tardos codes are currently the state-of-the-art in the design of practical collusion-resistant fingerprinting codes. Tardos codes rely on a secret vector drawn from a publicly known probability distribution in order to generate each Buyer's fingerprint. For security purposes, this secret vector must not be revealed to the Buyers. To prevent an untrustworthy Provider forging a copy of a Work with a… ▽ More

    Submitted 13 October, 2010; originally announced October 2010.

    Comments: 6 pages, 2 figures