Zum Hauptinhalt springen

Showing 1–7 of 7 results for author: Kannan, A R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.11215  [pdf, other

    cs.LG cs.AI cs.CE cs.CL math.NA

    Mechanistic interpretability of large language models with applications to the financial services industry

    Authors: Ashkan Golgoon, Khashayar Filom, Arjun Ravi Kannan

    Abstract: Large Language Models such as GPTs (Generative Pre-trained Transformers) exhibit remarkable capabilities across a broad spectrum of applications. Nevertheless, due to their intrinsic complexity, these models present substantial challenges in interpreting their internal decision-making processes. This lack of transparency poses critical challenges when it comes to their adaptation by financial inst… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    MSC Class: 68T01 ACM Class: I.2.7

  2. arXiv:2406.02778  [pdf, other

    cs.LG

    MS-IMAP -- A Multi-Scale Graph Embedding Approach for Interpretable Manifold Learning

    Authors: Shay Deutsch, Lionel Yelibi, Alex Tong Lin, Arjun Ravi Kannan

    Abstract: Deriving meaningful representations from complex, high-dimensional data in unsupervised settings is crucial across diverse machine learning applications. This paper introduces a framework for multi-scale graph network embedding based on spectral graph wavelets that employs a contrastive learning approach. A significant feature of the proposed embedding is its capacity to establish a correspondence… ▽ More

    Submitted 5 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  3. arXiv:2303.10216  [pdf, other

    cs.LG math.PR

    Approximation of group explainers with coalition structure using Monte Carlo sampling on the product space of coalitions and features

    Authors: Konstandinos Kotsiopoulos, Alexey Miroshnikov, Khashayar Filom, Arjun Ravi Kannan

    Abstract: In recent years, many Machine Learning (ML) explanation techniques have been designed using ideas from cooperative game theory. These game-theoretic explainers suffer from high complexity, hindering their exact computation in practical settings. In our work, we focus on a wide class of linear game values, as well as coalitional values, for the marginal game based on a given ML model and predictor… ▽ More

    Submitted 18 April, 2024; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: 31 pages, 6 figures

  4. On marginal feature attributions of tree-based models

    Authors: Khashayar Filom, Alexey Miroshnikov, Konstandinos Kotsiopoulos, Arjun Ravi Kannan

    Abstract: Due to their power and ease of use, tree-based machine learning models, such as random forests and gradient-boosted tree ensembles, have become very popular. To interpret them, local feature attributions based on marginal expectations, e.g. marginal (interventional) Shapley, Owen or Banzhaf values, may be employed. Such methods are true to the model and implementation invariant, i.e. dependent onl… ▽ More

    Submitted 5 May, 2024; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: Minor corrections. 30 pages+appendix (64 pages in total), 10 figures. To appear in Foundations of Data Science

    MSC Class: Primary: 68T01; 91A12; 91A80; 05A19; Secondary: 91A68; 91A06; 05C05

  5. arXiv:2111.11259  [pdf, other

    cs.LG math.PR

    Model-agnostic bias mitigation methods with regressor distribution control for Wasserstein-based fairness metrics

    Authors: Alexey Miroshnikov, Konstandinos Kotsiopoulos, Ryan Franks, Arjun Ravi Kannan

    Abstract: This article is a companion paper to our earlier work Miroshnikov et al. (2021) on fairness interpretability, which introduces bias explanations. In the current work, we propose a bias mitigation methodology based upon the construction of post-processed models with fairer regressor distributions for Wasserstein-based fairness metrics. By identifying the list of predictors contributing the most to… ▽ More

    Submitted 19 November, 2021; originally announced November 2021.

    Comments: 29 pages, 32 figures

    MSC Class: 49Q22; 91A12; 68T01

  6. arXiv:2102.10878  [pdf, other

    cs.GT math.PR

    Stability theory of game-theoretic group feature explanations for machine learning models

    Authors: Alexey Miroshnikov, Konstandinos Kotsiopoulos, Khashayar Filom, Arjun Ravi Kannan

    Abstract: In this article, we study feature attributions of Machine Learning (ML) models originating from linear game values and coalitional values defined as operators on appropriate functional spaces. The main focus is on random games based on the conditional and marginal expectations. The first part of our work formulates a stability theory for these explanation operators by establishing certain bounds f… ▽ More

    Submitted 10 August, 2024; v1 submitted 22 February, 2021; originally announced February 2021.

    Comments: 82 pages, 43 figures. Typos fixed. Some technical results have been improved

    MSC Class: 91A06; 91A12; 91A80; 46N30; 46N99; 68T01

  7. Wasserstein-based fairness interpretability framework for machine learning models

    Authors: Alexey Miroshnikov, Konstandinos Kotsiopoulos, Ryan Franks, Arjun Ravi Kannan

    Abstract: The objective of this article is to introduce a fairness interpretability framework for measuring and explaining the bias in classification and regression models at the level of a distribution. In our work, we measure the model bias across sub-population distributions in the model output using the Wasserstein metric. To properly quantify the contributions of predictors, we take into account the fa… ▽ More

    Submitted 8 March, 2022; v1 submitted 5 November, 2020; originally announced November 2020.

    Comments: 39 pages. (submitted for publication)

    MSC Class: 49Q22; 91A12; 68T01; 90C08

    Journal ref: Machine Learning Journal (2022), Springer