Zum Hauptinhalt springen

Showing 1–6 of 6 results for author: Filom, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.11215  [pdf, other

    cs.LG cs.AI cs.CE cs.CL math.NA

    Mechanistic interpretability of large language models with applications to the financial services industry

    Authors: Ashkan Golgoon, Khashayar Filom, Arjun Ravi Kannan

    Abstract: Large Language Models such as GPTs (Generative Pre-trained Transformers) exhibit remarkable capabilities across a broad spectrum of applications. Nevertheless, due to their intrinsic complexity, these models present substantial challenges in interpreting their internal decision-making processes. This lack of transparency poses critical challenges when it comes to their adaptation by financial inst… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    MSC Class: 68T01 ACM Class: I.2.7

  2. arXiv:2303.10216  [pdf, other

    cs.LG math.PR

    Approximation of group explainers with coalition structure using Monte Carlo sampling on the product space of coalitions and features

    Authors: Konstandinos Kotsiopoulos, Alexey Miroshnikov, Khashayar Filom, Arjun Ravi Kannan

    Abstract: In recent years, many Machine Learning (ML) explanation techniques have been designed using ideas from cooperative game theory. These game-theoretic explainers suffer from high complexity, hindering their exact computation in practical settings. In our work, we focus on a wide class of linear game values, as well as coalitional values, for the marginal game based on a given ML model and predictor… ▽ More

    Submitted 18 April, 2024; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: 31 pages, 6 figures

  3. On marginal feature attributions of tree-based models

    Authors: Khashayar Filom, Alexey Miroshnikov, Konstandinos Kotsiopoulos, Arjun Ravi Kannan

    Abstract: Due to their power and ease of use, tree-based machine learning models, such as random forests and gradient-boosted tree ensembles, have become very popular. To interpret them, local feature attributions based on marginal expectations, e.g. marginal (interventional) Shapley, Owen or Banzhaf values, may be employed. Such methods are true to the model and implementation invariant, i.e. dependent onl… ▽ More

    Submitted 5 May, 2024; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: Minor corrections. 30 pages+appendix (64 pages in total), 10 figures. To appear in Foundations of Data Science

    MSC Class: Primary: 68T01; 91A12; 91A80; 05A19; Secondary: 91A68; 91A06; 05C05

  4. arXiv:2102.10878  [pdf, other

    cs.GT math.PR

    Stability theory of game-theoretic group feature explanations for machine learning models

    Authors: Alexey Miroshnikov, Konstandinos Kotsiopoulos, Khashayar Filom, Arjun Ravi Kannan

    Abstract: In this article, we study feature attributions of Machine Learning (ML) models originating from linear game values and coalitional values defined as operators on appropriate functional spaces. The main focus is on random games based on the conditional and marginal expectations. The first part of our work formulates a stability theory for these explanation operators by establishing certain bounds f… ▽ More

    Submitted 10 August, 2024; v1 submitted 22 February, 2021; originally announced February 2021.

    Comments: 82 pages, 43 figures. Typos fixed. Some technical results have been improved

    MSC Class: 91A06; 91A12; 91A80; 46N30; 46N99; 68T01

  5. arXiv:2005.08859  [pdf, other

    cs.LG math.AP stat.ML

    PDE constraints on smooth hierarchical functions computed by neural networks

    Authors: Khashayar Filom, Konrad Paul Kording, Roozbeh Farhoodi

    Abstract: Neural networks are versatile tools for computation, having the ability to approximate a broad range of functions. An important problem in the theory of deep neural networks is expressivity; that is, we want to understand the functions that are computable by a given network. We study real infinitely differentiable (smooth) hierarchical functions implemented by feedforward neural networks via compo… ▽ More

    Submitted 13 August, 2021; v1 submitted 18 May, 2020; originally announced May 2020.

    Comments: Minor changes, typos corrected. 52 pages, 17 figures

  6. arXiv:1904.02309  [pdf, other

    cs.LG math.CO q-bio.NC stat.ML

    On functions computed on trees

    Authors: Roozbeh Farhoodi, Khashayar Filom, Ilenna Simone Jones, Konrad Paul Kording

    Abstract: Any function can be constructed using a hierarchy of simpler functions through compositions. Such a hierarchy can be characterized by a binary rooted tree. Each node of this tree is associated with a function which takes as inputs two numbers from its children and produces one output. Since thinking about functions in terms of computation graphs is getting popular we may want to know which functio… ▽ More

    Submitted 22 October, 2019; v1 submitted 3 April, 2019; originally announced April 2019.

    Comments: 52 pages, 10 figures. The final version. To appear in Neural Computation. May vary slightly from published version

    Journal ref: Neural Computation 31 (2019), no. 11, 2075--2137