Zum Hauptinhalt springen

Showing 1–1 of 1 results for author: Horovicz, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.10114  [pdf, other

    cs.CL

    TokenSHAP: Interpreting Large Language Models with Monte Carlo Shapley Value Estimation

    Authors: Roni Goldshmidt, Miriam Horovicz

    Abstract: As large language models (LLMs) become increasingly prevalent in critical applications, the need for interpretable AI has grown. We introduce TokenSHAP, a novel method for interpreting LLMs by attributing importance to individual tokens or substrings within input prompts. This approach adapts Shapley values from cooperative game theory to natural language processing, offering a rigorous framework… ▽ More

    Submitted 22 July, 2024; v1 submitted 14 July, 2024; originally announced July 2024.