Zum Hauptinhalt springen

Showing 1–1 of 1 results for author: Pollano, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.13102  [pdf, other

    cs.CL cs.LG math.AT

    Detecting out-of-distribution text using topological features of transformer-based language models

    Authors: Andres Pollano, Anupam Chaudhuri, Anj Simmons

    Abstract: To safeguard machine learning systems that operate on textual data against out-of-distribution (OOD) inputs that could cause unpredictable behaviour, we explore the use of topological features of self-attention maps from transformer-based language models to detect when input text is out of distribution. Self-attention forms the core of transformer-based language models, dynamically assigning vecto… ▽ More

    Submitted 18 July, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: 8 pages, 6 figures, 3 tables, to be published in proceedings of the IJCAI-2024 AISafety Workshop