Zum Hauptinhalt springen

Showing 1–1 of 1 results for author: Köcher, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.16166  [pdf, other

    cs.FL

    The Power of Hard Attention Transformers on Data Sequences: A Formal Language Theoretic Perspective

    Authors: Pascal Bergsträßer, Chris Köcher, Anthony Widjaja Lin, Georg Zetzsche

    Abstract: Formal language theory has recently been successfully employed to unravel the power of transformer encoders. This setting is primarily applicable in Natural Languange Processing (NLP), as a token embedding function (where a bounded number of tokens is admitted) is first applied before feeding the input to the transformer. On certain kinds of data (e.g. time series), we want our transformers to be… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.