Zum Hauptinhalt springen

Showing 1–2 of 2 results for author: Pöppel, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04303  [pdf, other

    cs.CV cs.AI cs.LG

    Vision-LSTM: xLSTM as Generic Vision Backbone

    Authors: Benedikt Alkin, Maximilian Beck, Korbinian Pöppel, Sepp Hochreiter, Johannes Brandstetter

    Abstract: Transformers are widely used as generic backbones in computer vision, despite initially introduced for natural language processing. Recently, the Long Short-Term Memory (LSTM) has been extended to a scalable and performant architecture - the xLSTM - which overcomes long-standing LSTM limitations via exponential gating and parallelizable matrix memory structure. In this report, we introduce Vision-… ▽ More

    Submitted 2 July, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2405.04517  [pdf, other

    cs.LG cs.AI stat.ML

    xLSTM: Extended Long Short-Term Memory

    Authors: Maximilian Beck, Korbinian Pöppel, Markus Spanring, Andreas Auer, Oleksandra Prudnikova, Michael Kopp, Günter Klambauer, Johannes Brandstetter, Sepp Hochreiter

    Abstract: In the 1990s, the constant error carousel and gating were introduced as the central ideas of the Long Short-Term Memory (LSTM). Since then, LSTMs have stood the test of time and contributed to numerous deep learning success stories, in particular they constituted the first Large Language Models (LLMs). However, the advent of the Transformer technology with parallelizable self-attention at its core… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.