
Showing 1–2 of 2 results for author: Lizaire, M

  1. arXiv:2406.05045 [pdf, other]

    cs.LG

    A Tensor Decomposition Perspective on Second-order RNNs

    Authors: Maude Lizaire, Michael Rizvi-Martel, Marawan Gamal Abdel Hameed, Guillaume Rabusseau

    Abstract: Second-order Recurrent Neural Networks (2RNNs) extend RNNs by leveraging second-order interactions for sequence modelling. These models are provably more expressive than their first-order counterparts and have connections to well-studied models from formal language theory. However, their large parameter tensor makes computations intractable. To circumvent this issue, one approach known as MIRNN co…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024. Camera-ready version.

  2. arXiv:2403.09728 [pdf, other]

    cs.CL cs.AI cs.CC

    Simulating Weighted Automata over Sequences and Trees with Transformers

    Authors: Michael Rizvi, Maude Lizaire, Clara Lacroce, Guillaume Rabusseau

    Abstract: Transformers are ubiquitous models in the natural language processing (NLP) community and have shown impressive empirical successes in the past few years. However, little is understood about how they reason and the limits of their computational capabilities. These models do not process data sequentially, and yet outperform sequential neural models such as RNNs. Recent work has shown that these mod…

    Submitted 12 March, 2024; originally announced March 2024.