Showing 1–1 of 1 results for author: Rizvi-Martel, M

  1. arXiv:2406.05045  [pdf, other]

    cs.LG

    A Tensor Decomposition Perspective on Second-order RNNs

    Authors: Maude Lizaire, Michael Rizvi-Martel, Marawan Gamal Abdel Hameed, Guillaume Rabusseau

    Abstract: Second-order Recurrent Neural Networks (2RNNs) extend RNNs by leveraging second-order interactions for sequence modelling. These models are provably more expressive than their first-order counterparts and have connections to well-studied models from formal language theory. However, their large parameter tensor makes computations intractable. To circumvent this issue, one approach known as MIRNN co…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024. Camera-ready version.