Zum Hauptinhalt springen

Showing 1–4 of 4 results for author: Hameed, M G A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.07802  [pdf, other

    cs.LG cs.AI cs.CL

    ROSA: Random Subspace Adaptation for Efficient Fine-Tuning

    Authors: Marawan Gamal Abdel Hameed, Aristides Milios, Siva Reddy, Guillaume Rabusseau

    Abstract: Model training requires significantly more memory, compared with inference. Parameter efficient fine-tuning (PEFT) methods provide a means of adapting large models to downstream tasks using less memory. However, existing methods such as adapters, prompt tuning or low-rank adaptation (LoRA) either introduce latency overhead at inference time or achieve subpar downstream performance compared with fu… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  2. arXiv:2406.05045  [pdf, other

    cs.LG

    A Tensor Decomposition Perspective on Second-order RNNs

    Authors: Maude Lizaire, Michael Rizvi-Martel, Marawan Gamal Abdel Hameed, Guillaume Rabusseau

    Abstract: Second-order Recurrent Neural Networks (2RNNs) extend RNNs by leveraging second-order interactions for sequence modelling. These models are provably more expressive than their first-order counterparts and have connections to well-studied models from formal language theory. However, their large parameter tensor makes computations intractable. To circumvent this issue, one approach known as MIRNN co… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024. Camera ready version

  3. arXiv:2210.06299  [pdf, other

    cs.CV

    SeKron: A Decomposition Method Supporting Many Factorization Structures

    Authors: Marawan Gamal Abdel Hameed, Ali Mosleh, Marzieh S. Tahaei, Vahid Partovi Nia

    Abstract: While convolutional neural networks (CNNs) have become the de facto standard for most image processing and computer vision applications, their deployment on edge devices remains challenging. Tensor decomposition methods provide a means of compressing CNNs to meet the wide range of device constraints by imposing certain factorization structures on their convolution tensors. However, being limited t… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  4. arXiv:2109.14710  [pdf, other

    cs.CV cs.LG

    Convolutional Neural Network Compression through Generalized Kronecker Product Decomposition

    Authors: Marawan Gamal Abdel Hameed, Marzieh S. Tahaei, Ali Mosleh, Vahid Partovi Nia

    Abstract: Modern Convolutional Neural Network (CNN) architectures, despite their superiority in solving various problems, are generally too large to be deployed on resource constrained edge devices. In this paper, we reduce memory usage and floating-point operations required by convolutional layers in CNNs. We compress these layers by generalizing the Kronecker Product Decomposition to apply to multidimensi… ▽ More

    Submitted 14 January, 2022; v1 submitted 29 September, 2021; originally announced September 2021.