Zum Hauptinhalt springen

Showing 1–4 of 4 results for author: Moalla, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.09835  [pdf, other

    cs.CL

    Investigating Low-Rank Training in Transformer Language Models: Efficiency and Scaling Analysis

    Authors: Xiuying Wei, Skander Moalla, Razvan Pascanu, Caglar Gulcehre

    Abstract: State-of-the-art LLMs often rely on scale with high computational costs, which has sparked a research agenda to reduce parameter counts and costs without significantly impacting performance. Our study focuses on Transformer-based LLMs, specifically applying low-rank parametrization to the computationally intensive feedforward networks (FFNs), which are less studied than attention blocks. In contra… ▽ More

    Submitted 24 July, 2024; v1 submitted 13 July, 2024; originally announced July 2024.

    Comments: Accepted by ICML 2024 Next Generation of Sequence Modeling Architectures Workshop. Short version of arXiv:2406.16450

  2. arXiv:2406.16450  [pdf, other

    cs.CL

    Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers

    Authors: Xiuying Wei, Skander Moalla, Razvan Pascanu, Caglar Gulcehre

    Abstract: State-of-the-art results in large language models (LLMs) often rely on scale, which becomes computationally expensive. This has sparked a research agenda to reduce these models' parameter count and computational costs without significantly impacting their performance. Our study focuses on transformer-based LLMs, specifically targeting the computationally intensive feedforward networks (FFN), which… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2405.00662  [pdf, other

    cs.LG

    No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO

    Authors: Skander Moalla, Andrea Miele, Razvan Pascanu, Caglar Gulcehre

    Abstract: Reinforcement learning (RL) is inherently rife with non-stationarity since the states and rewards the agent observes during training depend on its changing policy. Therefore, networks in deep RL must be capable of adapting to new observations and fitting new targets. However, previous works have observed that networks in off-policy deep value-based methods exhibit a decrease in representation rank… ▽ More

    Submitted 25 July, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    Comments: ICML ARLET workshop version. Code and run histories are available at https://github.com/CLAIRE-Labo/no-representation-no-trust

  4. arXiv:2212.07489  [pdf, other

    cs.LG cs.MA

    SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning

    Authors: Benjamin Ellis, Jonathan Cook, Skander Moalla, Mikayel Samvelyan, Mingfei Sun, Anuj Mahajan, Jakob N. Foerster, Shimon Whiteson

    Abstract: The availability of challenging benchmarks has played a key role in the recent progress of machine learning. In cooperative multi-agent reinforcement learning, the StarCraft Multi-Agent Challenge (SMAC) has become a popular testbed for centralised training with decentralised execution. However, after years of sustained improvement on SMAC, algorithms now achieve near-perfect performance. In this w… ▽ More

    Submitted 17 October, 2023; v1 submitted 14 December, 2022; originally announced December 2022.