Zum Hauptinhalt springen

Showing 1–8 of 8 results for author: Quétu, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.08933  [pdf, other

    cs.LG

    LaCoOT: Layer Collapse through Optimal Transport

    Authors: Victor Quétu, Nour Hezbri, Enzo Tartaglione

    Abstract: Although deep neural networks are well-known for their remarkable performance in tackling complex tasks, their hunger for computational resources remains a significant hurdle, posing energy-consumption issues and restricting their deployment on resource-constrained devices, which stalls their widespread adoption. In this paper, we present an optimal transport method to reduce the depth of over-par… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2404.18949  [pdf, other

    cs.LG

    The Simpler The Better: An Entropy-Based Importance Metric To Reduce Neural Networks' Depth

    Authors: Victor Quétu, Zhu Liao, Enzo Tartaglione

    Abstract: While deep neural networks are highly effective at solving complex tasks, large pre-trained models are commonly employed even to solve consistently simpler downstream tasks, which do not necessarily require a large model's complexity. Motivated by the awareness of the ever-growing AI environmental impact, we propose an efficiency strategy that leverages prior knowledge transferred by large models.… ▽ More

    Submitted 5 June, 2024; v1 submitted 27 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: text overlap with arXiv:2404.16890

  3. arXiv:2404.16890  [pdf, other

    cs.LG cs.AI

    NEPENTHE: Entropy-Based Pruning as a Neural Network Depth's Reducer

    Authors: Zhu Liao, Victor Quétu, Van-Tam Nguyen, Enzo Tartaglione

    Abstract: While deep neural networks are highly effective at solving complex tasks, their computational demands can hinder their usefulness in real-time applications and with limited-resources systems. Besides, for many tasks it is known that these models are over-parametrized: neoteric works have broadly focused on reducing the width of these networks, rather than their depth. In this paper, we aim to redu… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  4. arXiv:2308.16596  [pdf, other

    cs.AI

    The Quest of Finding the Antidote to Sparse Double Descent

    Authors: Victor Quétu, Marta Milovanović

    Abstract: In energy-efficient schemes, finding the optimal size of deep learning models is very important and has a broad impact. Meanwhile, recent studies have reported an unexpected phenomenon, the sparse double descent: as the model's sparsity increases, the performance first worsens, then improves, and finally deteriorates. Such a non-monotonic behavior raises serious questions about the optimal model's… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

  5. Can Unstructured Pruning Reduce the Depth in Deep Neural Networks?

    Authors: Zhu Liao, Victor Quétu, Van-Tam Nguyen, Enzo Tartaglione

    Abstract: Pruning is a widely used technique for reducing the size of deep neural networks while maintaining their performance. However, such a technique, despite being able to massively compress deep models, is hardly able to remove entire layers from a model (even when structured): is this an addressable task? In this study, we introduce EGP, an innovative Entropy Guided Pruning algorithm aimed at reducin… ▽ More

    Submitted 18 August, 2023; v1 submitted 12 August, 2023; originally announced August 2023.

  6. Sparse Double Descent in Vision Transformers: real or phantom threat?

    Authors: Victor Quétu, Marta Milovanovic, Enzo Tartaglione

    Abstract: Vision transformers (ViT) have been of broad interest in recent theoretical and empirical works. They are state-of-the-art thanks to their attention-based approach, which boosts the identification of key features and patterns within images thanks to the capability of avoiding inductive bias, resulting in highly accurate image analysis. Meanwhile, neoteric studies have reported a ``sparse double de… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

  7. arXiv:2303.01213  [pdf, other

    cs.LG

    DSD$^2$: Can We Dodge Sparse Double Descent and Compress the Neural Network Worry-Free?

    Authors: Victor Quétu, Enzo Tartaglione

    Abstract: Neoteric works have shown that modern deep learning models can exhibit a sparse double descent phenomenon. Indeed, as the sparsity of the model increases, the test performance first worsens since the model is overfitting the training data; then, the overfitting reduces, leading to an improvement in performance, and finally, the model begins to forget critical information, resulting in underfitting… ▽ More

    Submitted 8 February, 2024; v1 submitted 2 March, 2023; originally announced March 2023.

  8. Can we avoid Double Descent in Deep Neural Networks?

    Authors: Victor Quétu, Enzo Tartaglione

    Abstract: Finding the optimal size of deep learning models is very actual and of broad impact, especially in energy-saving schemes. Very recently, an unexpected phenomenon, the ``double descent'', has caught the attention of the deep learning community. As the model's size grows, the performance gets first worse, and then goes back to improving. It raises serious questions about the optimal model's size to… ▽ More

    Submitted 4 July, 2023; v1 submitted 26 February, 2023; originally announced February 2023.