Zum Hauptinhalt springen

Showing 1–2 of 2 results for author: Hatefi, S M V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.12568  [pdf, other

    cs.AI cs.CV cs.LG

    Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers

    Authors: Sayed Mohammad Vakilzadeh Hatefi, Maximilian Dreyer, Reduan Achtibat, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

    Abstract: To solve ever more complex problems, Deep Neural Networks are scaled to billions of parameters, leading to huge computational costs. An effective approach to reduce computational requirements and increase efficiency is to prune unnecessary components of these often over-parameterized networks. Previous work has shown that attribution methods from the field of eXplainable AI serve as effective mean… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: Accepted as a workshop paper at ECCV 2024 31 pages (14 pages manuscript, 4 pages references, 13 pages appendix)

  2. arXiv:2402.05602  [pdf, other

    cs.CL cs.AI cs.CV cs.LG

    AttnLRP: Attention-Aware Layer-Wise Relevance Propagation for Transformers

    Authors: Reduan Achtibat, Sayed Mohammad Vakilzadeh Hatefi, Maximilian Dreyer, Aakriti Jain, Thomas Wiegand, Sebastian Lapuschkin, Wojciech Samek

    Abstract: Large Language Models are prone to biased predictions and hallucinations, underlining the paramount importance of understanding their model-internal reasoning process. However, achieving faithful attributions for the entirety of a black-box transformer model and maintaining computational efficiency is an unsolved challenge. By extending the Layer-wise Relevance Propagation attribution method to ha… ▽ More

    Submitted 10 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.