Zum Hauptinhalt springen

Showing 1–19 of 19 results for author: Wiegand, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.12568  [pdf, other

    cs.AI cs.CV cs.LG

    Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers

    Authors: Sayed Mohammad Vakilzadeh Hatefi, Maximilian Dreyer, Reduan Achtibat, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

    Abstract: To solve ever more complex problems, Deep Neural Networks are scaled to billions of parameters, leading to huge computational costs. An effective approach to reduce computational requirements and increase efficiency is to prune unnecessary components of these often over-parameterized networks. Previous work has shown that attribution methods from the field of eXplainable AI serve as effective mean… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: Accepted as a workshop paper at ECCV 2024 31 pages (14 pages manuscript, 4 pages references, 13 pages appendix)

  2. arXiv:2402.12118  [pdf, other

    cs.LG cs.AI

    DualView: Data Attribution from the Dual Perspective

    Authors: Galip Ümit Yolcu, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

    Abstract: Local data attribution (or influence estimation) techniques aim at estimating the impact that individual data points seen during training have on particular predictions of an already trained Machine Learning model during test time. Previous methods either do not perform well consistently across different evaluation criteria from literature, are characterized by a high computational demand, or suff… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  3. arXiv:2402.05602  [pdf, other

    cs.CL cs.AI cs.CV cs.LG

    AttnLRP: Attention-Aware Layer-Wise Relevance Propagation for Transformers

    Authors: Reduan Achtibat, Sayed Mohammad Vakilzadeh Hatefi, Maximilian Dreyer, Aakriti Jain, Thomas Wiegand, Sebastian Lapuschkin, Wojciech Samek

    Abstract: Large Language Models are prone to biased predictions and hallucinations, underlining the paramount importance of understanding their model-internal reasoning process. However, achieving faithful attributions for the entirety of a black-box transformer model and maintaining computational efficiency is an unsolved challenge. By extending the Layer-wise Relevance Propagation attribution method to ha… ▽ More

    Submitted 10 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  4. arXiv:2308.12053  [pdf, other

    cs.LG cs.AI cs.NE

    Layer-wise Feedback Propagation

    Authors: Leander Weber, Jim Berend, Alexander Binder, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

    Abstract: In this paper, we present Layer-wise Feedback Propagation (LFP), a novel training approach for neural-network-like predictors that utilizes explainability, specifically Layer-wise Relevance Propagation(LRP), to assign rewards to individual connections based on their respective contributions to solving a given task. This differs from traditional gradient descent, which updates parameters towards an… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    MSC Class: 68T05

  5. arXiv:2211.11426  [pdf, other

    cs.CV cs.AI cs.LG

    Revealing Hidden Context Bias in Segmentation and Object Detection through Concept-specific Explanations

    Authors: Maximilian Dreyer, Reduan Achtibat, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

    Abstract: Applying traditional post-hoc attribution methods to segmentation or object detection predictors offers only limited insights, as the obtained feature attribution maps at input level typically resemble the models' predicted segmentation mask or bounding box. In this work, we address the need for more informative explanations for these predictors by proposing the post-hoc eXplainable Artificial Int… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  6. From Attribution Maps to Human-Understandable Explanations through Concept Relevance Propagation

    Authors: Reduan Achtibat, Maximilian Dreyer, Ilona Eisenbraun, Sebastian Bosse, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

    Abstract: The field of eXplainable Artificial Intelligence (XAI) aims to bring transparency to today's powerful but opaque deep learning models. While local XAI methods explain individual predictions in form of attribution maps, thereby identifying where important features occur (but not providing information about what they represent), global explanation techniques visualize what concepts a model has gener… ▽ More

    Submitted 6 January, 2024; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: 87 pages (13 pages manuscript, 8 pages references, 66 pages appendix) 63 figures (6 in manuscript, 57 in appendix) 3 tables (in appendix)

    Journal ref: Nature Machine Intelligence (year 2023, volume 5, pages 1006-1019)

  7. arXiv:2202.03482  [pdf, other

    cs.CV cs.AI cs.LG

    Navigating Neural Space: Revisiting Concept Activation Vectors to Overcome Directional Divergence

    Authors: Frederik Pahde, Maximilian Dreyer, Leander Weber, Moritz Weckbecker, Christopher J. Anders, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

    Abstract: With a growing interest in understanding neural network prediction strategies, Concept Activation Vectors (CAVs) have emerged as a popular tool for modeling human-understandable concepts in the latent space. Commonly, CAVs are computed by leveraging linear classifiers optimizing the separability of latent representations of samples with and without a given concept. However, in this paper we show t… ▽ More

    Submitted 5 February, 2024; v1 submitted 7 February, 2022; originally announced February 2022.

  8. arXiv:2106.05946  [pdf, other

    cs.CV

    Curiously Effective Features for Image Quality Prediction

    Authors: Sören Becker, Thomas Wiegand, Sebastian Bosse

    Abstract: The performance of visual quality prediction models is commonly assumed to be closely tied to their ability to capture perceptually relevant image aspects. Models are thus either based on sophisticated feature extractors carefully designed from extensive domain knowledge or optimized through feature learning. In contrast to this, we find feature extractors constructed from random noise to be suffi… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: To be published at ICIP 2021

  9. arXiv:2012.11331  [pdf, other

    cs.AR cs.LG

    FantastIC4: A Hardware-Software Co-Design Approach for Efficiently Running 4bit-Compact Multilayer Perceptrons

    Authors: Simon Wiedemann, Suhas Shivapakash, Pablo Wiedemann, Daniel Becking, Wojciech Samek, Friedel Gerfers, Thomas Wiegand

    Abstract: With the growing demand for deploying deep learning models to the "edge", it is paramount to develop techniques that allow to execute state-of-the-art models within very tight and limited resource constraints. In this work we propose a software-hardware optimization paradigm for obtaining a highly efficient execution engine of deep neural networks (DNNs) that are based on fully-connected layers. O… ▽ More

    Submitted 17 December, 2020; originally announced December 2020.

  10. arXiv:2004.11841  [pdf, other

    q-bio.QM cs.LG q-bio.PE stat.AP stat.ML

    Risk Estimation of SARS-CoV-2 Transmission from Bluetooth Low Energy Measurements

    Authors: Felix Sattler, Jackie Ma, Patrick Wagner, David Neumann, Markus Wenzel, Ralf Schäfer, Wojciech Samek, Klaus-Robert Müller, Thomas Wiegand

    Abstract: Digital contact tracing approaches based on Bluetooth low energy (BLE) have the potential to efficiently contain and delay outbreaks of infectious diseases such as the ongoing SARS-CoV-2 pandemic. In this work we propose a novel machine learning based approach to reliably detect subjects that have spent enough time in close proximity to be at risk of being infected. Our study is an important proof… ▽ More

    Submitted 22 April, 2020; originally announced April 2020.

  11. arXiv:2003.03320  [pdf, other

    cs.LG cs.DC cs.MM cs.NI stat.ML

    Trends and Advancements in Deep Neural Network Communication

    Authors: Felix Sattler, Thomas Wiegand, Wojciech Samek

    Abstract: Due to their great performance and scalability properties neural networks have become ubiquitous building blocks of many applications. With the rise of mobile and IoT, these models now are also being increasingly applied in distributed settings, where the owners of the data are separated by limited communication channels and privacy constraints. To address the challenges of these distributed envir… ▽ More

    Submitted 6 March, 2020; originally announced March 2020.

  12. DeepCABAC: A Universal Compression Algorithm for Deep Neural Networks

    Authors: Simon Wiedemann, Heiner Kirchoffer, Stefan Matlage, Paul Haase, Arturo Marban, Talmaj Marinc, David Neumann, Tung Nguyen, Ahmed Osman, Detlev Marpe, Heiko Schwarz, Thomas Wiegand, Wojciech Samek

    Abstract: The field of video compression has developed some of the most sophisticated and efficient compression algorithms known in the literature, enabling very high compressibility for little loss of information. Whilst some of these techniques are domain specific, many of their underlying principles are universal in that they can be adapted and applied for compressing different types of data. In this wor… ▽ More

    Submitted 27 July, 2019; originally announced July 2019.

  13. arXiv:1905.08318  [pdf, other

    cs.LG cs.AI cs.IT

    DeepCABAC: Context-adaptive binary arithmetic coding for deep neural network compression

    Authors: Simon Wiedemann, Heiner Kirchhoffer, Stefan Matlage, Paul Haase, Arturo Marban, Talmaj Marinc, David Neumann, Ahmed Osman, Detlev Marpe, Heiko Schwarz, Thomas Wiegand, Wojciech Samek

    Abstract: We present DeepCABAC, a novel context-adaptive binary arithmetic coder for compressing deep neural networks. It quantizes each weight parameter by minimizing a weighted rate-distortion function, which implicitly takes the impact of quantization on to the accuracy of the network into account. Subsequently, it compresses the quantized values into a bitstream representation with minimal redundancies.… ▽ More

    Submitted 15 May, 2019; originally announced May 2019.

    Comments: ICML 2019, Joint Workshop on On-Device Machine Learning and Compact Deep Neural Network Representations (ODML-CDNNR)

  14. arXiv:1809.04797  [pdf

    cs.AI cs.CY

    Focus Group on Artificial Intelligence for Health

    Authors: Marcel Salathé, Thomas Wiegand, Markus Wenzel

    Abstract: Artificial Intelligence (AI) - the phenomenon of machines being able to solve problems that require human intelligence - has in the past decade seen an enormous rise of interest due to significant advances in effectiveness and use. The health sector, one of the most important sectors for societies and economies worldwide, is particularly interesting for AI applications, given the ongoing digitalis… ▽ More

    Submitted 13 September, 2018; originally announced September 2018.

    Comments: Whitepaper on ITU Focus Group AI4H for 1st workshop at WHO

  15. arXiv:1708.08299  [pdf, other

    cs.NI cs.AI cs.CY

    The Convergence of Machine Learning and Communications

    Authors: Wojciech Samek, Slawomir Stanczak, Thomas Wiegand

    Abstract: The areas of machine learning and communication technology are converging. Today's communications systems generate a huge amount of traffic data, which can help to significantly enhance the design and management of networks and communication components when combined with advanced machine learning methods. Furthermore, recently developed end-to-end training procedures offer new ways to jointly opti… ▽ More

    Submitted 28 August, 2017; originally announced August 2017.

    Comments: 8 pages, 4 figures

  16. arXiv:1708.08296  [pdf, other

    cs.AI cs.CY cs.NE stat.ML

    Explainable Artificial Intelligence: Understanding, Visualizing and Interpreting Deep Learning Models

    Authors: Wojciech Samek, Thomas Wiegand, Klaus-Robert Müller

    Abstract: With the availability of large databases and recent improvements in deep learning methodology, the performance of AI systems is reaching or even exceeding the human level on an increasing number of complex tasks. Impressive examples of this development can be found in domains such as image classification, sentiment analysis, speech understanding or strategic game playing. However, because of their… ▽ More

    Submitted 28 August, 2017; originally announced August 2017.

    Comments: 8 pages, 2 figures

  17. Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment

    Authors: Sebastian Bosse, Dominique Maniry, Klaus-Robert Müller, Thomas Wiegand, Wojciech Samek

    Abstract: We present a deep neural network-based approach to image quality assessment (IQA). The network is trained end-to-end and comprises ten convolutional layers and five pooling layers for feature extraction, and two fully connected layers for regression, which makes it significantly deeper than related IQA models. Unique features of the proposed architecture are that: 1) with slight adaptations it can… ▽ More

    Submitted 7 December, 2017; v1 submitted 6 December, 2016; originally announced December 2016.

    Journal ref: IEEE Transactions on Image Processing, 27(1):206-219, 2018

  18. A Haar Wavelet-Based Perceptual Similarity Index for Image Quality Assessment

    Authors: Rafael Reisenhofer, Sebastian Bosse, Gitta Kutyniok, Thomas Wiegand

    Abstract: In most practical situations, the compression or transmission of images and videos creates distortions that will eventually be perceived by a human observer. Vice versa, image and video restoration techniques, such as inpainting or denoising, aim to enhance the quality of experience of human viewers. Correctly assessing the similarity between an image and an undistorted reference image as subjecti… ▽ More

    Submitted 5 November, 2017; v1 submitted 20 July, 2016; originally announced July 2016.

    Journal ref: Signal Processing: Image Communication 61 (2018) 33-43

  19. arXiv:1308.1126  [pdf, other

    cs.CV

    Image interpolation using Shearlet based iterative refinement

    Authors: H. Lakshman, W. -Q Lim, H. Schwarz, D. Marpe, G. Kutyniok, T. Wiegand

    Abstract: This paper proposes an image interpolation algorithm exploiting sparse representation for natural images. It involves three main steps: (a) obtaining an initial estimate of the high resolution image using linear methods like FIR filtering, (b) promoting sparsity in a selected dictionary through iterative thresholding, and (c) extracting high frequency information from the approximation to refine t… ▽ More

    Submitted 5 August, 2013; originally announced August 2013.

    MSC Class: 94A08 65T60