Zum Hauptinhalt springen

Showing 1–33 of 33 results for author: Harmeling, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12480  [pdf, other

    cs.CL cs.LG

    The Power of LLM-Generated Synthetic Data for Stance Detection in Online Political Discussions

    Authors: Stefan Sylvius Wagner, Maike Behrendt, Marc Ziegele, Stefan Harmeling

    Abstract: Stance detection holds great potential for enhancing the quality of online political discussions, as it has shown to be useful for summarizing discussions, detecting misinformation, and evaluating opinion distributions. Usually, transformer-based models are used directly for stance detection, which require large amounts of data. However, the broad range of debate questions in online political disc… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2404.08078  [pdf, other

    cs.CL cs.AI cs.LG

    SQBC: Active Learning using LLM-Generated Synthetic Data for Stance Detection in Online Political Discussions

    Authors: Stefan Sylvius Wagner, Maike Behrendt, Marc Ziegele, Stefan Harmeling

    Abstract: Stance detection is an important task for many applications that analyse or support online political discussions. Common approaches include fine-tuning transformer based models. However, these models require a large amount of labelled data, which might not be available. In this work, we present two different ways to leverage LLM-generated synthetic data to train and improve stance detection agents… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  3. arXiv:2404.02761  [pdf, other

    cs.CL cs.AI cs.LG

    AQuA -- Combining Experts' and Non-Experts' Views To Assess Deliberation Quality in Online Discussions Using LLMs

    Authors: Maike Behrendt, Stefan Sylvius Wagner, Marc Ziegele, Lena Wilms, Anke Stoll, Dominique Heinbach, Stefan Harmeling

    Abstract: Measuring the quality of contributions in political online discussions is crucial in deliberation research and computer science. Research has identified various indicators to assess online discussion quality, and with deep learning advancements, automating these measures has become feasible. While some studies focus on analyzing specific quality indicators, a comprehensive quality score incorporat… ▽ More

    Submitted 17 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

  4. arXiv:2402.03138  [pdf, other

    cs.LG cs.AI

    Just Cluster It: An Approach for Exploration in High-Dimensions using Clustering and Pre-Trained Representations

    Authors: Stefan Sylvius Wagner, Stefan Harmeling

    Abstract: In this paper we adopt a representation-centric perspective on exploration in reinforcement learning, viewing exploration fundamentally as a density estimation problem. We investigate the effectiveness of clustering representations for exploration in 3-D environments, based on the observation that the importance of pixel changes between transitions is less pronounced in 3-D environments compared t… ▽ More

    Submitted 14 August, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted at the International Conference On Machine Learning (ICML) 2024

  5. arXiv:2312.05044  [pdf, other

    cs.LG cs.AI

    Backward Learning for Goal-Conditioned Policies

    Authors: Marc Höftmann, Jan Robine, Stefan Harmeling

    Abstract: Can we learn policies in reinforcement learning without rewards? Can we learn a policy just by trying to reach a goal state? We answer these questions positively by proposing a multi-step procedure that first learns a world model that goes backward in time, secondly generates goal-reaching backward trajectories, thirdly improves those sequences using shortest path finding algorithms, and finally t… ▽ More

    Submitted 15 April, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: World Models, Goal-conditioned, Reward-free, Workshop on Goal-Conditioned Reinforcement Learning - NeurIPS 2023

  6. arXiv:2309.06948  [pdf, other

    eess.IV cs.CV

    Limited-Angle Tomography Reconstruction via Deep End-To-End Learning on Synthetic Data

    Authors: Thomas Germer, Jan Robine, Sebastian Konietzny, Stefan Harmeling, Tobias Uelwer

    Abstract: Computed tomography (CT) has become an essential part of modern science and medicine. A CT scanner consists of an X-ray source that is spun around an object of interest. On the opposite end of the X-ray source, a detector captures X-rays that are not absorbed by the object. The reconstruction of an image is a linear inverse problem, which is usually solved by filtered back projection. However, whe… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    MSC Class: Primary: 92C55; 68T07; Secondary: 68U10; 94A08 ACM Class: I.4.5

  7. arXiv:2308.15911  [pdf, other

    cs.LG cs.AI cs.RO

    Cyclophobic Reinforcement Learning

    Authors: Stefan Sylvius Wagner, Peter Arndt, Jan Robine, Stefan Harmeling

    Abstract: In environments with sparse rewards, finding a good inductive bias for exploration is crucial to the agent's success. However, there are two competing goals: novelty search and systematic exploration. While existing approaches such as curiosity-driven exploration find novelty, they sometimes do not systematically explore the whole state space, akin to depth-first-search vs breadth-first-search. In… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: Published in Transactions on Machine Learning Research (08/2023)

  8. arXiv:2308.11455  [pdf, other

    cs.LG cs.AI eess.IV stat.ML

    A Survey on Self-Supervised Representation Learning

    Authors: Tobias Uelwer, Jan Robine, Stefan Sylvius Wagner, Marc Höftmann, Eric Upschulte, Sebastian Konietzny, Maike Behrendt, Stefan Harmeling

    Abstract: Learning meaningful representations is at the heart of many tasks in the field of modern machine learning. Recently, a lot of methods were introduced that allow learning of image representations without supervision. These representations can then be used in downstream tasks like classification or object detection. The quality of these representations is close to supervised learning, while no label… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

  9. arXiv:2303.07109  [pdf, other

    cs.LG cs.AI stat.ML

    Transformer-based World Models Are Happy With 100k Interactions

    Authors: Jan Robine, Marc Höftmann, Tobias Uelwer, Stefan Harmeling

    Abstract: Deep neural networks have been successful in many reinforcement learning settings. However, compared to human learners they are overly data hungry. To build a sample-efficient world model, we apply a transformer to real-world episodes in an autoregressive manner: not only the compact latent states and the taken actions but also the experienced or predicted rewards are fed into the transformer, so… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: Published as a conference paper at ICLR 2023. Code is available at https://github.com/jrobine/twm

  10. arXiv:2301.05635  [pdf, other

    cs.LG

    Time-Myopic Go-Explore: Learning A State Representation for the Go-Explore Paradigm

    Authors: Marc Höftmann, Jan Robine, Stefan Harmeling

    Abstract: Very large state spaces with a sparse reward signal are difficult to explore. The lack of a sophisticated guidance results in a poor performance for numerous reinforcement learning algorithms. In these cases, the commonly used random exploration is often not helpful. The literature shows that this kind of environments require enormous efforts to systematically explore large chunks of the state spa… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

    Comments: 9 pages, 7 figures, Deep Reinforcement Learning Workshop NeurIPS 2022, Deep RL Workshop 2022 NeurIPS, OpenReview

  11. arXiv:2210.14573  [pdf, other

    stat.ML cs.AI cs.LG

    Learning Causal Graphs in Manufacturing Domains using Structural Equation Models

    Authors: Maximilian Kertel, Stefan Harmeling, Markus Pauly

    Abstract: Many production processes are characterized by numerous and complex cause-and-effect relationships. Since they are only partially known they pose a challenge to effective process control. In this work we present how Structural Equation Models can be used for deriving cause-and-effect relationships from the combination of prior knowledge and process data in the manufacturing domain. Compared to exi… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: To be published in the Proceedings of IEEE AI4I 2022

  12. arXiv:2210.00252  [pdf, other

    cs.CV

    Blindly Deconvolving Super-noisy Blurry Image Sequences

    Authors: Leonid Kostrykin, Stefan Harmeling

    Abstract: Image blur and image noise are imaging artifacts intrinsically arising in image acquisition. In this paper, we consider multi-frame blind deconvolution (MFBD), where image blur is described by the convolution of an unobservable, undeteriorated image and an unknown filter, and the objective is to recover the undeteriorated image from a sequence of its blurry and noisy observations. We present two n… ▽ More

    Submitted 1 October, 2022; originally announced October 2022.

    Comments: 19 pages, 9 figures

    MSC Class: 68U10 (Primary) 68W99 (Secondary) ACM Class: I.4

  13. arXiv:2205.15617  [pdf, other

    cs.LG eess.IV

    Optimizing Intermediate Representations of Generative Models for Phase Retrieval

    Authors: Tobias Uelwer, Sebastian Konietzny, Stefan Harmeling

    Abstract: Phase retrieval is the problem of reconstructing images from magnitude-only measurements. In many real-world applications the problem is underdetermined. When training data is available, generative models allow optimization in a lower-dimensional latent space, hereby constraining the solution set to those images that can be synthesized by the generative model. However, not all possible solutions a… ▽ More

    Submitted 19 January, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: Published in Transactions on Machine Learning Research (TMLR). First two authors contributed equally

  14. arXiv:2205.15053  [pdf, other

    cs.CV

    Deblurring Photographs of Characters Using Deep Neural Networks

    Authors: Thomas Germer, Tobias Uelwer, Stefan Harmeling

    Abstract: In this paper, we present our approach for the Helsinki Deblur Challenge (HDC2021). The task of this challenge is to deblur images of characters without knowing the point spread function (PSF). The organizers provided a dataset of pairs of sharp and blurred images. Our method consists of three steps: First, we estimate a warping transformation of the images to align the sharp images with the blurr… ▽ More

    Submitted 31 May, 2022; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: 15 pages, 13 figures

    MSC Class: Primary: 68U10; Secondary: 78A46 ACM Class: I.2.10

  15. arXiv:2110.13688  [pdf, other

    eess.IV cs.CV cs.LG

    A Closer Look at Reference Learning for Fourier Phase Retrieval

    Authors: Tobias Uelwer, Nick Rucks, Stefan Harmeling

    Abstract: Reconstructing images from their Fourier magnitude measurements is a problem that often arises in different research areas. This process is also referred to as phase retrieval. In this work, we consider a modified version of the phase retrieval problem, which allows for a reference image to be added onto the image before the Fourier magnitudes are measured. We analyze an unrolled Gerchberg-Saxton… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: Accepted at the NeurIPS 2021 Workshop on Deep Learning and Inverse Problems

  16. arXiv:2106.10195  [pdf, other

    eess.IV cs.CV

    Non-Iterative Phase Retrieval With Cascaded Neural Networks

    Authors: Tobias Uelwer, Tobias Hoffmann, Stefan Harmeling

    Abstract: Fourier phase retrieval is the problem of reconstructing a signal given only the magnitude of its Fourier transformation. Optimization-based approaches, like the well-established Gerchberg-Saxton or the hybrid input output algorithm, struggle at reconstructing images from magnitudes that are not oversampled. This motivates the application of learned methods, which allow reconstruction from non-ove… ▽ More

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: Accepted at the 30th International Conference on Artificial Neural Networks (ICANN 2021)

  17. arXiv:2106.10075  [pdf, other

    cs.LG cs.AI cs.RO

    Learning to Plan via a Multi-Step Policy Regression Method

    Authors: Stefan Wagner, Michael Janschek, Tobias Uelwer, Stefan Harmeling

    Abstract: We propose a new approach to increase inference performance in environments that require a specific sequence of actions in order to be solved. This is for example the case for maze environments where ideally an optimal path is determined. Instead of learning a policy for a single step, we want to learn a policy that can predict n actions in advance. Our proposed method called policy horizon regres… ▽ More

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: Accepted at the 30th International Conference on Artificial Neural Networks (ICANN 2021)

  18. arXiv:2104.03393  [pdf, other

    cs.CV

    Contour Proposal Networks for Biomedical Instance Segmentation

    Authors: Eric Upschulte, Stefan Harmeling, Katrin Amunts, Timo Dickscheid

    Abstract: We present a conceptually simple framework for object instance segmentation called Contour Proposal Network (CPN), which detects possibly overlapping objects in an image while simultaneously fitting closed object contours using an interpretable, fixed-sized representation based on Fourier Descriptors. The CPN can incorporate state of the art object detection architectures as backbone networks into… ▽ More

    Submitted 7 April, 2021; originally announced April 2021.

  19. 2D histology meets 3D topology: Cytoarchitectonic brain mapping with Graph Neural Networks

    Authors: Christian Schiffer, Stefan Harmeling, Katrin Amunts, Timo Dickscheid

    Abstract: Cytoarchitecture describes the spatial organization of neuronal cells in the brain, including their arrangement into layers and columns with respect to cell density, orientation, or presence of certain cell types. It allows to segregate the brain into cortical areas and subcortical nuclei, links structure with connectivity and function, and provides a microstructural reference for human brain atla… ▽ More

    Submitted 9 March, 2021; originally announced March 2021.

  20. arXiv:2011.12865  [pdf, other

    eess.IV cs.CV q-bio.NC

    Contrastive Representation Learning for Whole Brain Cytoarchitectonic Mapping in Histological Human Brain Sections

    Authors: Christian Schiffer, Katrin Amunts, Stefan Harmeling, Timo Dickscheid

    Abstract: Cytoarchitectonic maps provide microstructural reference parcellations of the brain, describing its organization in terms of the spatial arrangement of neuronal cell bodies as measured from histological tissue sections. Recent work provided the first automatic segmentations of cytoarchitectonic areas in the visual system using Convolutional Neural Networks. We aim to extend this approach to become… ▽ More

    Submitted 28 January, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

    Comments: Accepted to ISBI 2021

  21. Convolutional Neural Networks for cytoarchitectonic brain mapping at large scale

    Authors: Christian Schiffer, Hannah Spitzer, Kai Kiwitz, Nina Unger, Konrad Wagstyl, Alan C. Evans, Stefan Harmeling, Katrin Amunts, Timo Dickscheid

    Abstract: Human brain atlases provide spatial reference systems for data characterizing brain organization at different levels, coming from different brains. Cytoarchitecture is a basic principle of the microstructural organization of the brain, as regional differences in the arrangement and composition of neuronal cells are indicators of changes in connectivity and function. Automated scanning procedures a… ▽ More

    Submitted 25 November, 2020; originally announced November 2020.

    Comments: Preprint submitted to NeuroImage

  22. arXiv:2010.05767  [pdf, other

    cs.LG cs.AI stat.ML

    Smaller World Models for Reinforcement Learning

    Authors: Jan Robine, Tobias Uelwer, Stefan Harmeling

    Abstract: Sample efficiency remains a fundamental issue of reinforcement learning. Model-based algorithms try to make better use of data by simulating the environment with a model. We propose a new neural network architecture for world models based on a vector quantized-variational autoencoder (VQ-VAE) to encode observations and a convolutional LSTM to predict the next embedding indices. A model-free PPO ag… ▽ More

    Submitted 2 March, 2021; v1 submitted 12 October, 2020; originally announced October 2020.

  23. arXiv:2006.14970  [pdf, other

    cs.CV

    Fast Multi-Level Foreground Estimation

    Authors: Thomas Germer, Tobias Uelwer, Stefan Conrad, Stefan Harmeling

    Abstract: Alpha matting aims to estimate the translucency of an object in a given image. The resulting alpha matte describes pixel-wise to what amount foreground and background colors contribute to the color of the composite image. While most methods in literature focus on estimating the alpha matte, the process of estimating the foreground colors given the input image and its alpha matte is often neglected… ▽ More

    Submitted 26 June, 2020; originally announced June 2020.

    Comments: Accepted at the 25th International Conference on Pattern Recognition 2020 (ICPR)

  24. PyMatting: A Python Library for Alpha Matting

    Authors: Thomas Germer, Tobias Uelwer, Stefan Conrad, Stefan Harmeling

    Abstract: An important step of many image editing tasks is to extract specific objects from an image in order to place them in a scene of a movie or compose them onto another background. Alpha matting describes the problem of separating the objects in the foreground from the background of an image given only a rough sketch. We introduce the PyMatting package for Python which implements various approaches to… ▽ More

    Submitted 25 March, 2020; originally announced March 2020.

    Journal ref: Journal of Open Source Software (2020), 5(54), 2481

  25. arXiv:1912.04981  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Phase Retrieval Using Conditional Generative Adversarial Networks

    Authors: Tobias Uelwer, Alexander Oberstraß, Stefan Harmeling

    Abstract: In this paper, we propose the application of conditional generative adversarial networks to solve various phase retrieval problems. We show that including knowledge of the measurement process at training time leads to an optimization at test time that is more robust to initialization than existing approaches involving generative models. In addition, conditioning the generator network on the measur… ▽ More

    Submitted 8 July, 2020; v1 submitted 10 December, 2019; originally announced December 2019.

    Comments: Accepted at the 25th International Conference on Pattern Recognition 2020 (ICPR)

  26. arXiv:1906.03612  [pdf, other

    cs.LG cs.CR stat.ML

    On the Vulnerability of Capsule Networks to Adversarial Attacks

    Authors: Felix Michels, Tobias Uelwer, Eric Upschulte, Stefan Harmeling

    Abstract: This paper extensively evaluates the vulnerability of capsule networks to different adversarial attacks. Recent work suggests that these architectures are more robust towards adversarial attacks than other neural networks. However, our experiments show that capsule networks can be fooled as easily as convolutional neural networks.

    Submitted 9 June, 2019; originally announced June 2019.

    Comments: 5 pages

  27. arXiv:1902.01813  [pdf, other

    cs.LG stat.ML

    Modular Block-diagonal Curvature Approximations for Feedforward Architectures

    Authors: Felix Dangel, Stefan Harmeling, Philipp Hennig

    Abstract: We propose a modular extension of backpropagation for the computation of block-diagonal approximations to various curvature matrices of the training objective (in particular, the Hessian, generalized Gauss-Newton, and positive-curvature Hessian). The approach reduces the otherwise tedious manual derivation of these matrices into local modules, and is easy to integrate into existing machine learnin… ▽ More

    Submitted 28 February, 2020; v1 submitted 5 February, 2019; originally announced February 2019.

    Comments: 9 pages, 5 figures, 1 table, supplements included (13 pages, 6 figures, 2 tables)

  28. Improving Cytoarchitectonic Segmentation of Human Brain Areas with Self-supervised Siamese Networks

    Authors: Hannah Spitzer, Kai Kiwitz, Katrin Amunts, Stefan Harmeling, Timo Dickscheid

    Abstract: Cytoarchitectonic parcellations of the human brain serve as anatomical references in multimodal atlas frameworks. They are based on analysis of cell-body stained histological sections and the identification of borders between brain areas. The de-facto standard involves a semi-automatic, reproducible border detection, but does not scale with high-throughput imaging in large series of sections at mi… ▽ More

    Submitted 13 June, 2018; originally announced June 2018.

    Comments: Accepted at MICCAI 2018

  29. Parcellation of Visual Cortex on high-resolution histological Brain Sections using Convolutional Neural Networks

    Authors: Hannah Spitzer, Katrin Amunts, Stefan Harmeling, Timo Dickscheid

    Abstract: Microscopic analysis of histological sections is considered the "gold standard" to verify structural parcellations in the human brain. Its high resolution allows the study of laminar and columnar patterns of cell distributions, which build an important basis for the simulation of cortical areas and networks. However, such cytoarchitectonic mapping is a semiautomatic, time consuming process that do… ▽ More

    Submitted 30 May, 2017; originally announced May 2017.

    Comments: Accepted for oral presentation at International Symposium of Biomedical Imaging (ISBI) 2017

  30. arXiv:1406.7444  [pdf, other

    cs.CV cs.LG

    Learning to Deblur

    Authors: Christian J. Schuler, Michael Hirsch, Stefan Harmeling, Bernhard Schölkopf

    Abstract: We describe a learning-based approach to blind image deconvolution. It uses a deep layered architecture, parts of which are borrowed from recent work on neural network learning, and parts of which incorporate computations that are specific to image deconvolution. The system is trained end-to-end on a set of artificially generated training examples, enabling competitive performance in blind deconvo… ▽ More

    Submitted 28 June, 2014; originally announced June 2014.

  31. arXiv:1211.1552  [pdf, other

    cs.CV cs.LG

    Image denoising with multi-layer perceptrons, part 2: training trade-offs and analysis of their mechanisms

    Authors: Harold Christopher Burger, Christian J. Schuler, Stefan Harmeling

    Abstract: Image denoising can be described as the problem of mapping from a noisy image to a noise-free image. In another paper, we show that multi-layer perceptrons can achieve outstanding image denoising performance for various types of noise (additive white Gaussian noise, mixed Poisson-Gaussian noise, JPEG artifacts, salt-and-pepper noise and noise resembling stripes). In this work we discuss in detail… ▽ More

    Submitted 7 November, 2012; originally announced November 2012.

  32. arXiv:1211.1544  [pdf, other

    cs.CV cs.LG

    Image denoising with multi-layer perceptrons, part 1: comparison with existing algorithms and with bounds

    Authors: Harold Christopher Burger, Christian J. Schuler, Stefan Harmeling

    Abstract: Image denoising can be described as the problem of mapping from a noisy image to a noise-free image. The best currently available denoising methods approximate this mapping with cleverly engineered algorithms. In this work we attempt to learn this mapping directly with plain multi layer perceptrons (MLP) applied to image patches. We will show that by training on large image databases we are able t… ▽ More

    Submitted 9 November, 2012; v1 submitted 7 November, 2012; originally announced November 2012.

  33. arXiv:0912.1128  [pdf, ps, other

    stat.ML cs.LG

    How to Explain Individual Classification Decisions

    Authors: David Baehrens, Timon Schroeter, Stefan Harmeling, Motoaki Kawanabe, Katja Hansen, Klaus-Robert Mueller

    Abstract: After building a classifier with modern tools of machine learning we typically have a black box at hand that is able to predict well for unseen data. Thus, we get an answer to the question what is the most likely label of a given unseen data point. However, most methods will provide no answer why the model predicted the particular label for a single instance and what features were most influenti… ▽ More

    Submitted 6 December, 2009; originally announced December 2009.

    Comments: 31 pages, 14 figures