Showing 1–31 of 31 results for author: Gehler, P

  1. arXiv:2209.11459  [pdf, other]

    cs.CV cs.LG

    TeST: Test-time Self-Training under Distribution Shift

    Authors: Samarth Sinha, Peter Gehler, Francesco Locatello, Bernt Schiele

    Abstract: Despite their recent success, deep neural networks continue to perform poorly when they encounter distribution shifts at test time. Many recently proposed approaches try to counter this by aligning the model to the new distribution prior to inference. With no labels available this requires unsupervised objectives to adapt the model on the observed test data. In this paper, we propose Test-Time Sel…

    Submitted 23 September, 2022; originally announced September 2022.

    Journal ref: WACV 2023

  2. arXiv:2207.09239  [pdf, other]

    cs.LG stat.ML

    Assaying Out-Of-Distribution Generalization in Transfer Learning

    Authors: Florian Wenzel, Andrea Dittadi, Peter Vincent Gehler, Carl-Johann Simon-Gabriel, Max Horn, Dominik Zietlow, David Kernert, Chris Russell, Thomas Brox, Bernt Schiele, Bernhard Schölkopf, Francesco Locatello

    Abstract: Since out-of-distribution generalization is a generally ill-posed problem, various proxy targets (e.g., calibration, adversarial robustness, algorithmic corruptions, invariance across shifts) were studied across different research programs resulting in different recommendations. While sharing the same aspirational goal, these approaches have never been tested under the same experimental conditions…

    Submitted 21 October, 2022; v1 submitted 19 July, 2022; originally announced July 2022.

  3. arXiv:2110.06562  [pdf, other]

    cs.CV cs.LG stat.ML

    Unsupervised Object Learning via Common Fate

    Authors: Matthias Tangemann, Steffen Schneider, Julius von Kügelgen, Francesco Locatello, Peter Gehler, Thomas Brox, Matthias Kümmerer, Matthias Bethge, Bernhard Schölkopf

    Abstract: Learning generative object models from unlabelled videos is a long-standing problem and required for causal scene modeling. We decompose this problem into three easier subtasks, and provide candidate solutions for each of them. Inspired by the Common Fate Principle of Gestalt Psychology, we first extract (noisy) masks of moving objects via unsupervised motion segmentation. Second, generative model…

    Submitted 15 May, 2023; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: Published at CLeaR 2023

  4. arXiv:2110.06399  [pdf, other]

    cs.LG cs.CV

    Dynamic Inference with Neural Interpreters

    Authors: Nasim Rahaman, Muhammad Waleed Gondal, Shruti Joshi, Peter Gehler, Yoshua Bengio, Francesco Locatello, Bernhard Schölkopf

    Abstract: Modern neural network architectures can leverage large amounts of data to generalize well within the training distribution. However, they are less capable of systematic generalization to data drawn from unseen but related distributions, a feat that is hypothesized to require compositional reasoning and reuse of knowledge. In this work, we present Neural Interpreters, an architecture that factorize…

    Submitted 12 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021

  5. arXiv:2110.05304  [pdf, other]

    cs.LG

    You Mostly Walk Alone: Analyzing Feature Attribution in Trajectory Prediction

    Authors: Osama Makansi, Julius von Kügelgen, Francesco Locatello, Peter Gehler, Dominik Janzing, Thomas Brox, Bernhard Schölkopf

    Abstract: Predicting the future trajectory of a moving agent can be easy when the past trajectory continues smoothly but is challenging when complex interactions with other agents are involved. Recent deep learning approaches for trajectory prediction show promising performance and partially attribute this to successful reasoning about agent-agent interactions. However, it remains unclear which features suc…

    Submitted 11 October, 2021; originally announced October 2021.

  6. arXiv:2109.14910  [pdf, other]

    cs.CV cs.AI cs.LG

    CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations

    Authors: Mohammadreza Zolfaghari, Yi Zhu, Peter Gehler, Thomas Brox

    Abstract: Contrastive learning allows us to flexibly define powerful losses by contrasting positive pairs from sets of negative samples. Recently, the principle has also been used to learn cross-modal embeddings for video and text, yet without exploiting its full potential. In particular, previous losses do not take the intra-modality similarities into account, which leads to inefficient embeddings, as the…

    Submitted 30 September, 2021; originally announced September 2021.

    Comments: ICCV 2021, 14 pages, 13 figures

  7. arXiv:2107.08221  [pdf, other]

    cs.LG cs.CV

    Visual Representation Learning Does Not Generalize Strongly Within the Same Domain

    Authors: Lukas Schott, Julius von Kügelgen, Frederik Träuble, Peter Gehler, Chris Russell, Matthias Bethge, Bernhard Schölkopf, Francesco Locatello, Wieland Brendel

    Abstract: An important component for generalization in machine learning is to uncover underlying latent factors of variation as well as the mechanism through which each factor acts in the world. In this paper, we test whether 17 unsupervised, weakly supervised, and fully supervised representation learning approaches correctly infer the generative factors of variation in simple datasets (dSprites, Shapes3D,…

    Submitted 12 February, 2022; v1 submitted 17 July, 2021; originally announced July 2021.

  8. arXiv:2107.05686  [pdf, other]

    cs.LG stat.ML

    The Role of Pretrained Representations for the OOD Generalization of Reinforcement Learning Agents

    Authors: Andrea Dittadi, Frederik Träuble, Manuel Wüthrich, Felix Widmaier, Peter Gehler, Ole Winther, Francesco Locatello, Olivier Bachem, Bernhard Schölkopf, Stefan Bauer

    Abstract: Building sample-efficient agents that generalize out-of-distribution (OOD) in real-world settings remains a fundamental unsolved problem on the path towards achieving higher-level cognition. One particularly promising approach is to begin with low-dimensional, pretrained representations of our world, which should facilitate efficient downstream learning and generalization. By training 240 represen…

    Submitted 16 April, 2022; v1 submitted 12 July, 2021; originally announced July 2021.

    Comments: Published at ICLR 2022

  9. arXiv:2107.01057  [pdf, other]

    cs.LG cs.AI

    Backward-Compatible Prediction Updates: A Probabilistic Approach

    Authors: Frederik Träuble, Julius von Kügelgen, Matthäus Kleindessner, Francesco Locatello, Bernhard Schölkopf, Peter Gehler

    Abstract: When machine learning systems meet real world applications, accuracy is only one of several requirements. In this paper, we assay a complementary perspective originating from the increasing availability of pre-trained and regularly improving state-of-the-art models. While new improved models develop at a fast pace, downstream tasks vary more slowly or stay constant. Assume that we have a large unl…

    Submitted 2 July, 2021; originally announced July 2021.

  10. arXiv:2106.08265  [pdf, other]

    cs.CV

    Towards Total Recall in Industrial Anomaly Detection

    Authors: Karsten Roth, Latha Pemula, Joaquin Zepeda, Bernhard Schölkopf, Thomas Brox, Peter Gehler

    Abstract: Being able to spot defective parts is a critical component in large-scale industrial manufacturing. A particular challenge that we address in this work is the cold-start problem: fit a model using nominal (non-defective) example images only. While handcrafted solutions per class are possible, the goal is to build systems that work well simultaneously on many different tasks automatically. The best…

    Submitted 5 May, 2022; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: Accepted to CVPR 2022

  11. arXiv:2104.12928  [pdf, other]

    cs.CV cs.LG

    If your data distribution shifts, use self-learning

    Authors: Evgenia Rusak, Steffen Schneider, George Pachitariu, Luisa Eck, Peter Gehler, Oliver Bringmann, Wieland Brendel, Matthias Bethge

    Abstract: We demonstrate that self-learning techniques like entropy minimization and pseudo-labeling are simple and effective at improving performance of a deployed computer vision model under systematic domain shifts. We conduct a wide range of large-scale experiments and show consistent improvements irrespective of the model architecture, the pre-training technique or the type of distribution shift. At th…

    Submitted 7 December, 2023; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: Web: https://domainadaptation.org/selflearning

  12. arXiv:2004.12906  [pdf, other]

    stat.ML cs.CV cs.LG

    Towards causal generative scene models via competition of experts

    Authors: Julius von Kügelgen, Ivan Ustyuzhaninov, Peter Gehler, Matthias Bethge, Bernhard Schölkopf

    Abstract: Learning how to model complex scenes in a modular way with recombinable components is a pre-requisite for higher-order reasoning and acting in the physical world. However, current generative models lack the ability to capture the inherently compositional and layered nature of visual scenes. While recent work has made progress towards unsupervised learning of object-based scene representations, mos…

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: Presented at the ICLR 2020 workshop "Causal learning for decision making"

  13. arXiv:1909.03677  [pdf, other]

    cs.CV

    Learning Task-Specific Generalized Convolutions in the Permutohedral Lattice

    Authors: Anne S. Wannenwetsch, Martin Kiefel, Peter V. Gehler, Stefan Roth

    Abstract: Dense prediction tasks typically employ encoder-decoder architectures, but the prevalent convolutions in the decoder are not image-adaptive and can lead to boundary artifacts. Different generalized convolution operations have been introduced to counteract this. We go beyond these by leveraging guidance data to redefine their inherent notion of proximity. Our proposed network layer builds on the pe…

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: To appear at GCPR 2019

  14. arXiv:1808.05942  [pdf, other]

    cs.CV

    Neural Body Fitting: Unifying Deep Learning and Model-Based Human Pose and Shape Estimation

    Authors: Mohamed Omran, Christoph Lassner, Gerard Pons-Moll, Peter V. Gehler, Bernt Schiele

    Abstract: Direct prediction of 3D body pose and shape remains a challenge even for highly parameterized deep learning models. Mapping from the 2D image space to the prediction space is difficult: perspective ambiguities make the loss function noisy and training data is scarce. In this paper, we propose a novel approach (Neural Body Fitting (NBF)). It integrates a statistical body model within a CNN, leverag…

    Submitted 17 August, 2018; originally announced August 2018.

    Comments: 3DV 2018

  15. Rehabilitating the ColorChecker Dataset for Illuminant Estimation

    Authors: Ghalia Hemrit, Graham D. Finlayson, Arjan Gijsenij, Peter Gehler, Simone Bianco, Brian Funt, Mark Drew, Lilong Shi

    Abstract: In a previous work, it was shown that there is a curious problem with the benchmark ColorChecker dataset for illuminant estimation. To wit, this dataset has at least 3 different sets of ground-truths. Typically, for a single algorithm a single ground-truth is used. But then different algorithms, whose performance is measured with respect to different ground-truths, are compared against each other…

    Submitted 17 September, 2018; v1 submitted 30 May, 2018; originally announced May 2018.

    Comments: 4 pages, 3 figures, 2 tables, Proceedings of the 26th Color and Imaging Conference

    Journal ref: Color and Imaging Conference, 2018

  16. arXiv:1805.03430  [pdf, other]

    cs.CV

    Deep Directional Statistics: Pose Estimation with Uncertainty Quantification

    Authors: Sergey Prokudin, Peter Gehler, Sebastian Nowozin

    Abstract: Modern deep learning systems successfully solve many perception tasks such as object pose estimation when the input image is of high quality. However, in challenging imaging conditions such as on low-resolution images or when the image is corrupted by imaging artifacts, current systems degrade considerably in accuracy. While a loss in performance is unavoidable, we would like our models to quantif…

    Submitted 9 May, 2018; originally announced May 2018.

  17. arXiv:1708.03088  [pdf, other]

    cs.CV

    Semantic Video CNNs through Representation Warping

    Authors: Raghudeep Gadde, Varun Jampani, Peter V. Gehler

    Abstract: In this work, we propose a technique to convert CNN models for semantic segmentation of static images into CNNs for video data. We describe a warping method that can be used to augment existing architectures with very little extra computational cost. This module is called NetWarp and we demonstrate its use for a range of network architectures. The main design principle is to use optical flow of ad…

    Submitted 10 August, 2017; originally announced August 2017.

    Comments: ICCV 2017

  18. arXiv:1707.07548  [pdf, other]

    cs.CV

    Towards Accurate Markerless Human Shape and Pose Estimation over Time

    Authors: Yinghao Huang, Federica Bogo, Christoph Lassner, Angjoo Kanazawa, Peter V. Gehler, Ijaz Akhter, Michael J. Black

    Abstract: Existing marker-less motion capture methods often assume known backgrounds, static cameras, and sequence-specific motion priors, which narrows their application scenarios. Here we propose a fully automatic method that, given multi-view video, estimates 3D human motion and body shape. We take the recent SMPLify (Bogo et al., 2016) as the base method, and extend it in several ways. First we fit the body to…

    Submitted 30 April, 2018; v1 submitted 24 July, 2017; originally announced July 2017.

    Comments: 10 pages, 6 figures, 5 tables, published in 3DV-2017

  19. arXiv:1705.04098  [pdf, other]

    cs.CV

    A Generative Model of People in Clothing

    Authors: Christoph Lassner, Gerard Pons-Moll, Peter V. Gehler

    Abstract: We present the first image-based generative model of people in clothing for the full body. We sidestep the commonly used complex graphics rendering pipeline and the need for high-quality 3D scans of dressed people. Instead, we learn generative models from a large image database. The main challenge is to cope with the high variance in human pose, shape and appearance. For this reason, pure image-ba…

    Submitted 31 July, 2017; v1 submitted 11 May, 2017; originally announced May 2017.

  20. arXiv:1701.02468  [pdf, other]

    cs.CV

    Unite the People: Closing the Loop Between 3D and 2D Human Representations

    Authors: Christoph Lassner, Javier Romero, Martin Kiefel, Federica Bogo, Michael J. Black, Peter V. Gehler

    Abstract: 3D models provide a common ground for different representations of human bodies. In turn, robust 2D estimation has proven to be a powerful tool to obtain 3D fits "in-the-wild". However, depending on the level of detail, it can be hard to impossible to acquire labeled data for training 2D estimators on large scale. We propose a hybrid approach to this problem: with an extended version of the recen…

    Submitted 24 July, 2017; v1 submitted 10 January, 2017; originally announced January 2017.

  21. arXiv:1612.05478  [pdf, other]

    cs.CV

    Video Propagation Networks

    Authors: Varun Jampani, Raghudeep Gadde, Peter V. Gehler

    Abstract: We propose a technique that propagates information forward through video data. The method is conceptually simple and can be applied to tasks that require the propagation of structured information, such as semantic labels, based on video content. We propose a 'Video Propagation Network' that processes video frames in an adaptive manner. The model is applied online: it propagates information forward…

    Submitted 11 April, 2017; v1 submitted 16 December, 2016; originally announced December 2016.

    Comments: Appearing in Computer Vision and Pattern Recognition, 2017 (CVPR'17)

  22. arXiv:1612.05062  [pdf, other]

    cs.CV

    Reflectance Adaptive Filtering Improves Intrinsic Image Estimation

    Authors: Thomas Nestmeyer, Peter V. Gehler

    Abstract: Separating an image into reflectance and shading layers poses a challenge for learning approaches because no large corpus of precise and realistic ground truth decompositions exists. The Intrinsic Images in the Wild (IIW) dataset provides a sparse set of relative human reflectance judgments, which serves as a standard benchmark for intrinsic images. A number of methods use IIW to learn statistical…

    Submitted 12 June, 2017; v1 submitted 15 December, 2016; originally announced December 2016.

    Comments: CVPR 2017

  23. arXiv:1607.08128  [pdf, other]

    cs.CV

    Keep it SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image

    Authors: Federica Bogo, Angjoo Kanazawa, Christoph Lassner, Peter Gehler, Javier Romero, Michael J. Black

    Abstract: We describe the first method to automatically estimate the 3D pose of the human body as well as its 3D shape from a single unconstrained image. We estimate a full 3D mesh and show that 2D joints alone carry a surprising amount of information about body shape. The problem is challenging because of the complexity of the human body, articulation, occlusion, clothing, lighting, and the inherent ambigu…

    Submitted 27 July, 2016; originally announced July 2016.

    Comments: To appear in ECCV 2016

  24. arXiv:1606.06437  [pdf, other]

    cs.CV

    Efficient 2D and 3D Facade Segmentation using Auto-Context

    Authors: Raghudeep Gadde, Varun Jampani, Renaud Marlet, Peter V. Gehler

    Abstract: This paper introduces a fast and efficient segmentation technique for 2D images and 3D point clouds of building facades. Facades of buildings are highly structured and consequently most methods that have been proposed for this problem aim to make use of this strong prior information. Contrary to most prior work, we are describing a system that is almost domain independent and consists of standard…

    Submitted 21 June, 2016; originally announced June 2016.

    Comments: 8 pages

  25. arXiv:1511.06739  [pdf, other]

    cs.CV

    Superpixel Convolutional Networks using Bilateral Inceptions

    Authors: Raghudeep Gadde, Varun Jampani, Martin Kiefel, Daniel Kappler, Peter V. Gehler

    Abstract: In this paper we propose a CNN architecture for semantic image segmentation. We introduce a new 'bilateral inception' module that can be inserted in existing CNN architectures and performs bilateral filtering, at multiple feature-scales, between superpixels in an image. The feature spaces for bilateral filtering and other parameters of the module are learned end-to-end using standard backpropagati…

    Submitted 8 August, 2016; v1 submitted 20 November, 2015; originally announced November 2015.

    Comments: European Conference on Computer Vision (ECCV), 2016

    ACM Class: I.2.10; I.2.6

  26. arXiv:1511.06645  [pdf, other]

    cs.CV

    DeepCut: Joint Subset Partition and Labeling for Multi Person Pose Estimation

    Authors: Leonid Pishchulin, Eldar Insafutdinov, Siyu Tang, Bjoern Andres, Mykhaylo Andriluka, Peter Gehler, Bernt Schiele

    Abstract: This paper considers the task of articulated human pose estimation of multiple people in real world images. We propose an approach that jointly solves the tasks of detection and pose estimation: it infers the number of persons in a scene, identifies occluded body parts, and disambiguates body parts between people in close proximity of each other. This joint formulation is in contrast to previous s…

    Submitted 26 April, 2016; v1 submitted 20 November, 2015; originally announced November 2015.

    Comments: Accepted at IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2016)

  27. arXiv:1503.05038  [pdf, other]

    cs.CV

    3D Object Class Detection in the Wild

    Authors: Bojan Pepik, Michael Stark, Peter Gehler, Tobias Ritschel, Bernt Schiele

    Abstract: Object class detection has been a synonym for 2D bounding box localization for the longest time, fueled by the success of powerful statistical learning techniques, combined with robust image representations. Only recently, there has been a growing interest in revisiting the promise of computer vision from the early days: to precisely delineate the contents of a visual scene, object by object, in 3…

    Submitted 17 March, 2015; originally announced March 2015.

  28. arXiv:1503.04949  [pdf, other]

    cs.CV

    Learning Sparse High Dimensional Filters: Image Filtering, Dense CRFs and Bilateral Neural Networks

    Authors: Varun Jampani, Martin Kiefel, Peter V. Gehler

    Abstract: Bilateral filters have widespread use due to their edge-preserving properties. The common use case is to manually choose a parametric filter type, usually a Gaussian filter. In this paper, we will generalize the parametrization and in particular derive a gradient descent algorithm so the filter parameters can be learned from data. This derivation allows us to learn high dimensional linear filters th…

    Submitted 25 November, 2015; v1 submitted 17 March, 2015; originally announced March 2015.

  29. arXiv:1412.6618  [pdf, ps, other]

    cs.CV cs.LG cs.NE

    Permutohedral Lattice CNNs

    Authors: Martin Kiefel, Varun Jampani, Peter V. Gehler

    Abstract: This paper presents a convolutional layer that is able to process sparse input features. As an example, for image recognition problems this allows an efficient filtering of signals that do not lie on a dense grid (like pixel position), but of more general features (such as color values). The presented algorithm makes use of the permutohedral lattice data structure. The permutohedral lattice was in…

    Submitted 3 May, 2015; v1 submitted 20 December, 2014; originally announced December 2014.

  30. arXiv:1402.0859  [pdf, other]

    cs.CV cs.LG stat.ML

    The Informed Sampler: A Discriminative Approach to Bayesian Inference in Generative Computer Vision Models

    Authors: Varun Jampani, Sebastian Nowozin, Matthew Loper, Peter V. Gehler

    Abstract: Computer vision is hard because of a large variability in lighting, shape, and texture; in addition the image signal is non-additive due to occlusion. Generative models promised to account for this variability by accurately modelling the image formation process as a function of latent variables with prior beliefs. Bayesian posterior inference could then, in principle, explain the observation. Whil…

    Submitted 7 March, 2015; v1 submitted 4 February, 2014; originally announced February 2014.

    Comments: Appearing in Computer Vision and Image Understanding Journal (Special Issue on Generative Models in Computer Vision)

  31. arXiv:1312.6095  [pdf, other]

    cs.CV

    Multi-View Priors for Learning Detectors from Sparse Viewpoint Data

    Authors: Bojan Pepik, Michael Stark, Peter Gehler, Bernt Schiele

    Abstract: While the majority of today's object class models provide only 2D bounding boxes, far richer output hypotheses are desirable including viewpoint, fine-grained category, and 3D geometry estimate. However, models trained to provide richer output require larger amounts of training data, preferably well covering the relevant aspects such as viewpoint and fine-grained categories. In this paper, we addr…

    Submitted 16 February, 2014; v1 submitted 20 December, 2013; originally announced December 2013.

    Comments: 13 pages, 7 figures, 4 tables, International Conference on Learning Representations 2014