Zum Hauptinhalt springen

Showing 1–18 of 18 results for author: Argus, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.03921  [pdf, other

    cs.LG cs.CV

    Concept Bottleneck Models Without Predefined Concepts

    Authors: Simon Schrodi, Julian Schur, Max Argus, Thomas Brox

    Abstract: There has been considerable recent interest in interpretable concept-based models such as Concept Bottleneck Models (CBMs), which first predict human-interpretable concepts and then map them to output classes. To reduce reliance on human-annotated concepts, recent works have converted pretrained black-box models into interpretable CBMs post-hoc. However, these approaches predefine a set of concept… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  2. arXiv:2404.07983  [pdf, other

    cs.CV cs.LG

    Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Representation Learning

    Authors: Simon Schrodi, David T. Hoffmann, Max Argus, Volker Fischer, Thomas Brox

    Abstract: Contrastive vision-language models like CLIP have gained popularity for their versatile applicable learned representations in various downstream tasks. Despite their successes in some tasks, like zero-shot image recognition, they also perform surprisingly poor on other tasks, like attribute detection. Previous work has attributed these challenges to the modality gap, a separation of image and text… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  3. arXiv:2403.15203  [pdf, other

    cs.RO cs.CV

    DITTO: Demonstration Imitation by Trajectory Transformation

    Authors: Nick Heppert, Max Argus, Tim Welschehold, Thomas Brox, Abhinav Valada

    Abstract: Teaching robots new skills quickly and conveniently is crucial for the broader adoption of robotic systems. In this work, we address the problem of one-shot imitation from a single human demonstration, given by an RGB-D video recording through a two-stage process. In the first stage which is offline, we extract the trajectory of the demonstration. This entails segmenting manipulated objects and de… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 8 pages, 4 figures, 3 tables, submitted to IROS 2024

  4. arXiv:2310.06668  [pdf, other

    cs.LG cs.CV

    Latent Diffusion Counterfactual Explanations

    Authors: Karim Farid, Simon Schrodi, Max Argus, Thomas Brox

    Abstract: Counterfactual explanations have emerged as a promising method for elucidating the behavior of opaque black-box models. Recently, several works leveraged pixel-space diffusion models for counterfactual generation. To handle noisy, adversarial gradients during counterfactual generation -- causing unrealistic artifacts or mere adversarial perturbations -- they required either auxiliary adversarially… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  5. arXiv:2310.05691  [pdf, other

    cs.CV physics.ao-ph

    Climate-sensitive Urban Planning through Optimization of Tree Placements

    Authors: Simon Schrodi, Ferdinand Briegel, Max Argus, Andreas Christen, Thomas Brox

    Abstract: Climate change is increasing the intensity and frequency of many extreme weather events, including heatwaves, which results in increased thermal discomfort and mortality rates. While global mitigation action is undoubtedly necessary, so is climate adaptation, e.g., through climate-sensitive urban planning. Among the most promising strategies is harnessing the benefits of urban trees in shading and… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

  6. arXiv:2310.04271  [pdf, other

    cs.RO cs.CV

    Compositional Servoing by Recombining Demonstrations

    Authors: Max Argus, Abhijeet Nayak, Martin Büchner, Silvio Galesso, Abhinav Valada, Thomas Brox

    Abstract: Learning-based manipulation policies from image inputs often show weak task transfer capabilities. In contrast, visual servoing methods allow efficient task transfer in high-precision scenarios while requiring only a few demonstrations. In this work, we present a framework that formulates the visual servoing task as graph traversal. Our method not only extends the robustness of visual servoing, bu… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: http://compservo.cs.uni-freiburg.de

  7. arXiv:2211.06660  [pdf, other

    cs.CV

    Far Away in the Deep Space: Dense Nearest-Neighbor-Based Out-of-Distribution Detection

    Authors: Silvio Galesso, Max Argus, Thomas Brox

    Abstract: The key to out-of-distribution detection is density estimation of the in-distribution data or of its feature representations. This is particularly challenging for dense anomaly detection in domains where the in-distribution data has a complex underlying structure. Nearest-Neighbors approaches have been shown to work well in object-centric data domains, such as industrial inspection and image class… ▽ More

    Submitted 14 September, 2023; v1 submitted 12 November, 2022; originally announced November 2022.

    Comments: Workshop on Uncertainty Quantification for Computer Vision, ICCV 2023. Code at: https://github.com/silviogalesso/dense-ood-knns

  8. arXiv:2207.13591  [pdf, other

    cs.RO

    RobotIO: A Python Library for Robot Manipulation Experiments

    Authors: Lukas Hermann, Max Argus, Adrian Roefer, Abhinav Valada, Thomas Brox

    Abstract: Setting up robot environments to quickly test newly developed algorithms is still a difficult and time consuming process. This presents a significant hurdle to researchers interested in performing real-world robotic experiments. RobotIO is a python library designed to solve this problem. It focuses on providing common, simple, and well structured python interfaces for robots, grippers, and cameras… ▽ More

    Submitted 16 August, 2022; v1 submitted 27 July, 2022; originally announced July 2022.

    Comments: 6 pages, 3 figures

  9. arXiv:2205.08441  [pdf, other

    cs.RO cs.CV cs.LG

    Conditional Visual Servoing for Multi-Step Tasks

    Authors: Sergio Izquierdo, Max Argus, Thomas Brox

    Abstract: Visual Servoing has been effectively used to move a robot into specific target locations or to track a recorded demonstration. It does not require manual programming, but it is typically limited to settings where one demonstration maps to one environment state. We propose a modular approach to extend visual servoing to scenarios with multiple demonstration sequences. We call this conditional servo… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

  10. arXiv:2106.04324  [pdf, other

    cs.CV

    Contrastive Representation Learning for Hand Shape Estimation

    Authors: Christian Zimmermann, Max Argus, Thomas Brox

    Abstract: This work presents improvements in monocular hand shape estimation by building on top of recent advances in unsupervised learning. We extend momentum contrastive learning and contribute a structured collection of hand images, well suited for visual representation learning, which we call HanCo. We find that the representation learned by established contrastive learning methods can be improved signi… ▽ More

    Submitted 2 July, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

  11. arXiv:2104.14386  [pdf, other

    cs.LG cs.AI cs.RO

    Pre-training of Deep RL Agents for Improved Learning under Domain Randomization

    Authors: Artemij Amiranashvili, Max Argus, Lukas Hermann, Wolfram Burgard, Thomas Brox

    Abstract: Visual domain randomization in simulated environments is a widely used method to transfer policies trained in simulation to real robots. However, domain randomization and augmentation hamper the training of a policy. As reinforcement learning struggles with a noisy training signal, this additional nuisance can drastically impede training. For difficult tasks it can even result in complete failure… ▽ More

    Submitted 29 April, 2021; originally announced April 2021.

  12. arXiv:2007.00291  [pdf, other

    cs.RO cs.CV

    FlowControl: Optical Flow Based Visual Servoing

    Authors: Max Argus, Lukas Hermann, Jon Long, Thomas Brox

    Abstract: One-shot imitation is the vision of robot programming from a single demonstration, rather than by tedious construction of computer code. We present a practical method for realizing one-shot imitation for manipulation tasks, exploiting modern learning-based optical flow to perform real-time visual servoing. Our approach, which we call FlowControl, continuously tracks a demonstration video, using a… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

  13. arXiv:2004.01823  [pdf, other

    cs.CV cs.LG eess.IV

    Temporal Shift GAN for Large Scale Video Generation

    Authors: Andres Munoz, Mohammadreza Zolfaghari, Max Argus, Thomas Brox

    Abstract: Video generation models have become increasingly popular in the last few years, however the standard 2D architectures used today lack natural spatio-temporal modelling capabilities. In this paper, we present a network architecture for video generation that models spatio-temporal consistency without resorting to costly 3D architectures. The architecture facilitates information exchange between neig… ▽ More

    Submitted 10 November, 2020; v1 submitted 3 April, 2020; originally announced April 2020.

    Comments: 14 pages, 15 figures

    ACM Class: I.2.10

  14. arXiv:1910.07972  [pdf, other

    cs.RO cs.CV cs.LG

    Adaptive Curriculum Generation from Demonstrations for Sim-to-Real Visuomotor Control

    Authors: Lukas Hermann, Max Argus, Andreas Eitel, Artemij Amiranashvili, Wolfram Burgard, Thomas Brox

    Abstract: We propose Adaptive Curriculum Generation from Demonstrations (ACGD) for reinforcement learning in the presence of sparse rewards. Rather than designing shaped reward functions, ACGD adaptively sets the appropriate task difficulty for the learner by controlling where to sample from the demonstration trajectories and which set of simulation parameters to use. We show that training vision-based cont… ▽ More

    Submitted 8 July, 2020; v1 submitted 17 October, 2019; originally announced October 2019.

    Comments: Accepted at the 2020 IEEE International Conference on Robotics and Automation (ICRA). Project page see https://lmb.informatik.uni-freiburg.de/projects/curriculum/

  15. arXiv:1909.12047  [pdf, other

    eess.IV cs.CV cs.LG

    Function Follows Form: Regression from Complete Thoracic Computed Tomography Scans

    Authors: Max Argus, Cornelia Schaefer-Prokop, David A. Lynch, Bram van Ginneken

    Abstract: Chronic Obstructive Pulmonary Disease (COPD) is a leading cause of morbidity and mortality. While COPD diagnosis is based on lung function tests, early stages and progression of different aspects of the disease can be visible and quantitatively assessed on computed tomography (CT) scans. Many studies have been published that quantify imaging biomarkers related to COPD. In this paper we present a c… ▽ More

    Submitted 27 September, 2019; v1 submitted 26 September, 2019; originally announced September 2019.

  16. arXiv:1909.04349  [pdf, other

    cs.CV cs.LG cs.RO

    FreiHAND: A Dataset for Markerless Capture of Hand Pose and Shape from Single RGB Images

    Authors: Christian Zimmermann, Duygu Ceylan, Jimei Yang, Bryan Russell, Max Argus, Thomas Brox

    Abstract: Estimating 3D hand pose from single RGB images is a highly ambiguous problem that relies on an unbiased training dataset. In this paper, we analyze cross-dataset generalization when training on existing datasets. We find that approaches perform well on the datasets they are trained on, but do not generalize to other datasets or in-the-wild scenarios. As a consequence, we introduce the first large-… ▽ More

    Submitted 13 September, 2019; v1 submitted 10 September, 2019; originally announced September 2019.

    Comments: Accepted to ICCV 2019, Project page: https://lmb.informatik.uni-freiburg.de/projects/freihand/

  17. arXiv:1902.05605  [pdf, other

    cs.LG stat.ML

    CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity

    Authors: Aditya Bhatt, Daniel Palenicek, Boris Belousov, Max Argus, Artemij Amiranashvili, Thomas Brox, Jan Peters

    Abstract: Sample efficiency is a crucial problem in deep reinforcement learning. Recent algorithms, such as REDQ and DroQ, found a way to improve the sample efficiency by increasing the update-to-data (UTD) ratio to 20 gradient update steps on the critic per environment sample. However, this comes at the expense of a greatly increased computational cost. To reduce this computational burden, we introduce Cro… ▽ More

    Submitted 25 March, 2024; v1 submitted 14 February, 2019; originally announced February 2019.

    Comments: Published at ICLR 2024. Project page at http://aditya.bhatts.org/CrossQ and code release at https://github.com/adityab/CrossQ

  18. arXiv:1612.07307  [pdf, other

    cs.LG

    Loss is its own Reward: Self-Supervision for Reinforcement Learning

    Authors: Evan Shelhamer, Parsa Mahmoudieh, Max Argus, Trevor Darrell

    Abstract: Reinforcement learning optimizes policies for expected cumulative reward. Need the supervision be so narrow? Reward is delayed and sparse for many tasks, making it a difficult and impoverished signal for end-to-end optimization. To augment reward, we consider a range of self-supervised tasks that incorporate states, actions, and successors to provide auxiliary losses. These losses offer ubiquitous… ▽ More

    Submitted 9 March, 2017; v1 submitted 21 December, 2016; originally announced December 2016.