Showing 1–11 of 11 results for author: Enzweiler, M

  1. arXiv:2408.01716

    cs.RO cs.CV

    Visual-Inertial SLAM for Agricultural Robotics: Benchmarking the Benefits and Computational Costs of Loop Closing

    Authors: Fabian Schmidt, Constantin Blessing, Markus Enzweiler, Abhinav Valada

    Abstract: Simultaneous Localization and Mapping (SLAM) is essential for mobile robotics, enabling autonomous navigation in dynamic, unstructured outdoor environments without relying on external positioning systems. In agricultural applications, where environmental conditions can be particularly challenging due to variable lighting or weather conditions, Visual-Inertial SLAM has emerged as a potential soluti…

    Submitted 3 August, 2024; originally announced August 2024.

    Comments: 18 pages, 8 figures, 5 tables

  2. arXiv:2407.08277

    cs.CV

    StixelNExT: Toward Monocular Low-Weight Perception for Object Segmentation and Free Space Detection

    Authors: Marcel Vosshans, Omar Ait-Aider, Youcef Mezouar, Markus Enzweiler

    Abstract: In this work, we present a novel approach for general object segmentation from a monocular image, eliminating the need for manually labeled training data and enabling rapid, straightforward training and adaptation with minimal data. Our model initially learns from LiDAR during the training process, which is subsequently removed from the system, allowing it to function solely on monocular imagery.…

    Submitted 11 July, 2024; originally announced July 2024.

  3. arXiv:2407.08261

    cs.RO

    The OPNV Data Collection: A Dataset for Infrastructure-Supported Perception Research with Focus on Public Transportation

    Authors: Marcel Vosshans, Alexander Baumann, Matthias Drueppel, Omar Ait-Aider, Ralf Woerner, Youcef Mezouar, Thao Dang, Markus Enzweiler

    Abstract: In this paper, we present our vision and ongoing work for a novel dataset designed to advance research into the interoperability of intelligent vehicles and infrastructure, specifically aimed at enhancing cooperative perception and interaction in the realm of public transportation. Unlike conventional datasets centered on ego-vehicle data, this approach encompasses both a stationary sensor tower and a…

    Submitted 11 July, 2024; originally announced July 2024.

  4. arXiv:2406.06264

    cs.CV

    DualAD: Disentangling the Dynamic and Static World for End-to-End Driving

    Authors: Simon Doll, Niklas Hanselmann, Lukas Schneider, Richard Schulz, Marius Cordts, Markus Enzweiler, Hendrik P. A. Lensch

    Abstract: State-of-the-art approaches for autonomous driving integrate multiple sub-tasks of the overall driving task into a single pipeline that can be trained in an end-to-end fashion by passing latent representations between the different modules. In contrast to previous approaches that rely on a unified grid to represent the belief state of the scene, we propose dedicated representations to disentangle…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Accepted at CVPR 2024; Copyright 2024 IEEE; Project Website: https://simondoll.github.io/publications/dualad

  5. arXiv:2306.17602

    cs.CV cs.AI cs.RO

    S.T.A.R.-Track: Latent Motion Models for End-to-End 3D Object Tracking with Adaptive Spatio-Temporal Appearance Representations

    Authors: Simon Doll, Niklas Hanselmann, Lukas Schneider, Richard Schulz, Markus Enzweiler, Hendrik P. A. Lensch

    Abstract: Following the tracking-by-attention paradigm, this paper introduces an object-centric, transformer-based framework for tracking in 3D. Traditional model-based tracking approaches incorporate the geometric effect of object- and ego motion between frames with a geometric motion model. Inspired by this, we propose S.T.A.R.-Track, which uses a novel latent motion model (LMM) to additionally adjust obj…

    Submitted 22 December, 2023; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: © 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

    Journal ref: IEEE Robotics and Automation Letters, Vol. 9, No. 2 (2024), PP 1326-1333

  6. arXiv:2011.09141

    cs.CV cs.LG

    Semantic Scene Completion using Local Deep Implicit Functions on LiDAR Data

    Authors: Christoph B. Rist, David Emmerichs, Markus Enzweiler, Dariu M. Gavrila

    Abstract: Semantic scene completion is the task of jointly estimating 3D geometry and semantics of objects and surfaces within a given extent. This is a particularly challenging task on real-world data that is sparse and occluded. We propose a scene segmentation network based on local Deep Implicit Functions as a novel learning-based method for scene completion. Unlike previous work on scene completion, our…

    Submitted 12 April, 2021; v1 submitted 18 November, 2020; originally announced November 2020.

    ACM Class: I.4.8

  7. arXiv:1907.00787

    eess.IV cs.LG stat.ML

    CNN-based synthesis of realistic high-resolution LiDAR data

    Authors: Larissa T. Triess, David Peter, Christoph B. Rist, Markus Enzweiler, J. Marius Zöllner

    Abstract: This paper presents a novel CNN-based approach for synthesizing high-resolution LiDAR point cloud data. Our approach generates semantically and perceptually realistic results with guidance from specialized loss-functions. First, we utilize a modified per-point loss that addresses missing LiDAR point measurements. Second, we align the quality of our generated output with real-world sensor data by a…

    Submitted 24 September, 2021; v1 submitted 28 June, 2019; originally announced July 2019.

    Comments: Project Page: http://ltriess.github.io/pc-upsampling

    Journal ref: IEEE Intelligent Vehicles Symposium (IV), 2019, pp. 1512-1519

  8. arXiv:1809.08993

    cs.CV

    Improved Semantic Stixels via Multimodal Sensor Fusion

    Authors: Florian Piewak, Peter Pinggera, Markus Enzweiler, David Pfeiffer, Marius Zöllner

    Abstract: This paper presents a compact and accurate representation of 3D scenes that are observed by a LiDAR sensor and a monocular camera. The proposed method is based on the well-established Stixel model originally developed for stereo vision applications. We extend this Stixel concept to incorporate data from multiple sensor modalities. The resulting mid-level fusion scheme takes full advantage of the g…

    Submitted 27 September, 2018; v1 submitted 24 September, 2018; originally announced September 2018.

  9. arXiv:1804.09915

    cs.CV

    Boosting LiDAR-based Semantic Labeling by Cross-Modal Training Data Generation

    Authors: Florian Piewak, Peter Pinggera, Manuel Schäfer, David Peter, Beate Schwarz, Nick Schneider, David Pfeiffer, Markus Enzweiler, Marius Zöllner

    Abstract: Mobile robots and autonomous vehicles rely on multi-modal sensor setups to perceive and understand their surroundings. Aside from cameras, LiDAR sensors represent a central component of state-of-the-art perception systems. In addition to accurate spatial perception, a comprehensive semantic understanding of the environment is essential for efficient and safe operation. In this paper we present a n…

    Submitted 26 April, 2018; originally announced April 2018.

  10. The Stixel world: A medium-level representation of traffic scenes

    Authors: Marius Cordts, Timo Rehfeld, Lukas Schneider, David Pfeiffer, Markus Enzweiler, Stefan Roth, Marc Pollefeys, Uwe Franke

    Abstract: Recent progress in advanced driver assistance systems and the race towards autonomous vehicles is mainly driven by two factors: (1) increasingly sophisticated algorithms that interpret the environment around the vehicle and react accordingly, and (2) the continuous improvements of sensor technology itself. In terms of cameras, these improvements typically include higher spatial resolution, which a…

    Submitted 2 April, 2017; originally announced April 2017.

    Comments: Accepted for publication in Image and Vision Computing

  11. arXiv:1604.01685

    cs.CV

    The Cityscapes Dataset for Semantic Urban Scene Understanding

    Authors: Marius Cordts, Mohamed Omran, Sebastian Ramos, Timo Rehfeld, Markus Enzweiler, Rodrigo Benenson, Uwe Franke, Stefan Roth, Bernt Schiele

    Abstract: Visual understanding of complex urban street scenes is an enabling factor for a wide range of applications. Object detection has benefited enormously from large-scale datasets, especially in the context of deep learning. For semantic urban scene understanding, however, no current dataset adequately captures the complexity of real-world urban scenes. To address this, we introduce Cityscapes, a be…

    Submitted 7 April, 2016; v1 submitted 6 April, 2016; originally announced April 2016.

    Comments: Includes supplemental material