Zum Hauptinhalt springen

Showing 1–22 of 22 results for author: Benenson, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2210.14142  [pdf, other

    cs.CV

    From colouring-in to pointillism: revisiting semantic segmentation supervision

    Authors: Rodrigo Benenson, Vittorio Ferrari

    Abstract: The prevailing paradigm for producing semantic segmentation training data relies on densely labelling each pixel of each image in the training set, akin to colouring-in books. This approach becomes a bottleneck when scaling up in the number of images, classes, and annotators. Here we propose instead a pointillist approach for semantic segmentation annotation, where only point-wise yes/no questions… ▽ More

    Submitted 17 November, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: Open Images V7 available at https://g.co/dataset/open-images

  2. arXiv:1903.10830  [pdf, other

    cs.CV

    Large-scale interactive object segmentation with human annotators

    Authors: Rodrigo Benenson, Stefan Popov, Vittorio Ferrari

    Abstract: Manually annotating object segmentation masks is very time consuming. Interactive object segmentation methods offer a more efficient alternative where a human annotator and a machine segmentation model collaborate. In this paper we make several contributions to interactive segmentation: (1) we systematically explore in simulation the design space of deep interactive segmentation models and report… ▽ More

    Submitted 17 April, 2019; v1 submitted 26 March, 2019; originally announced March 2019.

    Comments: Accepted at CVPR2019

  3. Person Recognition in Personal Photo Collections

    Authors: Seong Joon Oh, Rodrigo Benenson, Mario Fritz, Bernt Schiele

    Abstract: People nowadays share large parts of their personal lives through social media. Being able to automatically recognise people in personal photos may greatly enhance user convenience by easing photo album organisation. For human identification task, however, traditional focus of computer vision has been face recognition and pedestrian re-identification. Person recognition in social media photos sets… ▽ More

    Submitted 20 October, 2018; v1 submitted 9 October, 2017; originally announced October 2017.

    Comments: 18 pages, 20 figures; to appear in IEEE Transactions on Pattern Analysis and Machine Intelligence

  4. arXiv:1705.02950  [pdf, other

    cs.CV

    Learning non-maximum suppression

    Authors: Jan Hosang, Rodrigo Benenson, Bernt Schiele

    Abstract: Object detectors have hugely profited from moving towards an end-to-end learning paradigm: proposals, features, and the classifier becoming one neural network improved results two-fold on general object detection. One indispensable component is non-maximum suppression (NMS), a post-processing algorithm responsible for merging all detections that belong to the same object. The de facto standard NMS… ▽ More

    Submitted 9 May, 2017; v1 submitted 8 May, 2017; originally announced May 2017.

    Comments: Added "Supplementary material" title

  5. arXiv:1703.09554  [pdf, other

    cs.CV

    Lucid Data Dreaming for Video Object Segmentation

    Authors: Anna Khoreva, Rodrigo Benenson, Eddy Ilg, Thomas Brox, Bernt Schiele

    Abstract: Convolutional networks reach top quality in pixel-level video object segmentation but require a large amount of training data (1k~100k) to deliver such results. We propose a new training strategy which achieves state-of-the-art results across three evaluation datasets while using 20x~1000x less annotated data than competing methods. Our approach is suitable for both single and multiple object segm… ▽ More

    Submitted 13 March, 2019; v1 submitted 28 March, 2017; originally announced March 2017.

    Comments: Accepted in International Journal of Computer Vision (IJCV)

  6. arXiv:1702.05693  [pdf, other

    cs.CV

    CityPersons: A Diverse Dataset for Pedestrian Detection

    Authors: Shanshan Zhang, Rodrigo Benenson, Bernt Schiele

    Abstract: Convnets have enabled significant progress in pedestrian detection recently, but there are still open questions regarding suitable architectures and training data. We revisit CNN design and point out key adaptations, enabling plain FasterRCNN to obtain state-of-the-art results on the Caltech dataset. To achieve further improvement from more and better data, we introduce CityPersons, a new set of… ▽ More

    Submitted 18 February, 2017; originally announced February 2017.

  7. arXiv:1701.08261  [pdf, other

    cs.CV

    Exploiting saliency for object segmentation from image level labels

    Authors: Seong Joon Oh, Rodrigo Benenson, Anna Khoreva, Zeynep Akata, Mario Fritz, Bernt Schiele

    Abstract: There have been remarkable improvements in the semantic labelling task in the recent years. However, the state of the art methods rely on large-scale pixel-level annotations. This paper studies the problem of training a pixel-wise semantic labeller network from image-level annotations of the present object classes. Recently, it has been shown that high quality seeds indicating discriminative objec… ▽ More

    Submitted 14 July, 2017; v1 submitted 28 January, 2017; originally announced January 2017.

    Comments: CVPR 2017

  8. arXiv:1612.02646  [pdf, other

    cs.CV

    Learning Video Object Segmentation from Static Images

    Authors: Anna Khoreva, Federico Perazzi, Rodrigo Benenson, Bernt Schiele, Alexander Sorkine-Hornung

    Abstract: Inspired by recent advances of deep learning in instance segmentation and object tracking, we introduce video object segmentation problem as a concept of guided instance segmentation. Our model proceeds on a per-frame basis, guided by the output of the previous frame towards the object of interest in the next frame. We demonstrate that highly accurate object segmentation in videos can be enabled b… ▽ More

    Submitted 8 December, 2016; originally announced December 2016.

    Comments: Submitted to CVPR 2017

  9. arXiv:1607.08438  [pdf, other

    cs.CV cs.AI cs.CR

    Faceless Person Recognition; Privacy Implications in Social Media

    Authors: Seong Joon Oh, Rodrigo Benenson, Mario Fritz, Bernt Schiele

    Abstract: As we shift more of our lives into the virtual domain, the volume of data shared on the web keeps increasing and presents a threat to our privacy. This works contributes to the understanding of privacy implications of such data sharing by analysing how well people are recognisable in social media data. To facilitate a systematic study we define a number of scenarios considering factors such as how… ▽ More

    Submitted 28 July, 2016; originally announced July 2016.

    Comments: Accepted to ECCV'16

  10. arXiv:1605.03718  [pdf, other

    cs.CV

    Improved Image Boundaries for Better Video Segmentation

    Authors: Anna Khoreva, Rodrigo Benenson, Fabio Galasso, Matthias Hein, Bernt Schiele

    Abstract: Graph-based video segmentation methods rely on superpixels as starting point. While most previous work has focused on the construction of the graph edges and weights as well as solving the graph partitioning problem, this paper focuses on better superpixels for video segmentation. We demonstrate by a comparative analysis that superpixels extracted from boundaries perform best, and show that bounda… ▽ More

    Submitted 23 November, 2016; v1 submitted 12 May, 2016; originally announced May 2016.

  11. arXiv:1604.01685  [pdf, other

    cs.CV

    The Cityscapes Dataset for Semantic Urban Scene Understanding

    Authors: Marius Cordts, Mohamed Omran, Sebastian Ramos, Timo Rehfeld, Markus Enzweiler, Rodrigo Benenson, Uwe Franke, Stefan Roth, Bernt Schiele

    Abstract: Visual understanding of complex urban street scenes is an enabling factor for a wide range of applications. Object detection has benefited enormously from large-scale datasets, especially in the context of deep learning. For semantic urban scene understanding, however, no current dataset adequately captures the complexity of real-world urban scenes. To address this, we introduce Cityscapes, a be… ▽ More

    Submitted 7 April, 2016; v1 submitted 6 April, 2016; originally announced April 2016.

    Comments: Includes supplemental material

  12. arXiv:1603.07485  [pdf, other

    cs.CV

    Simple Does It: Weakly Supervised Instance and Semantic Segmentation

    Authors: Anna Khoreva, Rodrigo Benenson, Jan Hosang, Matthias Hein, Bernt Schiele

    Abstract: Semantic labelling and instance segmentation are two tasks that require particularly costly annotations. Starting from weak supervision in the form of bounding box detection annotations, we propose a new approach that does not require modification of the segmentation training procedure. We show that when carefully designing the input labels from given bounding boxes, even a single round of trainin… ▽ More

    Submitted 23 November, 2016; v1 submitted 24 March, 2016; originally announced March 2016.

  13. arXiv:1602.01237  [pdf, other

    cs.CV

    How Far are We from Solving Pedestrian Detection?

    Authors: Shanshan Zhang, Rodrigo Benenson, Mohamed Omran, Jan Hosang, Bernt Schiele

    Abstract: Encouraged by the recent progress in pedestrian detection, we investigate the gap between current state-of-the-art methods and the "perfect single frame detector". We enable our analysis by creating a human baseline for pedestrian detection (over the Caltech dataset), and by manually clustering the recurrent errors of a top detector. Our results characterize both localization and background-versus… ▽ More

    Submitted 21 June, 2016; v1 submitted 3 February, 2016; originally announced February 2016.

    Comments: CVPR16 camera ready

  14. arXiv:1511.07803  [pdf, other

    cs.CV

    Weakly Supervised Object Boundaries

    Authors: Anna Khoreva, Rodrigo Benenson, Mohamed Omran, Matthias Hein, Bernt Schiele

    Abstract: State-of-the-art learning based boundary detection methods require extensive training data. Since labelling object boundaries is one of the most expensive types of annotations, there is a need to relax the requirement to carefully annotate images to make both the training more affordable and to extend the amount of training data. In this paper we propose a technique to generate weakly supervised a… ▽ More

    Submitted 24 November, 2015; originally announced November 2015.

  15. arXiv:1511.06437  [pdf, other

    cs.CV cs.LG

    A convnet for non-maximum suppression

    Authors: Jan Hosang, Rodrigo Benenson, Bernt Schiele

    Abstract: Non-maximum suppression (NMS) is used in virtually all state-of-the-art object detection pipelines. While essential object detection ingredients such as features, classifiers, and proposal methods have been extensively researched surprisingly little work has aimed to systematically address NMS. The de-facto standard for NMS is based on greedy clustering with a fixed distance threshold, which force… ▽ More

    Submitted 7 January, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: Included comments from reviewers

  16. arXiv:1509.03502  [pdf, other

    cs.CV

    Person Recognition in Personal Photo Collections

    Authors: Seong Joon Oh, Rodrigo Benenson, Mario Fritz, Bernt Schiele

    Abstract: Recognising persons in everyday photos presents major challenges (occluded faces, different clothing, locations, etc.) for machine vision. We propose a convnet based person recognition system on which we provide an in-depth analysis of informativeness of different body cues, impact of training data, and the common failure modes of the system. In addition, we discuss the limitations of existing ben… ▽ More

    Submitted 25 September, 2015; v1 submitted 11 September, 2015; originally announced September 2015.

    Comments: Accepted to ICCV 2015, revised

  17. arXiv:1508.02844  [pdf, other

    cs.CV

    What is Holding Back Convnets for Detection?

    Authors: Bojan Pepik, Rodrigo Benenson, Tobias Ritschel, Bernt Schiele

    Abstract: Convolutional neural networks have recently shown excellent results in general object detection and many other tasks. Albeit very effective, they involve many user-defined design choices. In this paper we want to better understand these choices by inspecting two key aspects "what did the network learn?", and "what can the network learn?". We exploit new annotations (Pascal3D+), to enable a new emp… ▽ More

    Submitted 18 August, 2015; v1 submitted 12 August, 2015; originally announced August 2015.

  18. What makes for effective detection proposals?

    Authors: Jan Hosang, Rodrigo Benenson, Piotr Dollár, Bernt Schiele

    Abstract: Current top performing object detectors employ detection proposals to guide the search for objects, thereby avoiding exhaustive sliding window search across images. Despite the popularity and widespread use of detection proposals, it is unclear which trade-offs are made when using them during object detection. We provide an in-depth analysis of twelve proposal methods along with four baselines reg… ▽ More

    Submitted 1 August, 2015; v1 submitted 17 February, 2015; originally announced February 2015.

    Comments: TPAMI final version, duplicate proposals removed in experiments

  19. arXiv:1501.05790  [pdf, other

    cs.CV

    Taking a Deeper Look at Pedestrians

    Authors: Jan Hosang, Mohamed Omran, Rodrigo Benenson, Bernt Schiele

    Abstract: In this paper we study the use of convolutional neural networks (convnets) for the task of pedestrian detection. Despite their recent diverse successes, convnets historically underperform compared to other pedestrian detectors. We deliberately omit explicitly modelling the problem into the network (e.g. parts or occlusion modelling) and show that we can reach competitive performance without bells… ▽ More

    Submitted 23 January, 2015; originally announced January 2015.

  20. arXiv:1501.05759  [pdf, other

    cs.CV

    Filtered Channel Features for Pedestrian Detection

    Authors: Shanshan Zhang, Rodrigo Benenson, Bernt Schiele

    Abstract: This paper starts from the observation that multiple top performing pedestrian detectors can be modelled by using an intermediate layer filtering low-level features in combination with a boosted decision forest. Based on this observation we propose a unifying framework and experimentally explore different filter families. We report extensive results enabling a systematic analysis. Using filtered… ▽ More

    Submitted 23 January, 2015; originally announced January 2015.

  21. arXiv:1411.4304  [pdf, other

    cs.CV

    Ten Years of Pedestrian Detection, What Have We Learned?

    Authors: Rodrigo Benenson, Mohamed Omran, Jan Hosang, Bernt Schiele

    Abstract: Paper-by-paper results make it easy to miss the forest for the trees.We analyse the remarkable progress of the last decade by discussing the main ideas explored in the 40+ detectors currently present in the Caltech pedestrian detection benchmark. We observe that there exist three families of approaches, all currently reaching similar detection quality. Based on our analysis, we study the complemen… ▽ More

    Submitted 16 November, 2014; originally announced November 2014.

    Comments: To appear in ECCV 2014 CVRSUAD workshop proceedings

  22. arXiv:1406.6962  [pdf, other

    cs.CV

    How good are detection proposals, really?

    Authors: Jan Hosang, Rodrigo Benenson, Bernt Schiele

    Abstract: Current top performing Pascal VOC object detectors employ detection proposals to guide the search for objects thereby avoiding exhaustive sliding window search across images. Despite the popularity of detection proposals, it is unclear which trade-offs are made when using them during object detection. We provide an in depth analysis of ten object proposal methods along with four baselines regardi… ▽ More

    Submitted 22 July, 2014; v1 submitted 26 June, 2014; originally announced June 2014.