Zum Hauptinhalt springen

Showing 1–36 of 36 results for author: Egger, B

.
  1. arXiv:2404.16306  [pdf, other

    cs.CV

    TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models

    Authors: Haomiao Ni, Bernhard Egger, Suhas Lohit, Anoop Cherian, Ye Wang, Toshiaki Koike-Akino, Sharon X. Huang, Tim K. Marks

    Abstract: Text-conditioned image-to-video generation (TI2V) aims to synthesize a realistic video starting from a given image (e.g., a woman's photo) and a text description (e.g., "a woman is drinking water."). Existing TI2V frameworks often require costly training on video-text datasets and specific model designs for text and image conditioning. In this paper, we propose TI2V-Zero, a zero-shot, tuning-free… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: CVPR 2024

  2. arXiv:2404.03421  [pdf, other

    cs.CV

    Generalizable 3D Scene Reconstruction via Divide and Conquer from a Single View

    Authors: Andreea Dogaru, Mert Özer, Bernhard Egger

    Abstract: Single-view 3D reconstruction is currently approached from two dominant perspectives: reconstruction of scenes with limited diversity using 3D data supervision or reconstruction of diverse singular objects using large image priors. However, real-world scenarios are far more complex and exceed the capabilities of these methods. We therefore propose a hybrid method following a divide-and-conquer str… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  3. arXiv:2404.00994  [pdf, other

    cs.CV

    AMOR: Ambiguous Authorship Order

    Authors: Maximilian Weiherer, Andreea Dogaru, Shreya Kapoor, Hannah Schieber, Bernhard Egger

    Abstract: As we all know, writing scientific papers together with our beloved colleagues is a truly remarkable experience (partially): endless discussions about the same useless paragraph over and over again, followed by long days and long nights -- both at the same time. What a wonderful ride it is! What a beautiful life we have. But wait, there's one tiny little problem that utterly shatters the peace, tu… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: SIGBOVIK '24 submission

  4. arXiv:2403.11865  [pdf, other

    cs.CV cs.AI cs.GR

    Exploring Multi-modal Neural Scene Representations With Applications on Thermal Imaging

    Authors: Mert Özer, Maximilian Weiherer, Martin Hundhausen, Bernhard Egger

    Abstract: Neural Radiance Fields (NeRFs) quickly evolved as the new de-facto standard for the task of novel view synthesis when trained on a set of RGB images. In this paper, we conduct a comprehensive evaluation of neural scene representations, such as NeRFs, in the context of multi-modal learning. Specifically, we present four different strategies of how to incorporate a second modality, other than RGB, i… ▽ More

    Submitted 23 August, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted to ECCVW'24

  5. exploreCOSMOS: Interactive Exploration of Conditional Statistical Shape Models in the Web-Browser

    Authors: Maximilian Hahn, Bernhard Egger

    Abstract: Statistical Shape Models of faces and various body parts are heavily used in medical image analysis, computer vision and visualization. Whilst the field is well explored with many existing tools, all of them aim at experts, which limits their applicability. We demonstrate the first tool that enables the convenient exploration of statistical shape models in the browser, with the capability to manip… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: Dies ist ein Vorabdruck des folgenden Beitrages, veröffentlicht in BVM 2024, herausgegeben von Maier, A. et al, 2024, Springer Nature, vervielfältigt mit Genehmigung von Springer Nature. Die finale authentifizierte Version ist online verfügbar unter: https://doi.org/10.1007/978-3-658-44037-4_32

  6. arXiv:2402.07677  [pdf, other

    cs.CV

    GBOT: Graph-Based 3D Object Tracking for Augmented Reality-Assisted Assembly Guidance

    Authors: Shiyu Li, Hannah Schieber, Niklas Corell, Bernhard Egger, Julian Kreimeier, Daniel Roth

    Abstract: Guidance for assemblable parts is a promising field for augmented reality. Augmented reality assembly guidance requires 6D object poses of target objects in real time. Especially in time-critical medical or industrial settings, continuous and markerless tracking of individual parts is essential to visualize instructions superimposed on or next to the target object parts. In this regard, occlusions… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: 9 pages

  7. arXiv:2312.09780  [pdf, other

    cs.CV

    RANRAC: Robust Neural Scene Representations via Random Ray Consensus

    Authors: Benno Buschmann, Andreea Dogaru, Elmar Eisemann, Michael Weinmann, Bernhard Egger

    Abstract: Learning-based scene representations such as neural radiance fields or light field networks, that rely on fitting a scene model to image observations, commonly encounter challenges in the presence of inconsistencies within the images caused by occlusions, inaccurately estimated camera parameters or effects like lens flare. To address this challenge, we introduce RANdom RAy Consensus (RANRAC), an e… ▽ More

    Submitted 19 April, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

  8. arXiv:2311.17232  [pdf, other

    cs.CV cs.AI

    ReWaRD: Retinal Waves for Pre-Training Artificial Neural Networks Mimicking Real Prenatal Development

    Authors: Benjamin Cappell, Andreas Stoll, Williams Chukwudi Umah, Bernhard Egger

    Abstract: Computational models trained on a large amount of natural images are the state-of-the-art to study human vision - usually adult vision. Computational models of infant vision and its further development are gaining more and more attention in the community. In this work we aim at the very beginning of our visual experience - pre- and post-natal retinal waves which suggest to be a pre-training mechan… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: https://github.com/BennyCa/ReWaRD/ IN: Proceedings of the first edition of the Workshop on Unifying Representations in Neural Models (UniReps 2023) @ NeurIPS 2023

  9. arXiv:2311.16937  [pdf, other

    cs.CV

    The Sky's the Limit: Re-lightable Outdoor Scenes via a Sky-pixel Constrained Illumination Prior and Outside-In Visibility

    Authors: James A. D. Gardner, Evgenii Kashin, Bernhard Egger, William A. P. Smith

    Abstract: Inverse rendering of outdoor scenes from unconstrained image collections is a challenging task, particularly illumination/albedo ambiguities and occlusion of the illumination environment (shadowing) caused by geometry. However, there are many cues in an image that can aid in the disentanglement of geometry, albedo and shadows. Whilst sky is frequently masked out in state-of-the-art methods, we exp… ▽ More

    Submitted 30 July, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Accepted to ECCV 2024

  10. arXiv:2311.09361  [pdf, other

    cs.CV

    RENI++ A Rotation-Equivariant, Scale-Invariant, Natural Illumination Prior

    Authors: James A. D. Gardner, Bernhard Egger, William A. P. Smith

    Abstract: Inverse rendering is an ill-posed problem. Previous work has sought to resolve this by focussing on priors for object or scene shape or appearance. In this work, we instead focus on a prior for natural illuminations. Current methods rely on spherical harmonic lighting or other generic representations and, at best, a simplistic prior on the parameters. This results in limitations for the inverse se… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Project Repo - https://github.com/JADGardner/ns_reni. arXiv admin note: substantial text overlap with arXiv:2206.03858

  11. arXiv:2304.00399  [pdf, other

    cs.CV cs.AI

    From Zero to Hero: Convincing with Extremely Complicated Math

    Authors: Maximilian Weiherer, Bernhard Egger

    Abstract: Becoming a (super) hero is almost every kid's dream. During their sheltered childhood, they do whatever it takes to grow up to be one. Work hard, play hard -- all day long. But as they're getting older, distractions are more and more likely to occur. They're getting off track. They start discovering what is feared as simple math. Finally, they end up as a researcher, writing boring, non-impressive… ▽ More

    Submitted 1 April, 2023; originally announced April 2023.

    Comments: SIGBOVIK'23

  12. arXiv:2303.10042  [pdf, other

    cs.CV

    ShaRPy: Shape Reconstruction and Hand Pose Estimation from RGB-D with Uncertainty

    Authors: Vanessa Wirth, Anna-Maria Liphardt, Birte Coppers, Johanna Bräunig, Simon Heinrich, Sigrid Leyendecker, Arnd Kleyer, Georg Schett, Martin Vossiek, Bernhard Egger, Marc Stamminger

    Abstract: Despite their potential, markerless hand tracking technologies are not yet applied in practice to the diagnosis or monitoring of the activity in inflammatory musculoskeletal diseases. One reason is that the focus of most methods lies in the reconstruction of coarse, plausible poses, whereas in the clinical context, accurate, interpretable, and reliable results are required. Therefore, we propose S… ▽ More

    Submitted 12 September, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: Accepted at ICCVW (CVAMD) 2023

  13. arXiv:2303.09412  [pdf, other

    cs.CV

    NeRFtrinsic Four: An End-To-End Trainable NeRF Jointly Optimizing Diverse Intrinsic and Extrinsic Camera Parameters

    Authors: Hannah Schieber, Fabian Deuser, Bernhard Egger, Norbert Oswald, Daniel Roth

    Abstract: Novel view synthesis using neural radiance fields (NeRF) is the state-of-the-art technique for generating high-quality images from novel viewpoints. Existing methods require a priori knowledge about extrinsic and intrinsic camera parameters. This limits their applicability to synthetic scenes, or real-world scenarios with the necessity of a preprocessing step. Current research on the joint optimiz… ▽ More

    Submitted 26 October, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

  14. arXiv:2303.04923  [pdf, other

    cs.CV

    BOSS: Bones, Organs and Skin Shape Model

    Authors: Karthik Shetty, Annette Birkhold, Srikrishna Jaganathan, Norbert Strobel, Bernhard Egger, Markus Kowarschik, Andreas Maier

    Abstract: Objective: A digital twin of a patient can be a valuable tool for enhancing clinical tasks such as workflow automation, patient-specific X-ray dose optimization, markerless tracking, positioning, and navigation assistance in image-guided interventions. However, it is crucial that the patient's surface and internal organs are of high quality for any pose and shape estimates. At present, the majorit… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

  15. arXiv:2211.16314  [pdf, other

    cs.CV cs.GR cs.LG

    Approximating Intersections and Differences Between Linear Statistical Shape Models Using Markov Chain Monte Carlo

    Authors: Maximilian Weiherer, Finn Klein, Bernhard Egger

    Abstract: To date, the comparison of Statistical Shape Models (SSMs) is often solely performance-based, carried out by means of simplistic metrics such as compactness, generalization, or specificity. Any similarities or differences between the actual shape spaces can neither be visualized nor quantified. In this paper, we present a new method to qualitatively compare two linear SSMs in dense correspondence… ▽ More

    Submitted 30 October, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: Accepted to WACV'24

  16. arXiv:2211.11734  [pdf, other

    cs.CV

    PLIKS: A Pseudo-Linear Inverse Kinematic Solver for 3D Human Body Estimation

    Authors: Karthik Shetty, Annette Birkhold, Srikrishna Jaganathan, Norbert Strobel, Markus Kowarschik, Andreas Maier, Bernhard Egger

    Abstract: We introduce PLIKS (Pseudo-Linear Inverse Kinematic Solver) for reconstruction of a 3D mesh of the human body from a single 2D image. Current techniques directly regress the shape, pose, and translation of a parametric model from an input image through a non-linear mapping with minimal flexibility to any external influences. We approach the task as a model-in-the-loop optimization problem. PLIKS i… ▽ More

    Submitted 27 March, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: CVPR2023

  17. arXiv:2210.15664  [pdf, other

    cs.CV cs.GR

    State of the Art in Dense Monocular Non-Rigid 3D Reconstruction

    Authors: Edith Tretschk, Navami Kairanda, Mallikarjun B R, Rishabh Dabral, Adam Kortylewski, Bernhard Egger, Marc Habermann, Pascal Fua, Christian Theobalt, Vladislav Golyanik

    Abstract: 3D reconstruction of deformable (or non-rigid) scenes from a set of monocular 2D image observations is a long-standing and actively researched area of computer vision and graphics. It is an ill-posed inverse problem, since -- without additional prior assumptions -- it permits infinitely many solutions leading to accurate projection to the input 2D images. Non-rigid reconstruction is a foundational… ▽ More

    Submitted 24 March, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: 36 pages, 18 figures, 3 tables; State-of-the-Art Report at EUROGRAPHICS 2023

    Journal ref: Computer Graphics Forum, 2023

  18. A Lightweight Machine Learning Pipeline for LiDAR-simulation

    Authors: Richard Marcus, Niklas Knoop, Bernhard Egger, Marc Stamminger

    Abstract: Virtual testing is a crucial task to ensure safety in autonomous driving, and sensor simulation is an important task in this domain. Most current LiDAR simulations are very simplistic and are mainly used to perform initial tests, while the majority of insights are gathered on the road. In this paper, we propose a lightweight approach for more realistic LiDAR simulation that learns a real sensor's… ▽ More

    Submitted 5 August, 2022; originally announced August 2022.

    Comments: Conference: DeLTA 22; ISBN 978-989-758-584-5; ISSN 2184-9277; publisher: SciTePress, organization: INSTICC

    Journal ref: Proceedings of the 3rd International Conference on Deep Learning Theory and Applications - DeLTA, 2022, pages 176-183

  19. arXiv:2206.03858  [pdf, other

    cs.CV

    Rotation-Equivariant Conditional Spherical Neural Fields for Learning a Natural Illumination Prior

    Authors: James A. D. Gardner, Bernhard Egger, William A. P. Smith

    Abstract: Inverse rendering is an ill-posed problem. Previous work has sought to resolve this by focussing on priors for object or scene shape or appearance. In this work, we instead focus on a prior for natural illuminations. Current methods rely on spherical harmonic lighting or other generic representations and, at best, a simplistic prior on the parameters. We propose a conditional neural field represen… ▽ More

    Submitted 14 October, 2022; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022 - Project Website: jadgardner.github.io/RENI

  20. arXiv:2203.02554  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Building 3D Generative Models from Minimal Data

    Authors: Skylar Sutherland, Bernhard Egger, Joshua Tenenbaum

    Abstract: We propose a method for constructing generative models of 3D objects from a single 3D mesh and improving them through unsupervised low-shot learning from 2D images. Our method produces a 3D morphable model that represents shape and albedo in terms of Gaussian processes. Whereas previous approaches have typically built 3D morphable models from multiple high-quality 3D scans through principal compon… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2011.12440

  21. arXiv:2201.09354  [pdf, other

    cs.CV cs.AI cs.LG

    Survey and Systematization of 3D Object Detection Models and Methods

    Authors: Moritz Drobnitzky, Jonas Friederich, Bernhard Egger, Patrick Zschech

    Abstract: Strong demand for autonomous vehicles and the wide availability of 3D sensors are continuously fueling the proposal of novel methods for 3D object detection. In this paper, we provide a comprehensive survey of recent developments from 2012-2021 in 3D object detection covering the full pipeline from input data, over data representation and feature extraction to the actual detection modules. We intr… ▽ More

    Submitted 5 May, 2023; v1 submitted 23 January, 2022; originally announced January 2022.

    Comments: accepted at "The Visual Computer"

  22. arXiv:2112.00113  [pdf, other

    cs.CV cs.AI

    Beyond Flatland: Pre-training with a Strong 3D Inductive Bias

    Authors: Shubhaankar Gupta, Thomas P. O'Connell, Bernhard Egger

    Abstract: Pre-training on large-scale databases consisting of natural images and then fine-tuning them to fit the application at hand, or transfer-learning, is a popular strategy in computer vision. However, Kataoka et al., 2020 introduced a technique to eliminate the need for natural images in supervised deep learning by proposing a novel synthetic, formula-based method to generate 2D fractals as training… ▽ More

    Submitted 30 November, 2021; originally announced December 2021.

    Comments: NeurIPS 2021 pre-registration workshop

  23. arXiv:2111.01048  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    MOST-GAN: 3D Morphable StyleGAN for Disentangled Face Image Manipulation

    Authors: Safa C. Medin, Bernhard Egger, Anoop Cherian, Ye Wang, Joshua B. Tenenbaum, Xiaoming Liu, Tim K. Marks

    Abstract: Recent advances in generative adversarial networks (GANs) have led to remarkable achievements in face image synthesis. While methods that use style-based GANs can generate strikingly photorealistic face images, it is often difficult to control the characteristics of the generated faces in a meaningful and disentangled way. Prior approaches aim to achieve such semantic control and disentanglement w… ▽ More

    Submitted 1 November, 2021; originally announced November 2021.

    ACM Class: I.2.10

  24. arXiv:2109.14203  [pdf, other

    cs.CV cs.GR

    Identity-Expression Ambiguity in 3D Morphable Face Models

    Authors: Bernhard Egger, Skylar Sutherland, Safa C. Medin, Joshua Tenenbaum

    Abstract: 3D Morphable Models are a class of generative models commonly used to model faces. They are typically applied to ill-posed problems such as 3D reconstruction from 2D data. Several ambiguities in this problem's image formation process have been studied explicitly. We demonstrate that non-orthogonality of the variation in identity and expression can cause identity-expression ambiguity in 3D Morphabl… ▽ More

    Submitted 29 September, 2021; originally announced September 2021.

    Comments: IEEE International Conference on Automatic Face and Gesture Recognition 2021

  25. Learning the shape of female breasts: an open-access 3D statistical shape model of the female breast built from 110 breast scans

    Authors: Maximilian Weiherer, Andreas Eigenberger, Bernhard Egger, Vanessa Brébant, Lukas Prantl, Christoph Palm

    Abstract: We present the Regensburg Breast Shape Model (RBSM) -- a 3D statistical shape model of the female breast built from 110 breast scans acquired in a standing position, and the first publicly available. Together with the model, a fully automated, pairwise surface registration pipeline used to establish dense correspondence among 3D breast scans is introduced. Our method is computationally efficient a… ▽ More

    Submitted 1 February, 2022; v1 submitted 28 July, 2021; originally announced July 2021.

    Comments: 16 pages, 14 figures, accepted for publication in The Visual Computer

    ACM Class: I.4.m; J.3

  26. arXiv:2106.09614  [pdf, other

    cs.CV

    Robust Model-based Face Reconstruction through Weakly-Supervised Outlier Segmentation

    Authors: Chunlu Li, Andreas Morel-Forster, Thomas Vetter, Bernhard Egger, Adam Kortylewski

    Abstract: In this work, we aim to enhance model-based face reconstruction by avoiding fitting the model to outliers, i.e. regions that cannot be well-expressed by the model such as occluders or make-up. The core challenge for localizing outliers is that they are highly variable and difficult to annotate. To overcome this challenging problem, we introduce a joint Face-autoencoder and outlier segmentation app… ▽ More

    Submitted 21 March, 2023; v1 submitted 17 June, 2021; originally announced June 2021.

    Comments: 20 pages, CVPR2023

  27. arXiv:2102.02912  [pdf, other

    eess.IV cs.CV

    Deep Learning compatible Differentiable X-ray Projections for Inverse Rendering

    Authors: Karthik Shetty, Annette Birkhold, Norbert Strobel, Bernhard Egger, Srikrishna Jaganathan, Markus Kowarschik, Andreas Maier

    Abstract: Many minimally invasive interventional procedures still rely on 2D fluoroscopic imaging. Generating a patient-specific 3D model from these X-ray projection data would allow to improve the procedural workflow, e.g. by providing assistance functions such as automatic positioning. To accomplish this, two things are required. First, a statistical human shape model of the human anatomy and second, a di… ▽ More

    Submitted 4 February, 2021; originally announced February 2021.

    Comments: 7 pages, 3 figures, Accepted for Bildverarbeitung für die Medizin 2021

  28. arXiv:2011.12440  [pdf, other

    cs.CV cs.GR

    Building 3D Morphable Models from a Single Scan

    Authors: Skylar Sutherland, Bernhard Egger, Joshua Tenenbaum

    Abstract: We propose a method for constructing generative models of 3D objects from a single 3D mesh. Our method produces a 3D morphable model that represents shape and albedo in terms of Gaussian processes. We define the shape deformations in physical (3D) space and the albedo deformations as a combination of physical-space and color-space deformations. Whereas previous approaches have typically built 3D m… ▽ More

    Submitted 30 September, 2021; v1 submitted 24 November, 2020; originally announced November 2020.

    Comments: ICCV Workshops: 1st Workshop on Traditional Computer Vision in the Age of Deep Learning (TradiCV)

  29. arXiv:2010.13187  [pdf, other

    stat.ML cs.CV cs.LG

    Improving the Reconstruction of Disentangled Representation Learners via Multi-Stage Modeling

    Authors: Akash Srivastava, Yamini Bansal, Yukun Ding, Cole Lincoln Hurwitz, Kai Xu, Bernhard Egger, Prasanna Sattigeri, Joshua B. Tenenbaum, Phuong Le, Arun Prakash R, Nengfeng Zhou, Joel Vaughan, Yaquan Wang, Anwesha Bhattacharyya, Kristjan Greenewald, David D. Cox, Dan Gutfreund

    Abstract: Current autoencoder-based disentangled representation learning methods achieve disentanglement by penalizing the (aggregate) posterior to encourage statistical independence of the latent factors. This approach introduces a trade-off between disentangled representation learning and reconstruction quality since the model does not have enough capacity to learn correlated latent variables that capture… ▽ More

    Submitted 3 April, 2024; v1 submitted 25 October, 2020; originally announced October 2020.

  30. arXiv:2004.02711  [pdf, other

    cs.CV cs.GR

    A Morphable Face Albedo Model

    Authors: William A. P. Smith, Alassane Seck, Hannah Dee, Bernard Tiddeman, Joshua Tenenbaum, Bernhard Egger

    Abstract: In this paper, we bring together two divergent strands of research: photometric face capture and statistical 3D face appearance modelling. We propose a novel lightstage capture and processing pipeline for acquiring ear-to-ear, truly intrinsic diffuse and specular albedo maps that fully factor out the effects of illumination, camera and geometry. Using this pipeline, we capture a dataset of 50 scan… ▽ More

    Submitted 19 June, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: CVPR 2020

  31. arXiv:1909.01815  [pdf, other

    cs.CV cs.GR cs.LG

    3D Morphable Face Models -- Past, Present and Future

    Authors: Bernhard Egger, William A. P. Smith, Ayush Tewari, Stefanie Wuhrer, Michael Zollhoefer, Thabo Beeler, Florian Bernard, Timo Bolkart, Adam Kortylewski, Sami Romdhani, Christian Theobalt, Volker Blanz, Thomas Vetter

    Abstract: In this paper, we provide a detailed survey of 3D Morphable Face Models over the 20 years since they were first proposed. The challenges in building and applying these models, namely capture, modeling, image formation, and image analysis, are still active research topics, and we review the state-of-the-art in each of these areas. We also look ahead, identifying unsolved challenges, proposing direc… ▽ More

    Submitted 16 April, 2020; v1 submitted 3 September, 2019; originally announced September 2019.

    Comments: ACM Transactions on Graphics (TOG)

  32. arXiv:1907.07783  [pdf, other

    eess.IV cs.CG cs.CV cs.LG

    Patient-specific Conditional Joint Models of Shape, Image Features and Clinical Indicators

    Authors: Bernhard Egger, Markus D. Schirmer, Florian Dubost, Marco J. Nardin, Natalia S. Rost, Polina Golland

    Abstract: We propose and demonstrate a joint model of anatomical shapes, image features and clinical indicators for statistical shape modeling and medical image analysis. The key idea is to employ a copula model to separate the joint dependency structure from the marginal distributions of variables of interest. This separation provides flexibility on the assumptions made during the modeling process. The pro… ▽ More

    Submitted 17 July, 2019; originally announced July 2019.

    Comments: Supplementary material: https://www.youtube.com/watch?v=gPoHP_iFQIA

    Journal ref: MICCAI 2019, the 22nd International Conference on Medical Image Computing and Computer Assisted Intervention, in Shenzhen, China

  33. arXiv:1811.08565  [pdf, other

    cs.CV

    Can Synthetic Faces Undo the Damage of Dataset Bias to Face Recognition and Facial Landmark Detection?

    Authors: Adam Kortylewski, Bernhard Egger, Andreas Morel-Forster, Andreas Schneider, Thomas Gerig, Clemens Blumer, Corius Reyneke, Thomas Vetter

    Abstract: It is well known that deep learning approaches to face recognition and facial landmark detection suffer from biases in modern training datasets. In this work, we propose to use synthetic face images to reduce the negative effects of dataset biases on these tasks. Using a 3D morphable face model, we generate large amounts of synthetic face images with full control over facial shape and color, pose,… ▽ More

    Submitted 22 June, 2019; v1 submitted 19 November, 2018; originally announced November 2018.

    Comments: Technical report

  34. arXiv:1802.05891  [pdf, other

    cs.CV

    Training Deep Face Recognition Systems with Synthetic Data

    Authors: Adam Kortylewski, Andreas Schneider, Thomas Gerig, Bernhard Egger, Andreas Morel-Forster, Thomas Vetter

    Abstract: Recent advances in deep learning have significantly increased the performance of face recognition systems. The performance and reliability of these models depend heavily on the amount and quality of the training data. However, the collection of annotated large datasets does not scale well and the control over the quality of the data decreases with the size of the dataset. In this work, we explore… ▽ More

    Submitted 16 February, 2018; originally announced February 2018.

  35. arXiv:1712.01619  [pdf, other

    cs.CV

    Empirically Analyzing the Effect of Dataset Biases on Deep Face Recognition Systems

    Authors: Adam Kortylewski, Bernhard Egger, Andreas Schneider, Thomas Gerig, Andreas Morel-Forster, Thomas Vetter

    Abstract: It is unknown what kind of biases modern in the wild face datasets have because of their lack of annotation. A direct consequence of this is that total recognition rates alone only provide limited insight about the generalization ability of a Deep Convolutional Neural Networks (DCNNs). We propose to empirically study the effect of different types of dataset biases on the generalization ability of… ▽ More

    Submitted 19 April, 2018; v1 submitted 5 December, 2017; originally announced December 2017.

    Comments: Accepted to CVPR 2018 Workshop on Analysis and Modeling of Faces and Gestures (AMFG)

  36. arXiv:1709.08398  [pdf, other

    cs.CV

    Morphable Face Models - An Open Framework

    Authors: Thomas Gerig, Andreas Morel-Forster, Clemens Blumer, Bernhard Egger, Marcel Lüthi, Sandro Schönborn, Thomas Vetter

    Abstract: In this paper, we present a novel open-source pipeline for face registration based on Gaussian processes as well as an application to face image analysis. Non-rigid registration of faces is significant for many applications in computer vision, such as the construction of 3D Morphable face models (3DMMs). Gaussian Process Morphable Models (GPMMs) unify a variety of non-rigid deformation models with… ▽ More

    Submitted 26 September, 2017; v1 submitted 25 September, 2017; originally announced September 2017.