Zum Hauptinhalt springen

Showing 1–18 of 18 results for author: Khamis, S

Searching in archive cs. Search in all archives.
.
  1. Single-Shot Implicit Morphable Faces with Consistent Texture Parameterization

    Authors: Connor Z. Lin, Koki Nagano, Jan Kautz, Eric R. Chan, Umar Iqbal, Leonidas Guibas, Gordon Wetzstein, Sameh Khamis

    Abstract: There is a growing demand for the accessible creation of high-quality 3D avatars that are animatable and customizable. Although 3D morphable models provide intuitive control for editing and animation, and robustness for single-view face reconstruction, they cannot easily capture geometric and appearance details. Methods based on neural implicit representations, such as signed distance functions (S… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: SIGGRAPH 2023, Project Page: https://research.nvidia.com/labs/toronto-ai/ssif

  2. arXiv:2305.02310  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Real-Time Radiance Fields for Single-Image Portrait View Synthesis

    Authors: Alex Trevithick, Matthew Chan, Michael Stengel, Eric R. Chan, Chao Liu, Zhiding Yu, Sameh Khamis, Manmohan Chandraker, Ravi Ramamoorthi, Koki Nagano

    Abstract: We present a one-shot method to infer and render a photorealistic 3D representation from a single unposed image (e.g., face portrait) in real-time. Given a single RGB input, our image encoder directly predicts a canonical triplane representation of a neural radiance field for 3D-aware novel view synthesis via volume rendering. Our method is fast (24 fps) on consumer hardware, and produces higher q… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: Project page: https://research.nvidia.com/labs/nxp/lp3d/

  3. arXiv:2212.03237  [pdf, other

    cs.CV

    RANA: Relightable Articulated Neural Avatars

    Authors: Umar Iqbal, Akin Caliskan, Koki Nagano, Sameh Khamis, Pavlo Molchanov, Jan Kautz

    Abstract: We propose RANA, a relightable and articulated neural avatar for the photorealistic synthesis of humans under arbitrary viewpoints, body poses, and lighting. We only require a short video clip of the person to create the avatar and assume no knowledge about the lighting environment. We present a novel framework to model humans while disentangling their geometry, texture, and also lighting environm… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: project page: https://nvlabs.github.io/RANA/

  4. arXiv:2209.10510  [pdf, other

    cs.CV cs.GR cs.LG

    Learning to Relight Portrait Images via a Virtual Light Stage and Synthetic-to-Real Adaptation

    Authors: Yu-Ying Yeh, Koki Nagano, Sameh Khamis, Jan Kautz, Ming-Yu Liu, Ting-Chun Wang

    Abstract: Given a portrait image of a person and an environment map of the target lighting, portrait relighting aims to re-illuminate the person in the image as if the person appeared in an environment with the target lighting. To achieve high-quality results, recent methods rely on deep learning. An effective approach is to supervise the training of deep neural networks with a high-fidelity dataset of desi… ▽ More

    Submitted 10 August, 2023; v1 submitted 21 September, 2022; originally announced September 2022.

    Comments: To appear in ACM Transactions on Graphics (SIGGRAPH Asia 2022). 21 pages, 25 figures, 7 tables. Project page: https://research.nvidia.com/labs/dir/lumos/

    Journal ref: ACM Trans. Graph. 41, 6, Article 231 (December 2022), 21 pages

  5. arXiv:2205.07058  [pdf, other

    cs.CV

    RTMV: A Ray-Traced Multi-View Synthetic Dataset for Novel View Synthesis

    Authors: Jonathan Tremblay, Moustafa Meshry, Alex Evans, Jan Kautz, Alexander Keller, Sameh Khamis, Thomas Müller, Charles Loop, Nathan Morrical, Koki Nagano, Towaki Takikawa, Stan Birchfield

    Abstract: We present a large-scale synthetic dataset for novel view synthesis consisting of ~300k images rendered from nearly 2000 complex scenes using high-quality ray tracing at high resolution (1600 x 1600 pixels). The dataset is orders of magnitude larger than existing synthetic datasets for novel view synthesis, thus providing a large unified benchmark for both training and evaluation. Using 4 distinct… ▽ More

    Submitted 24 October, 2022; v1 submitted 14 May, 2022; originally announced May 2022.

    Comments: ECCV 2022 Workshop on Learning to Generate 3D Shapes and Scenes. Project page at http://www.cs.umd.edu/~mmeshry/projects/rtmv

  6. arXiv:2203.15798  [pdf, other

    cs.CV

    DRaCoN -- Differentiable Rasterization Conditioned Neural Radiance Fields for Articulated Avatars

    Authors: Amit Raj, Umar Iqbal, Koki Nagano, Sameh Khamis, Pavlo Molchanov, James Hays, Jan Kautz

    Abstract: Acquisition and creation of digital human avatars is an important problem with applications to virtual telepresence, gaming, and human modeling. Most contemporary approaches for avatar generation can be viewed either as 3D-based methods, which use multi-view data to learn a 3D representation with appearance (such as a mesh, implicit surface, or volume), or 2D-based methods which learn photo-realis… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Project page at https://dracon-avatars.github.io/

  7. arXiv:2112.07945  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Efficient Geometry-aware 3D Generative Adversarial Networks

    Authors: Eric R. Chan, Connor Z. Lin, Matthew A. Chan, Koki Nagano, Boxiao Pan, Shalini De Mello, Orazio Gallo, Leonidas Guibas, Jonathan Tremblay, Sameh Khamis, Tero Karras, Gordon Wetzstein

    Abstract: Unsupervised generation of high-quality multi-view-consistent images and 3D shapes using only collections of single-view 2D photographs has been a long-standing challenge. Existing 3D GANs are either compute-intensive or make approximations that are not 3D-consistent; the former limits quality and resolution of the generated images and the latter adversely affects multi-view consistency and shape… ▽ More

    Submitted 27 April, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: Project page: https://matthew-a-chan.github.io/EG3D

  8. arXiv:2112.01741  [pdf, other

    cs.CV cs.GR cs.LG

    Frame Averaging for Equivariant Shape Space Learning

    Authors: Matan Atzmon, Koki Nagano, Sanja Fidler, Sameh Khamis, Yaron Lipman

    Abstract: The task of shape space learning involves mapping a train set of shapes to and from a latent representation space with good generalization properties. Often, real-world collections of shapes have symmetries, which can be defined as transformations that do not change the essence of the shape. A natural way to incorporate symmetries in shape space learning is to ask that the mapping to the shape spa… ▽ More

    Submitted 26 August, 2022; v1 submitted 3 December, 2021; originally announced December 2021.

    Comments: Accepted to CVPR 2022

  9. arXiv:2112.00958  [pdf, other

    cs.CV cs.GR cs.LG

    Hierarchical Neural Implicit Pose Network for Animation and Motion Retargeting

    Authors: Sourav Biswas, Kangxue Yin, Maria Shugrina, Sanja Fidler, Sameh Khamis

    Abstract: We present HIPNet, a neural implicit pose network trained on multiple subjects across many poses. HIPNet can disentangle subject-specific details from pose-specific details, effectively enabling us to retarget motion from one subject to another or to animate between keyframes through latent space interpolation. To this end, we employ a hierarchical skeleton-based representation to learn a signed d… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

  10. arXiv:2111.13674  [pdf, other

    cs.CV cs.GR cs.LG

    Neural Fields as Learnable Kernels for 3D Reconstruction

    Authors: Francis Williams, Zan Gojcic, Sameh Khamis, Denis Zorin, Joan Bruna, Sanja Fidler, Or Litany

    Abstract: We present Neural Kernel Fields: a novel method for reconstructing implicit 3D shapes based on a learned kernel ridge regression. Our technique achieves state-of-the-art results when reconstructing 3D objects and large scenes from sparse oriented points, and can reconstruct shape categories outside the training set with almost no drop in accuracy. The core insight of our approach is that kernel me… ▽ More

    Submitted 26 November, 2021; originally announced November 2021.

  11. arXiv:2111.00140  [pdf, other

    cs.CV cs.GR

    DIB-R++: Learning to Predict Lighting and Material with a Hybrid Differentiable Renderer

    Authors: Wenzheng Chen, Joey Litalien, Jun Gao, Zian Wang, Clement Fuji Tsang, Sameh Khamis, Or Litany, Sanja Fidler

    Abstract: We consider the challenging problem of predicting intrinsic object properties from a single image by exploiting differentiable renderers. Many previous learning-based approaches for inverse graphics adopt rasterization-based renderers and assume naive lighting and material models, which often fail to account for non-Lambertian, specular reflections commonly observed in the wild. In this work, we p… ▽ More

    Submitted 29 October, 2021; originally announced November 2021.

  12. arXiv:2108.12958  [pdf, other

    cs.CV cs.AI cs.GR

    3DStyleNet: Creating 3D Shapes with Geometric and Texture Style Variations

    Authors: Kangxue Yin, Jun Gao, Maria Shugrina, Sameh Khamis, Sanja Fidler

    Abstract: We propose a method to create plausible geometric and texture style variations of 3D objects in the quest to democratize 3D content creation. Given a pair of textured source and target objects, our method predicts a part-aware affine transformation field that naturally warps the source shape to imitate the overall geometric style of the target. In addition, the texture style of the target is trans… ▽ More

    Submitted 29 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 2021. Supplementary material can be found on the project page: https://nv-tlabs.github.io/3DStyleNet/

  13. arXiv:2002.03933  [pdf, other

    cs.CV

    RePose: Learning Deep Kinematic Priors for Fast Human Pose Estimation

    Authors: Hossam Isack, Christian Haene, Cem Keskin, Sofien Bouaziz, Yuri Boykov, Shahram Izadi, Sameh Khamis

    Abstract: We propose a novel efficient and lightweight model for human pose estimation from a single image. Our model is designed to achieve competitive results at a fraction of the number of parameters and computational cost of various state-of-the-art methods. To this end, we explicitly incorporate part-based structural and geometric priors in a hierarchical prediction framework. At the coarsest resolutio… ▽ More

    Submitted 10 February, 2020; originally announced February 2020.

  14. arXiv:1904.04290  [pdf, other

    cs.CV cs.GR

    Neural Rerendering in the Wild

    Authors: Moustafa Meshry, Dan B Goldman, Sameh Khamis, Hugues Hoppe, Rohit Pandey, Noah Snavely, Ricardo Martin-Brualla

    Abstract: We explore total scene capture -- recording, modeling, and rerendering a scene under varying appearance such as season and time of day. Starting from internet photos of a tourist landmark, we apply traditional 3D reconstruction to register the photos and approximate the scene as a point cloud. For each photo, we render the scene points into a deep framebuffer, and train a neural network to learn t… ▽ More

    Submitted 8 April, 2019; originally announced April 2019.

    Comments: To be presented at CVPR 2019 (oral). Supplementary video available at http://youtu.be/E1crWQn_kmY

  15. arXiv:1811.05029  [pdf, other

    cs.CV

    LookinGood: Enhancing Performance Capture with Real-time Neural Re-Rendering

    Authors: Ricardo Martin-Brualla, Rohit Pandey, Shuoran Yang, Pavel Pidlypenskyi, Jonathan Taylor, Julien Valentin, Sameh Khamis, Philip Davidson, Anastasia Tkach, Peter Lincoln, Adarsh Kowdle, Christoph Rhemann, Dan B Goldman, Cem Keskin, Steve Seitz, Shahram Izadi, Sean Fanello

    Abstract: Motivated by augmented and virtual reality applications such as telepresence, there has been a recent focus in real-time performance capture of humans under motion. However, given the real-time constraint, these systems often suffer from artifacts in geometry and texture such as holes and noise in the final rendering, poor lighting, and low-resolution textures. We take the novel approach to augmen… ▽ More

    Submitted 12 November, 2018; originally announced November 2018.

    Comments: The supplementary video is available at: http://youtu.be/Md3tdAKoLGU To be presented at SIGGRAPH Asia 2018

  16. arXiv:1807.08865  [pdf, other

    cs.CV

    StereoNet: Guided Hierarchical Refinement for Real-Time Edge-Aware Depth Prediction

    Authors: Sameh Khamis, Sean Fanello, Christoph Rhemann, Adarsh Kowdle, Julien Valentin, Shahram Izadi

    Abstract: This paper presents StereoNet, the first end-to-end deep architecture for real-time stereo matching that runs at 60 fps on an NVidia Titan X, producing high-quality, edge-preserved, quantization-free disparity maps. A key insight of this paper is that the network achieves a sub-pixel matching precision than is a magnitude higher than those of traditional stereo matching approaches. This allows us… ▽ More

    Submitted 23 July, 2018; originally announced July 2018.

    Comments: ECCV 2018

  17. arXiv:1807.06009  [pdf, other

    cs.CV

    ActiveStereoNet: End-to-End Self-Supervised Learning for Active Stereo Systems

    Authors: Yinda Zhang, Sameh Khamis, Christoph Rhemann, Julien Valentin, Adarsh Kowdle, Vladimir Tankovich, Michael Schoenberg, Shahram Izadi, Thomas Funkhouser, Sean Fanello

    Abstract: In this paper we present ActiveStereoNet, the first deep learning solution for active stereo systems. Due to the lack of ground truth, our method is fully self-supervised, yet it produces precise depth with a subpixel precision of $1/30th$ of a pixel; it does not suffer from the common over-smoothing issues; it preserves the edges; and it explicitly handles occlusions. We introduce a novel reconst… ▽ More

    Submitted 16 July, 2018; originally announced July 2018.

    Comments: Accepted by ECCV2018, Oral Presentation, Main paper + Supplementary Materials

  18. arXiv:1602.02822  [pdf, other

    cs.CV

    Parameterizing Region Covariance: An Efficient Way To Apply Sparse Codes On Second Order Statistics

    Authors: Xiyang Dai, Sameh Khamis, Yangmuzi Zhang, Larry S. Davis

    Abstract: Sparse representations have been successfully applied to signal processing, computer vision and machine learning. Currently there is a trend to learn sparse models directly on structure data, such as region covariance. However, such methods when combined with region covariance often require complex computation. We present an approach to transform a structured sparse model learning problem to a tra… ▽ More

    Submitted 8 February, 2016; originally announced February 2016.