Zum Hauptinhalt springen

Showing 1–25 of 25 results for author: Martin-Brualla, R

.
  1. arXiv:2405.10314  [pdf, other

    cs.CV

    CAT3D: Create Anything in 3D with Multi-View Diffusion Models

    Authors: Ruiqi Gao, Aleksander Holynski, Philipp Henzler, Arthur Brussee, Ricardo Martin-Brualla, Pratul Srinivasan, Jonathan T. Barron, Ben Poole

    Abstract: Advances in 3D reconstruction have enabled high-quality 3D capture, but require a user to collect hundreds to thousands of images to create a 3D scene. We present CAT3D, a method for creating anything in 3D by simulating this real-world capture process with a multi-view diffusion model. Given any number of input images and a set of target novel viewpoints, our model generates highly consistent nov… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: Project page: https://cat3d.github.io

  2. arXiv:2308.10902  [pdf, other

    cs.CV cs.GR

    CamP: Camera Preconditioning for Neural Radiance Fields

    Authors: Keunhong Park, Philipp Henzler, Ben Mildenhall, Jonathan T. Barron, Ricardo Martin-Brualla

    Abstract: Neural Radiance Fields (NeRF) can be optimized to obtain high-fidelity 3D scene reconstructions of objects and large-scale scenes. However, NeRFs require accurate camera parameters as input -- inaccurate camera parameters result in blurry renderings. Extrinsic and intrinsic camera parameters are usually estimated using Structure-from-Motion (SfM) methods as a pre-processing step to NeRF, but these… ▽ More

    Submitted 30 August, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: SIGGRAPH Asia 2023, Project page: https://camp-nerf.github.io

  3. arXiv:2306.09109  [pdf, other

    cs.CV

    NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations

    Authors: Varun Jampani, Kevis-Kokitsi Maninis, Andreas Engelhardt, Arjun Karpur, Karen Truong, Kyle Sargent, Stefan Popov, André Araujo, Ricardo Martin-Brualla, Kaushal Patel, Daniel Vlasic, Vittorio Ferrari, Ameesh Makadia, Ce Liu, Yuanzhen Li, Howard Zhou

    Abstract: Recent advances in neural reconstruction enable high-quality 3D object reconstruction from casually captured image collections. Current techniques mostly analyze their progress on relatively simple image collections where Structure-from-Motion (SfM) techniques can provide ground-truth (GT) camera poses. We note that SfM techniques tend to fail on in-the-wild image collections such as image search… ▽ More

    Submitted 13 October, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 camera ready. Project page: https://navidataset.github.io

  4. arXiv:2303.13582  [pdf, other

    cs.CV

    SCADE: NeRFs from Space Carving with Ambiguity-Aware Depth Estimates

    Authors: Mikaela Angelina Uy, Ricardo Martin-Brualla, Leonidas Guibas, Ke Li

    Abstract: Neural radiance fields (NeRFs) have enabled high fidelity 3D reconstruction from multiple 2D input views. However, a well-known drawback of NeRFs is the less-than-ideal performance under a small number of views, due to insufficient constraints enforced by volumetric rendering. To address this issue, we introduce SCADE, a novel technique that improves NeRF reconstruction quality on sparse, unconstr… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: CVPR 2023

  5. arXiv:2303.12779  [pdf, other

    cs.CV

    LFM-3D: Learnable Feature Matching Across Wide Baselines Using 3D Signals

    Authors: Arjun Karpur, Guilherme Perrotta, Ricardo Martin-Brualla, Howard Zhou, André Araujo

    Abstract: Finding localized correspondences across different images of the same object is crucial to understand its geometry. In recent years, this problem has seen remarkable progress with the advent of deep learning-based local image features and learnable matchers. Still, learnable matchers often underperform when there exists only small regions of co-visibility between image pairs (i.e. wide camera base… ▽ More

    Submitted 30 January, 2024; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: 3DV 2024, oral paper

  6. arXiv:2210.04628  [pdf, other

    cs.CV cs.GR cs.LG

    Novel View Synthesis with Diffusion Models

    Authors: Daniel Watson, William Chan, Ricardo Martin-Brualla, Jonathan Ho, Andrea Tagliasacchi, Mohammad Norouzi

    Abstract: We present 3DiM, a diffusion model for 3D novel view synthesis, which is able to translate a single input view into consistent and sharp completions across many views. The core component of 3DiM is a pose-conditional image-to-image diffusion model, which takes a source view and its pose as inputs, and generates a novel view for a target pose as output. 3DiM can generate multiple views that are 3D… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

  7. arXiv:2111.13679  [pdf, other

    cs.CV cs.GR eess.IV

    NeRF in the Dark: High Dynamic Range View Synthesis from Noisy Raw Images

    Authors: Ben Mildenhall, Peter Hedman, Ricardo Martin-Brualla, Pratul Srinivasan, Jonathan T. Barron

    Abstract: Neural Radiance Fields (NeRF) is a technique for high quality novel view synthesis from a collection of posed input images. Like most view synthesis methods, NeRF uses tonemapped low dynamic range (LDR) as input; these images have been processed by a lossy camera pipeline that smooths detail, clips highlights, and distorts the simple noise distribution of raw sensor data. We modify NeRF to instead… ▽ More

    Submitted 26 November, 2021; originally announced November 2021.

    Comments: Project page: https://bmild.github.io/rawnerf/

  8. arXiv:2111.05849  [pdf, other

    cs.GR cs.CV

    Advances in Neural Rendering

    Authors: Ayush Tewari, Justus Thies, Ben Mildenhall, Pratul Srinivasan, Edgar Tretschk, Yifan Wang, Christoph Lassner, Vincent Sitzmann, Ricardo Martin-Brualla, Stephen Lombardi, Tomas Simon, Christian Theobalt, Matthias Niessner, Jonathan T. Barron, Gordon Wetzstein, Michael Zollhoefer, Vladislav Golyanik

    Abstract: Synthesizing photo-realistic images and videos is at the heart of computer graphics and has been the focus of decades of research. Traditionally, synthetic images of a scene are generated using rendering algorithms such as rasterization or ray tracing, which take specifically defined representations of geometry and material properties as input. Collectively, these inputs define the actual scene an… ▽ More

    Submitted 30 March, 2022; v1 submitted 10 November, 2021; originally announced November 2021.

    Comments: 33 pages, 14 figures, 5 tables; State of the Art Report at EUROGRAPHICS 2022

  9. arXiv:2106.13228  [pdf, other

    cs.CV cs.GR

    HyperNeRF: A Higher-Dimensional Representation for Topologically Varying Neural Radiance Fields

    Authors: Keunhong Park, Utkarsh Sinha, Peter Hedman, Jonathan T. Barron, Sofien Bouaziz, Dan B Goldman, Ricardo Martin-Brualla, Steven M. Seitz

    Abstract: Neural Radiance Fields (NeRF) are able to reconstruct scenes with unprecedented fidelity, and various recent works have extended NeRF to handle dynamic scenes. A common approach to reconstruct such non-rigid scenes is through the use of a learned deformation field mapping from coordinates in each input image into a canonical template coordinate space. However, these deformation-based approaches st… ▽ More

    Submitted 10 September, 2021; v1 submitted 24 June, 2021; originally announced June 2021.

    Comments: SIGGRAPH Asia 2021, Project page: https://hypernerf.github.io/

  10. arXiv:2104.08418  [pdf, other

    cs.CV

    FiG-NeRF: Figure-Ground Neural Radiance Fields for 3D Object Category Modelling

    Authors: Christopher Xie, Keunhong Park, Ricardo Martin-Brualla, Matthew Brown

    Abstract: We investigate the use of Neural Radiance Fields (NeRF) to learn high quality 3D object category models from collections of input images. In contrast to previous work, we are able to do this whilst simultaneously separating foreground objects from their varying backgrounds. We achieve this via a 2-component NeRF model, FiG-NeRF, that prefers explanation of the scene as a geometrically constant bac… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

  11. arXiv:2104.04532  [pdf, other

    cs.CV

    Neural RGB-D Surface Reconstruction

    Authors: Dejan Azinović, Ricardo Martin-Brualla, Dan B Goldman, Matthias Nießner, Justus Thies

    Abstract: Obtaining high-quality 3D reconstructions of room-scale scenes is of paramount importance for upcoming applications in AR or VR. These range from mixed reality applications for teleconferencing, virtual measuring, virtual room planing, to robotic applications. While current volume-based view synthesis methods that use neural radiance fields (NeRFs) show promising results in reproducing the appeara… ▽ More

    Submitted 14 March, 2022; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: CVPR'22; Project page: https://dazinovic.github.io/neural-rgbd-surface-reconstruction/ Video: https://youtu.be/iWuSowPsC3g

  12. arXiv:2103.13415  [pdf, other

    cs.CV cs.GR

    Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields

    Authors: Jonathan T. Barron, Ben Mildenhall, Matthew Tancik, Peter Hedman, Ricardo Martin-Brualla, Pratul P. Srinivasan

    Abstract: The rendering procedure used by neural radiance fields (NeRF) samples a scene with a single ray per pixel and may therefore produce renderings that are excessively blurred or aliased when training or testing images observe scene content at different resolutions. The straightforward solution of supersampling by rendering with multiple rays per pixel is impractical for NeRF, because rendering each r… ▽ More

    Submitted 13 August, 2021; v1 submitted 24 March, 2021; originally announced March 2021.

  13. arXiv:2102.13090  [pdf, other

    cs.CV

    IBRNet: Learning Multi-View Image-Based Rendering

    Authors: Qianqian Wang, Zhicheng Wang, Kyle Genova, Pratul Srinivasan, Howard Zhou, Jonathan T. Barron, Ricardo Martin-Brualla, Noah Snavely, Thomas Funkhouser

    Abstract: We present a method that synthesizes novel views of complex scenes by interpolating a sparse set of nearby views. The core of our method is a network architecture that includes a multilayer perceptron and a ray transformer that estimates radiance and volume density at continuous 5D locations (3D spatial locations and 2D viewing directions), drawing appearance information on the fly from multiple s… ▽ More

    Submitted 6 April, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: CVPR 2021. Project page: https://ibrnet.github.io/

  14. arXiv:2102.08860  [pdf, other

    cs.CV cs.GR

    ShaRF: Shape-conditioned Radiance Fields from a Single View

    Authors: Konstantinos Rematas, Ricardo Martin-Brualla, Vittorio Ferrari

    Abstract: We present a method for estimating neural scenes representations of objects given only a single image. The core of our method is the estimation of a geometric scaffold for the object and its use as a guide for the reconstruction of the underlying radiance field. Our formulation is based on a generative process that first maps a latent code to a voxelized shape, and then renders it to an image, wit… ▽ More

    Submitted 23 June, 2021; v1 submitted 17 February, 2021; originally announced February 2021.

    Comments: Project page: http://www.krematas.com/sharf/index.html

  15. Time-Travel Rephotography

    Authors: Xuan Luo, Xuaner Zhang, Paul Yoo, Ricardo Martin-Brualla, Jason Lawrence, Steven M. Seitz

    Abstract: Many historical people were only ever captured by old, faded, black and white photos, that are distorted due to the limitations of early cameras and the passage of time. This paper simulates traveling back in time with a modern camera to rephotograph famous subjects. Unlike conventional image restoration filters which apply independent operations like denoising, colorization, and superresolution,… ▽ More

    Submitted 13 December, 2021; v1 submitted 22 December, 2020; originally announced December 2020.

    Comments: SIGGRAPH Asia 2021. Project Page: https://time-travel-rephotography.github.io Video: https://youtu.be/ceIopN2UZ_s

    Journal ref: ACM Transactions on Graphics. 40 (2021) 1-12

  16. arXiv:2012.10565  [pdf, other

    cs.CV cs.GR

    No Shadow Left Behind: Removing Objects and their Shadows using Approximate Lighting and Geometry

    Authors: Edward Zhang, Ricardo Martin-Brualla, Janne Kontkanen, Brian Curless

    Abstract: Removing objects from images is a challenging problem that is important for many applications, including mixed reality. For believable results, the shadows that the object casts should also be removed. Current inpainting-based methods only remove the object itself, leaving shadows behind, or at best require specifying shadow regions to inpaint. We introduce a deep learning pipeline for removing a… ▽ More

    Submitted 18 December, 2020; originally announced December 2020.

  17. arXiv:2011.12948  [pdf, other

    cs.CV cs.GR

    Nerfies: Deformable Neural Radiance Fields

    Authors: Keunhong Park, Utkarsh Sinha, Jonathan T. Barron, Sofien Bouaziz, Dan B Goldman, Steven M. Seitz, Ricardo Martin-Brualla

    Abstract: We present the first method capable of photorealistically reconstructing deformable scenes using photos/videos captured casually from mobile phones. Our approach augments neural radiance fields (NeRF) by optimizing an additional continuous volumetric deformation field that warps each observed point into a canonical 5D NeRF. We observe that these NeRF-like deformation fields are prone to local mini… ▽ More

    Submitted 9 September, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

    Comments: ICCV 2021, Project page with videos: https://nerfies.github.io/

  18. arXiv:2008.04852  [pdf, other

    cs.CV cs.GR cs.LG

    GeLaTO: Generative Latent Textured Objects

    Authors: Ricardo Martin-Brualla, Rohit Pandey, Sofien Bouaziz, Matthew Brown, Dan B Goldman

    Abstract: Accurate modeling of 3D objects exhibiting transparency, reflections and thin structures is an extremely challenging problem. Inspired by billboards and geometric proxies used in computer graphics, this paper proposes Generative Latent Textured Objects (GeLaTO), a compact representation that combines a set of coarse shape proxies defining low frequency geometry with learned neural textures, to enc… ▽ More

    Submitted 11 August, 2020; originally announced August 2020.

    Comments: ECCV 2020 Spotlight. Project website: https://gelato-paper.github.io

    Journal ref: European Conference on Computer Vision 2020

  19. arXiv:2008.02268  [pdf, other

    cs.CV cs.GR cs.LG

    NeRF in the Wild: Neural Radiance Fields for Unconstrained Photo Collections

    Authors: Ricardo Martin-Brualla, Noha Radwan, Mehdi S. M. Sajjadi, Jonathan T. Barron, Alexey Dosovitskiy, Daniel Duckworth

    Abstract: We present a learning-based method for synthesizing novel views of complex scenes using only unstructured collections of in-the-wild photographs. We build on Neural Radiance Fields (NeRF), which uses the weights of a multilayer perceptron to model the density and color of a scene as a function of 3D coordinates. While NeRF works well on images of static subjects captured under controlled settings,… ▽ More

    Submitted 6 January, 2021; v1 submitted 5 August, 2020; originally announced August 2020.

    Comments: Project website: https://nerf-w.github.io. Ricardo Martin-Brualla, Noha Radwan, and Mehdi S. M. Sajjadi contributed equally to this work. Updated with results for three additional scenes

  20. arXiv:2004.03805  [pdf, other

    cs.CV cs.GR

    State of the Art on Neural Rendering

    Authors: Ayush Tewari, Ohad Fried, Justus Thies, Vincent Sitzmann, Stephen Lombardi, Kalyan Sunkavalli, Ricardo Martin-Brualla, Tomas Simon, Jason Saragih, Matthias Nießner, Rohit Pandey, Sean Fanello, Gordon Wetzstein, Jun-Yan Zhu, Christian Theobalt, Maneesh Agrawala, Eli Shechtman, Dan B Goldman, Michael Zollhöfer

    Abstract: Efficient rendering of photo-realistic virtual worlds is a long standing effort of computer graphics. Modern graphics techniques have succeeded in synthesizing photo-realistic images from hand-crafted scene representations. However, the automatic generation of shape, materials, lighting, and other aspects of scenes remains a challenging problem that, if solved, would make photo-realistic computer… ▽ More

    Submitted 8 April, 2020; originally announced April 2020.

    Comments: Eurographics 2020 survey paper

  21. arXiv:1908.07732  [pdf, other

    cs.CV cs.GR

    KeystoneDepth: Visualizing History in 3D

    Authors: Xuan Luo, Yanmeng Kong, Jason Lawrence, Ricardo Martin-Brualla, Steve Seitz

    Abstract: This paper introduces the largest and most diverse collection of rectified stereo image pairs to the research community, KeystoneDepth, consisting of tens of thousands of stereographs of historical people, events, objects, and scenes between 1860 and 1963. Leveraging the Keystone-Mast raw scans from the California Museum of Photography, we apply multiple processing steps to produce clean stereo im… ▽ More

    Submitted 19 September, 2019; v1 submitted 21 August, 2019; originally announced August 2019.

    Comments: Project website: http://roxanneluo.github.io/KeystoneDepth.html , Video: https://youtu.be/5JrX-KKisC8 , More results: http://roxanneluo.github.io/keystonedepth_supplementary/index.html

  22. arXiv:1905.12162  [pdf, other

    cs.CV

    Volumetric Capture of Humans with a Single RGBD Camera via Semi-Parametric Learning

    Authors: Rohit Pandey, Anastasia Tkach, Shuoran Yang, Pavel Pidlypenskyi, Jonathan Taylor, Ricardo Martin-Brualla, Andrea Tagliasacchi, George Papandreou, Philip Davidson, Cem Keskin, Shahram Izadi, Sean Fanello

    Abstract: Volumetric (4D) performance capture is fundamental for AR/VR content generation. Whereas previous work in 4D performance capture has shown impressive results in studio settings, the technology is still far from being accessible to a typical consumer who, at best, might own a single RGBD sensor. Thus, in this work, we propose a method to synthesize free viewpoint renderings using a single RGBD came… ▽ More

    Submitted 28 May, 2019; originally announced May 2019.

  23. arXiv:1904.04290  [pdf, other

    cs.CV cs.GR

    Neural Rerendering in the Wild

    Authors: Moustafa Meshry, Dan B Goldman, Sameh Khamis, Hugues Hoppe, Rohit Pandey, Noah Snavely, Ricardo Martin-Brualla

    Abstract: We explore total scene capture -- recording, modeling, and rerendering a scene under varying appearance such as season and time of day. Starting from internet photos of a tourist landmark, we apply traditional 3D reconstruction to register the photos and approximate the scene as a point cloud. For each photo, we render the scene points into a deep framebuffer, and train a neural network to learn t… ▽ More

    Submitted 8 April, 2019; originally announced April 2019.

    Comments: To be presented at CVPR 2019 (oral). Supplementary video available at http://youtu.be/E1crWQn_kmY

  24. arXiv:1811.05029  [pdf, other

    cs.CV

    LookinGood: Enhancing Performance Capture with Real-time Neural Re-Rendering

    Authors: Ricardo Martin-Brualla, Rohit Pandey, Shuoran Yang, Pavel Pidlypenskyi, Jonathan Taylor, Julien Valentin, Sameh Khamis, Philip Davidson, Anastasia Tkach, Peter Lincoln, Adarsh Kowdle, Christoph Rhemann, Dan B Goldman, Cem Keskin, Steve Seitz, Shahram Izadi, Sean Fanello

    Abstract: Motivated by augmented and virtual reality applications such as telepresence, there has been a recent focus in real-time performance capture of humans under motion. However, given the real-time constraint, these systems often suffer from artifacts in geometry and texture such as holes and noise in the final rendering, poor lighting, and low-resolution textures. We take the novel approach to augmen… ▽ More

    Submitted 12 November, 2018; originally announced November 2018.

    Comments: The supplementary video is available at: http://youtu.be/Md3tdAKoLGU To be presented at SIGGRAPH Asia 2018

  25. arXiv:1511.03019  [pdf, other

    cs.CV

    3D Time-lapse Reconstruction from Internet Photos

    Authors: Ricardo Martin-Brualla, David Gallup, Steven M. Seitz

    Abstract: Given an Internet photo collection of a landmark, we compute a 3D time-lapse video sequence where a virtual camera moves continuously in time and space. While previous work assumed a static camera, the addition of camera motion during the time-lapse creates a very compelling impression of parallax. Achieving this goal, however, requires addressing multiple technical challenges, including solving f… ▽ More

    Submitted 21 February, 2020; v1 submitted 10 November, 2015; originally announced November 2015.

    Comments: To appear in ICCV'15. Supplementary video at: http://grail.cs.washington.edu/projects/timelapse3d/