Zum Hauptinhalt springen

Showing 1–50 of 50 results for author: Agapito, L

.
  1. arXiv:2408.16061  [pdf, other

    cs.CV

    3D Reconstruction with Spatial Memory

    Authors: Hengyi Wang, Lourdes Agapito

    Abstract: We present Spann3R, a novel approach for dense 3D reconstruction from ordered or unordered image collections. Built on the DUSt3R paradigm, Spann3R uses a transformer-based architecture to directly regress pointmaps from images without any prior knowledge of the scene or camera parameters. Unlike DUSt3R, which predicts per image-pair pointmaps each expressed in its local coordinate frame, Spann3R… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: Project page: \url{https://hengyiwang.github.io/projects/spanner}

  2. arXiv:2406.07169  [pdf, other

    cs.CV

    RecMoDiffuse: Recurrent Flow Diffusion for Human Motion Generation

    Authors: Mirgahney Mohamed, Harry Jake Cunningham, Marc P. Deisenroth, Lourdes Agapito

    Abstract: Human motion generation has paramount importance in computer animation. It is a challenging generative temporal modelling task due to the vast possibilities of human motion, high human sensitivity to motion coherence and the difficulty of accurately generating fine-grained motions. Recently, diffusion methods have been proposed for human motion generation due to their high sample quality and expre… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 20 pages, 6 figures

  3. arXiv:2405.19331  [pdf, other

    cs.CV cs.AI cs.GR

    NPGA: Neural Parametric Gaussian Avatars

    Authors: Simon Giebenhain, Tobias Kirschstein, Martin Rünz, Lourdes Agapito, Matthias Nießner

    Abstract: The creation of high-fidelity, digital versions of human heads is an important stepping stone in the process of further integrating virtual components into our everyday lives. Constructing such avatars is a challenging research problem, due to a high demand for photo-realism and real-time rendering performance. In this work, we propose Neural Parametric Gaussian Avatars (NPGA), a data-driven appro… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Project Page: see https://simongiebenhain.github.io/NPGA/ ; Youtube Video: see https://www.youtube.com/watch?v=NGRxAYbIkus

  4. arXiv:2404.01053  [pdf, other

    cs.CV

    HAHA: Highly Articulated Gaussian Human Avatars with Textured Mesh Prior

    Authors: David Svitov, Pietro Morerio, Lourdes Agapito, Alessio Del Bue

    Abstract: We present HAHA - a novel approach for animatable human avatar generation from monocular input videos. The proposed method relies on learning the trade-off between the use of Gaussian splatting and a textured mesh for efficient and high fidelity rendering. We demonstrate its efficiency to animate and render full-body human avatars controlled via the SMPL-X parametric model. Our model learns to app… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  5. arXiv:2312.08568  [pdf, other

    cs.CV

    NViST: In the Wild New View Synthesis from a Single Image with Transformers

    Authors: Wonbong Jang, Lourdes Agapito

    Abstract: We propose NViST, a transformer-based model for efficient and generalizable novel-view synthesis from a single image for real-world scenes. In contrast to many methods that are trained on synthetic data, object-centred scenarios, or in a category-specific manner, NViST is trained on MVImgNet, a large-scale dataset of casually-captured real-world videos of hundreds of object categories with diverse… ▽ More

    Submitted 1 April, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: CVPR 2024, Project page: https://wbjang.github.io/nvist_webpage

  6. arXiv:2312.06740  [pdf, other

    cs.CV

    MonoNPHM: Dynamic Head Reconstruction from Monocular Videos

    Authors: Simon Giebenhain, Tobias Kirschstein, Markos Georgopoulos, Martin Rünz, Lourdes Agapito, Matthias Nießner

    Abstract: We present Monocular Neural Parametric Head Models (MonoNPHM) for dynamic 3D head reconstructions from monocular RGB videos. To this end, we propose a latent appearance space that parameterizes a texture field on top of a neural parametric model. We constrain predicted color values to be correlated with the underlying geometry such that gradients from RGB effectively influence latent geometry code… ▽ More

    Submitted 29 May, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: Project Page: see https://simongiebenhain.github.io/MonoNPHM/ ; Video: see https://youtu.be/n-wjaC3UIeE

  7. arXiv:2312.00778  [pdf, other

    cs.CV

    MorpheuS: Neural Dynamic 360° Surface Reconstruction from Monocular RGB-D Video

    Authors: Hengyi Wang, Jingwen Wang, Lourdes Agapito

    Abstract: Neural rendering has demonstrated remarkable success in dynamic scene reconstruction. Thanks to the expressiveness of neural representations, prior works can accurately capture the motion and achieve high-fidelity reconstruction of the target object. Despite this, real-world video scenarios often feature large unobserved regions where neural representations struggle to achieve realistic completion… ▽ More

    Submitted 4 April, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: CVPR2024. Project page: https://hengyiwang.github.io/projects/morpheus

  8. arXiv:2311.08159  [pdf, other

    cs.CV

    DynamicSurf: Dynamic Neural RGB-D Surface Reconstruction with an Optimizable Feature Grid

    Authors: Mirgahney Mohamed, Lourdes Agapito

    Abstract: We propose DynamicSurf, a model-free neural implicit surface reconstruction method for high-fidelity 3D modelling of non-rigid surfaces from monocular RGB-D video. To cope with the lack of multi-view cues in monocular sequences of deforming surfaces, one of the most challenging settings for 3D reconstruction, DynamicSurf exploits depth, surface normals, and RGB losses to improve reconstruction fid… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  9. Task-guided Domain Gap Reduction for Monocular Depth Prediction in Endoscopy

    Authors: Anita Rau, Binod Bhattarai, Lourdes Agapito, Danail Stoyanov

    Abstract: Colorectal cancer remains one of the deadliest cancers in the world. In recent years computer-aided methods have aimed to enhance cancer screening and improve the quality and availability of colonoscopies by automatizing sub-tasks. One such task is predicting depth from monocular video frames, which can assist endoscopic navigation. As ground truth depth from standard in-vivo colonoscopy remains u… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: First Data Engineering in Medical Imaging Workshop at MICCAI 2023

    Journal ref: Lecture Notes in Computer Science, vol 14314. 2023. Springer, Cham

  10. arXiv:2308.15975  [pdf, other

    cs.RO cs.AI cs.CV

    RoboTAP: Tracking Arbitrary Points for Few-Shot Visual Imitation

    Authors: Mel Vecerik, Carl Doersch, Yi Yang, Todor Davchev, Yusuf Aytar, Guangyao Zhou, Raia Hadsell, Lourdes Agapito, Jon Scholz

    Abstract: For robots to be useful outside labs and specialized factories we need a way to teach them new useful behaviors quickly. Current approaches lack either the generality to onboard new tasks without task-specific engineering, or else lack the data-efficiency to do so in an amount of time that enables practical use. In this work we explore dense tracking as a representational vehicle to allow faster a… ▽ More

    Submitted 31 August, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: Project website: https://robotap.github.io

  11. arXiv:2306.16585  [pdf, other

    cs.CV cs.RO

    SeMLaPS: Real-time Semantic Mapping with Latent Prior Networks and Quasi-Planar Segmentation

    Authors: Jingwen Wang, Juan Tarrio, Lourdes Agapito, Pablo F. Alcantarilla, Alexander Vakhitov

    Abstract: The availability of real-time semantics greatly improves the core geometric functionality of SLAM systems, enabling numerous robotic and AR/VR applications. We present a new methodology for real-time semantic mapping from RGB-D sequences that combines a 2D neural network and a 3D network based on a SLAM system with 3D occupancy mapping. When segmenting a new frame we perform latent feature re-proj… ▽ More

    Submitted 13 October, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: RA-L 2023. 8 pages, 7 figures. Project page: http://jingwenwang95.github.io/SeMLaPS

  12. arXiv:2305.06356  [pdf, other

    cs.CV cs.GR cs.LG

    HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion

    Authors: Mustafa Işık, Martin Rünz, Markos Georgopoulos, Taras Khakhulin, Jonathan Starck, Lourdes Agapito, Matthias Nießner

    Abstract: Representing human performance at high-fidelity is an essential building block in diverse applications, such as film production, computer games or videoconferencing. To close the gap to production-level quality, we introduce HumanRF, a 4D dynamic neural scene representation that captures full-body appearance in motion from multi-view video input, and enables playback from novel, unseen viewpoints.… ▽ More

    Submitted 11 May, 2023; v1 submitted 10 May, 2023; originally announced May 2023.

    Comments: Project webpage: https://synthesiaresearch.github.io/humanrf Dataset webpage: https://www.actors-hq.com/ Video: https://www.youtube.com/watch?v=OTnhiLLE7io Code: https://github.com/synthesiaresearch/humanrf

  13. arXiv:2304.14377  [pdf, other

    cs.CV

    Co-SLAM: Joint Coordinate and Sparse Parametric Encodings for Neural Real-Time SLAM

    Authors: Hengyi Wang, Jingwen Wang, Lourdes Agapito

    Abstract: We present Co-SLAM, a neural RGB-D SLAM system based on a hybrid representation, that performs robust camera tracking and high-fidelity surface reconstruction in real time. Co-SLAM represents the scene as a multi-resolution hash-grid to exploit its high convergence speed and ability to represent high-frequency local features. In addition, Co-SLAM incorporates one-blob encoding, to encourage surfac… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

    Comments: CVPR2023. First two authors contributed equally. Project page: https://hengyiwang.github.io/projects/CoSLAM

  14. arXiv:2212.02761  [pdf, other

    cs.CV

    Learning Neural Parametric Head Models

    Authors: Simon Giebenhain, Tobias Kirschstein, Markos Georgopoulos, Martin Rünz, Lourdes Agapito, Matthias Nießner

    Abstract: We propose a novel 3D morphable model for complete human heads based on hybrid neural fields. At the core of our model lies a neural parametric representation that disentangles identity and expressions in disjoint latent spaces. To this end, we capture a person's identity in a canonical space as a signed distance field (SDF), and model facial expressions with a neural deformation field. In additio… ▽ More

    Submitted 14 April, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: Project Page: https://simongiebenhain.github.io/NPHM ; Project Video: https://www.youtube.com/watch?v=0mDk2tFOJCg ; Camer-Ready Version; Added Experiments

  15. arXiv:2209.10621  [pdf, other

    cs.CV

    GNPM: Geometric-Aware Neural Parametric Models

    Authors: Mirgahney Mohamed, Lourdes Agapito

    Abstract: We propose Geometric Neural Parametric Models (GNPM), a learned parametric model that takes into account the local structure of data to learn disentangled shape and pose latent spaces of 4D dynamics, using a geometric-aware architecture on point clouds. Temporally consistent 3D deformations are estimated without the need for dense correspondences at training time, by exploiting cycle consistency.… ▽ More

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: 10 pages, 8 figures

    ACM Class: I.2.10

  16. arXiv:2209.07147  [pdf, other

    cs.CV cs.RO

    One-Shot Transfer of Affordance Regions? AffCorrs!

    Authors: Denis Hadjivelichkov, Sicelukwanda Zwane, Marc Peter Deisenroth, Lourdes Agapito, Dimitrios Kanoulas

    Abstract: In this work, we tackle one-shot visual search of object parts. Given a single reference image of an object with annotated affordance regions, we segment semantically corresponding parts within a target scene. We propose AffCorrs, an unsupervised model that combines the properties of pre-trained DINO-ViT's image descriptors and cyclic correspondences. We use AffCorrs to find corresponding affordan… ▽ More

    Submitted 16 September, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: Published in Conference on Robot Learning, 2022 For code and dataset, refer to https://sites.google.com/view/affcorrs

  17. arXiv:2206.14735  [pdf, other

    cs.CV

    GO-Surf: Neural Feature Grid Optimization for Fast, High-Fidelity RGB-D Surface Reconstruction

    Authors: Jingwen Wang, Tymoteusz Bleja, Lourdes Agapito

    Abstract: We present GO-Surf, a direct feature grid optimization method for accurate and fast surface reconstruction from RGB-D sequences. We model the underlying scene with a learned hierarchical feature voxel grid that encapsulates multi-level geometric and appearance local information. Feature vectors are directly optimized such that after being tri-linearly interpolated, decoded by two shallow MLPs into… ▽ More

    Submitted 17 September, 2022; v1 submitted 29 June, 2022; originally announced June 2022.

    Comments: 3DV2022 (Oral), first two authors contributed equally. Project page: https://jingwenwang95.github.io/go_surf/

  18. Bimodal Camera Pose Prediction for Endoscopy

    Authors: Anita Rau, Binod Bhattarai, Lourdes Agapito, Danail Stoyanov

    Abstract: Deducing the 3D structure of endoscopic scenes from images is exceedingly challenging. In addition to deformation and view-dependent lighting, tubular structures like the colon present problems stemming from their self-occluding and repetitive anatomical structure. In this paper, we propose SimCol, a synthetic dataset for camera pose estimation in colonoscopy, and a novel method that explicitly le… ▽ More

    Submitted 15 December, 2023; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: This article has been accepted for publication in IEEE Transactions on Medical Robotics and Bionics. This is the author's version which has not been fully edited and content may change prior to final publication. Citation information: DOI 10.1109/TMRB.2023.3320267

  19. arXiv:2112.04910  [pdf, other

    cs.RO cs.CV

    Few-Shot Keypoint Detection as Task Adaptation via Latent Embeddings

    Authors: Mel Vecerik, Jackie Kay, Raia Hadsell, Lourdes Agapito, Jon Scholz

    Abstract: Dense object tracking, the ability to localize specific object points with pixel-level accuracy, is an important computer vision task with numerous downstream applications in robotics. Existing approaches either compute dense keypoint embeddings in a single forward pass, meaning the model is trained to track everything at once, or allocate their full capacity to a sparse predefined set of points,… ▽ More

    Submitted 13 December, 2021; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: Supplementary material available at: https://sites.google.com/view/2021-tack

  20. arXiv:2109.01750  [pdf, other

    cs.GR cs.CV cs.LG

    CodeNeRF: Disentangled Neural Radiance Fields for Object Categories

    Authors: Wonbong Jang, Lourdes Agapito

    Abstract: CodeNeRF is an implicit 3D neural representation that learns the variation of object shapes and textures across a category and can be trained, from a set of posed images, to synthesize novel views of unseen objects. Unlike the original NeRF, which is scene specific, CodeNeRF learns to disentangle shape and texture by learning separate embeddings. At test time, given a single unposed image of an un… ▽ More

    Submitted 3 September, 2021; originally announced September 2021.

    Comments: 10 pages, 15 figures, ICCV 2021

  21. arXiv:2108.09481  [pdf, other

    cs.CV cs.RO

    DSP-SLAM: Object Oriented SLAM with Deep Shape Priors

    Authors: Jingwen Wang, Martin Rünz, Lourdes Agapito

    Abstract: We propose DSP-SLAM, an object-oriented SLAM system that builds a rich and accurate joint map of dense 3D models for foreground objects, and sparse landmark points to represent the background. DSP-SLAM takes as input the 3D point cloud reconstructed by a feature-based SLAM system and equips it with the ability to enhance its sparse map with dense reconstructions of detected objects. Objects are de… ▽ More

    Submitted 22 October, 2021; v1 submitted 21 August, 2021; originally announced August 2021.

    Comments: To be published at 3DV 2021

  22. arXiv:2104.09283  [pdf, other

    cs.CV

    Multi-person Implicit Reconstruction from a Single Image

    Authors: Armin Mustafa, Akin Caliskan, Lourdes Agapito, Adrian Hilton

    Abstract: We present a new end-to-end learning framework to obtain detailed and spatially coherent reconstructions of multiple people from a single image. Existing multi-person methods suffer from two main drawbacks: they are often model-based and therefore cannot capture accurate 3D models of people with loose clothing and hair; or they require manual intervention to resolve occlusions or interactions. Our… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: To appear in The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2021

  23. SelfPose: 3D Egocentric Pose Estimation from a Headset Mounted Camera

    Authors: Denis Tome, Thiemo Alldieck, Patrick Peluse, Gerard Pons-Moll, Lourdes Agapito, Hernan Badino, Fernando De la Torre

    Abstract: We present a solution to egocentric 3D body pose estimation from monocular images captured from downward looking fish-eye cameras installed on the rim of a head mounted VR device. This unusual viewpoint leads to images with unique visual appearance, with severe self-occlusions and perspective distortions that result in drastic differences in resolution between lower and upper body. We propose an e… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: 14 pages. arXiv admin note: substantial text overlap with arXiv:1907.10045

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020

  24. arXiv:2009.14711  [pdf, other

    cs.RO cs.CV cs.LG

    S3K: Self-Supervised Semantic Keypoints for Robotic Manipulation via Multi-View Consistency

    Authors: Mel Vecerik, Jean-Baptiste Regli, Oleg Sushkov, David Barker, Rugile Pevceviciute, Thomas Rothörl, Christopher Schuster, Raia Hadsell, Lourdes Agapito, Jonathan Scholz

    Abstract: A robot's ability to act is fundamentally constrained by what it can perceive. Many existing approaches to visual representation learning utilize general-purpose training criteria, e.g. image reconstruction, smoothness in latent space, or usefulness for control, or else make use of large datasets annotated with specific features (bounding boxes, segmentations, etc.). However, both approaches often… ▽ More

    Submitted 13 October, 2020; v1 submitted 30 September, 2020; originally announced September 2020.

    Comments: 11 pages, supplementary material available at: https://sites.google.com/view/2020-s3k/home

  25. arXiv:2008.10634  [pdf, other

    cs.CV

    DiverseNet: When One Right Answer is not Enough

    Authors: Michael Firman, Neill D. F. Campbell, Lourdes Agapito, Gabriel J. Brostow

    Abstract: Many structured prediction tasks in machine vision have a collection of acceptable answers, instead of one definitive ground truth answer. Segmentation of images, for example, is subject to human labeling bias. Similarly, there are multiple possible pixel values that could plausibly complete occluded image regions. State-of-the art supervised learning methods are typically optimized to make a sing… ▽ More

    Submitted 24 August, 2020; originally announced August 2020.

    Comments: Presented at CVPR 2018

  26. arXiv:2005.05125  [pdf, other

    cs.CV

    FroDO: From Detections to 3D Objects

    Authors: Kejie Li, Martin Rünz, Meng Tang, Lingni Ma, Chen Kong, Tanner Schmidt, Ian Reid, Lourdes Agapito, Julian Straub, Steven Lovegrove, Richard Newcombe

    Abstract: Object-oriented maps are important for scene understanding since they jointly capture geometry and semantics, allow individual instantiation and meaningful reasoning about objects. We introduce FroDO, a method for accurate 3D reconstruction of object instances from RGB video that infers object location, pose and shape in a coarse-to-fine manner. Key to FroDO is to embed object shapes in a novel le… ▽ More

    Submitted 11 May, 2020; originally announced May 2020.

    Comments: To be published in CVPR 2020. The first two authors contributed equally

  27. arXiv:1907.10045  [pdf, other

    cs.CV

    xR-EgoPose: Egocentric 3D Human Pose from an HMD Camera

    Authors: Denis Tome, Patrick Peluse, Lourdes Agapito, Hernan Badino

    Abstract: We present a new solution to egocentric 3D body pose estimation from monocular images captured from a downward looking fish-eye camera installed on the rim of a head mounted virtual reality device. This unusual viewpoint, just 2 cm. away from the user's face, leads to images with unique visual appearance, characterized by severe self-occlusions and strong perspective distortions that result in a d… ▽ More

    Submitted 23 July, 2019; originally announced July 2019.

    Comments: ICCV 2019

  28. arXiv:1811.01068  [pdf, other

    cs.CV

    3D Pick & Mix: Object Part Blending in Joint Shape and Image Manifolds

    Authors: Adrian Penate-Sanchez, Lourdes Agapito

    Abstract: We present 3D Pick & Mix, a new 3D shape retrieval system that provides users with a new level of freedom to explore 3D shape and Internet image collections by introducing the ability to reason about objects at the level of their constituent parts. While classic retrieval systems can only formulate simple searches such as "find the 3D model that is most similar to the input image" our new approach… ▽ More

    Submitted 2 November, 2018; originally announced November 2018.

  29. arXiv:1808.01525  [pdf, other

    cs.CV

    Rethinking Pose in 3D: Multi-stage Refinement and Recovery for Markerless Motion Capture

    Authors: Denis Tome, Matteo Toso, Lourdes Agapito, Chris Russell

    Abstract: We propose a CNN-based approach for multi-camera markerless motion capture of the human body. Unlike existing methods that first perform pose estimation on individual cameras and generate 3D models as post-processing, our approach makes use of 3D reasoning throughout a multi-stage approach. This novelty allows us to use provisional 3D models of human pose to rethink where the joints should be loca… ▽ More

    Submitted 4 August, 2018; originally announced August 2018.

    Comments: International Conference on 3DVision (3dv)

  30. arXiv:1804.09194  [pdf, other

    cs.CV cs.RO

    MaskFusion: Real-Time Recognition, Tracking and Reconstruction of Multiple Moving Objects

    Authors: Martin Rünz, Maud Buffier, Lourdes Agapito

    Abstract: We present MaskFusion, a real-time, object-aware, semantic and dynamic RGB-D SLAM system that goes beyond traditional systems which output a purely geometric map of a static scene. MaskFusion recognizes, segments and assigns semantic class labels to different objects in the scene, while tracking and reconstructing them even when they move independently from the camera. As an RGB-D camera scans a… ▽ More

    Submitted 22 October, 2018; v1 submitted 24 April, 2018; originally announced April 2018.

    Comments: Presented at IEEE International Symposium on Mixed and Augmented Reality (ISMAR) 2018

  31. arXiv:1804.01050  [pdf, other

    stat.ML cs.CV cs.LG

    Training VAEs Under Structured Residuals

    Authors: Garoe Dorta, Sara Vicente, Lourdes Agapito, Neill D. F. Campbell, Ivor Simpson

    Abstract: Variational auto-encoders (VAEs) are a popular and powerful deep generative model. Previous works on VAEs have assumed a factorized likelihood model, whereby the output uncertainty of each pixel is assumed to be independent. This approximation is clearly limited as demonstrated by observing a residual image from a VAE reconstruction, which often possess a high level of structure. This paper demons… ▽ More

    Submitted 31 July, 2018; v1 submitted 3 April, 2018; originally announced April 2018.

    Comments: Simplified training methodology, added more results

  32. Ab Initio Electron-Phonon Interactions Using Atomic Orbital Wavefunctions

    Authors: Luis A. Agapito, Marco Bernardi

    Abstract: The interaction between electrons and lattice vibrations determines key physical properties of materials, including their electrical and heat transport, excited electron dynamics, phase transitions, and superconductivity. We present a new ab initio method that employs atomic orbital (AO) wavefunctions to compute the electron-phonon (e-ph) interactions in materials and interpolate the e-ph coupling… ▽ More

    Submitted 16 March, 2018; originally announced March 2018.

    Journal ref: Phys. Rev. B 97, 235146 (2018)

  33. arXiv:1802.07079  [pdf, other

    stat.ML

    Structured Uncertainty Prediction Networks

    Authors: Garoe Dorta, Sara Vicente, Lourdes Agapito, Neill D. F. Campbell, Ivor Simpson

    Abstract: This paper is the first work to propose a network to predict a structured uncertainty distribution for a synthesized image. Previous approaches have been mostly limited to predicting diagonal covariance matrices. Our novel model learns to predict a full Gaussian covariance matrix for each reconstruction, which permits efficient sampling and likelihood evaluation. We demonstrate that our model ca… ▽ More

    Submitted 23 March, 2018; v1 submitted 20 February, 2018; originally announced February 2018.

    Comments: CVPR 2018 (final version)

  34. Charge Transport in Organic Molecular Semiconductors from First Principles: The Band-Like Hole Mobility in Naphthalene Crystal

    Authors: Nien-En Lee, Jin-Jian Zhou, Luis A. Agapito, Marco Bernardi

    Abstract: Predicting charge transport in organic molecular crystals is notoriously challenging. Carrier mobility calculations in organic semiconductors are dominated by quantum chemistry methods based on charge hopping, which are laborious and only moderately accurate. We compute from first principles the electron-phonon scattering and the phonon-limited hole mobility of naphthalene crystal in the framework… ▽ More

    Submitted 12 March, 2018; v1 submitted 1 December, 2017; originally announced December 2017.

    Comments: 7 pages, 4 figures, Accepted by Phys. Rev. B

    Journal ref: Phys. Rev. B 97, 115203 (2018)

  35. arXiv:1712.00422  [pdf, other

    cond-mat.mtrl-sci

    The AFLOW Fleet for Materials Discovery

    Authors: Cormac Toher, Corey Oses, David Hicks, Eric Gossett, Frisco Rose, Pinku Nath, Demet Usanmaz, Denise C. Ford, Eric Perim, Camilo E. Calderon, Jose J. Plata, Yoav Lederer, Michal Jahnátek, Wahyu Setyawan, Shidong Wang, Junkai Xue, Kevin Rasch, Roman V. Chepulskii, Richard H. Taylor, Geena Gomez, Harvey Shi, Andrew R. Supka, Rabih Al Rahal Al Orabi, Priya Gopal, Frank T. Cerasoli , et al. (26 additional authors not shown)

    Abstract: The traditional paradigm for materials discovery has been recently expanded to incorporate substantial data driven research. With the intent to accelerate the development and the deployment of new technologies, the AFLOW Fleet for computational materials design automates high-throughput first principles calculations, and provides tools for data verification and dissemination for a broad community… ▽ More

    Submitted 1 December, 2017; originally announced December 2017.

    Comments: 14 pages, 8 figures

  36. arXiv:1708.01654  [pdf, other

    cs.CV

    Better Together: Joint Reasoning for Non-rigid 3D Reconstruction with Specularities and Shading

    Authors: Qi Liu-Yin, Rui Yu, Lourdes Agapito, Andrew Fitzgibbon, Chris Russell

    Abstract: We demonstrate the use of shape-from-shading (SfS) to improve both the quality and the robustness of 3D reconstruction of dynamic objects captured by a single camera. Unlike previous approaches that made use of SfS as a post-processing step, we offer a principled integrated approach that solves dynamic object tracking and reconstruction and SfS as a single unified cost function. Moving beyond Lamb… ▽ More

    Submitted 4 August, 2017; originally announced August 2017.

    Comments: Submitted to IJCV

  37. Co-Fusion: Real-time Segmentation, Tracking and Fusion of Multiple Objects

    Authors: Martin Rünz, Lourdes Agapito

    Abstract: In this paper we introduce Co-Fusion, a dense SLAM system that takes a live stream of RGB-D images as input and segments the scene into different objects (using either motion or semantic cues) while simultaneously tracking and reconstructing their 3D shape in real time. We use a multiple model fitting approach where each object can move independently from the background and still be effectively tr… ▽ More

    Submitted 20 June, 2017; originally announced June 2017.

    Comments: International Conference on Robotics and Automation (ICRA) 2017, http://visual.cs.ucl.ac.uk/pubs/cofusion, https://github.com/martinruenz/co-fusion

  38. Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image

    Authors: Denis Tome, Chris Russell, Lourdes Agapito

    Abstract: We propose a unified formulation for the problem of 3D human pose estimation from a single raw RGB image that reasons jointly about 2D joint estimation and 3D pose reconstruction to improve both tasks. We take an integrated approach that fuses probabilistic knowledge of 3D human pose with a multi-stage CNN architecture and uses the knowledge of plausible 3D landmark locations to refine the search… ▽ More

    Submitted 11 October, 2017; v1 submitted 1 January, 2017; originally announced January 2017.

    Comments: Paper presented at CVPR 17

  39. Accurate $ab~initio$ tight-binding Hamiltonians: effective tools for electronic transport and optical spectroscopy from first principles

    Authors: Pino D'Amico, Luis A. Agapito, Alessandra Catellani, Alice Ruini, Stefano Curtarolo, Marco Fornari, Marco Buongiorno Nardelli, Arrigo Calzolari

    Abstract: The calculations of electronic transport coefficients and optical properties require a very dense interpolation of the electronic band structure in reciprocal space that is computationally expensive and may have issues with band crossing and degeneracies. Capitalizing on a recently developed pseudo-atomic orbital projection technique, we exploit the exact tight-binding representation of the first… ▽ More

    Submitted 19 August, 2016; originally announced August 2016.

    Journal ref: Phys. Rev. B 94, 165166 (2016)

  40. Accurate Tight-Binding Hamiltonians for 2D and Layered Materials

    Authors: Luis Agapito, Marco Fornari, Davide Ceresoli, Andrea Ferretti, Stefano Curtarolo, Marco Buongiorno Nardelli

    Abstract: We present a scheme to controllably improve the accuracy of tight-binding Hamiltonian matrices derived by projecting the solutions of plane-wave ab initio calculations on atomic orbital basis sets. By systematically increasing the completeness of the basis set of atomic orbitals, we are able to optimize the quality of the band structure interpolation over wide energy ranges including unoccupied st… ▽ More

    Submitted 11 January, 2016; originally announced January 2016.

  41. arXiv:1511.04472  [pdf, other

    cs.CV

    Solving Jigsaw Puzzles with Linear Programming

    Authors: Rui Yu, Chris Russell, Lourdes Agapito

    Abstract: We propose a novel Linear Program (LP) based formula- tion for solving jigsaw puzzles. We formulate jigsaw solving as a set of successive global convex relaxations of the stan- dard NP-hard formulation, that can describe both jigsaws with pieces of unknown position and puzzles of unknown po- sition and orientation. The main contribution and strength of our approach comes from the LP assembly strat… ▽ More

    Submitted 13 November, 2015; originally announced November 2015.

  42. Accurate tight-binding Hamiltonian matrices from ab-initio calculations: Minimal basis sets

    Authors: Luis A. Agapito, Sohrab Ismail-Beigi. Stefano Curtarolo, Marco Fornari, Marco Buongiorno Nardelli

    Abstract: Projection of Bloch states obtained from quantum-mechanical calculations onto atomic orbitals is the fastest scheme to construct ab-initio tight-binding Hamiltonian matrices. However, the presence of spurious states and unphysical hybridizations of the tight-binding eigenstates has hindered the applicability of this construction. Here we demonstrate that those spurious effects are due to the inclu… ▽ More

    Submitted 19 October, 2015; v1 submitted 8 September, 2015; originally announced September 2015.

    Journal ref: Phys. Rev. B 93, 035104 (2016)

  43. arXiv:1505.05245  [pdf, other

    cond-mat.mtrl-sci

    Improved predictions of the physical properties of Zn- and Cd-based wide band-gap semiconductors: a validation of the ACBN0 functional

    Authors: Priya Gopal, Marco Fornari, Stefano Curtarolo, Luis A. Agapito, Laalitha S. I. Liyanage, Marco Buongiorno Nardelli

    Abstract: We study the physical properties of Zn$X$ ($X$=O, S, Se, Te) and Cd$X$ ($X$=O, S, Se, Te) in the zinc-blende, rock-salt, and wurtzite structures using the recently developed fully $ab$ $initio$ pseudo-hybrid Hubbard density functional ACBN0. We find that both the electronic and vibrational properties of these wide-band gap semiconductors are systematically improved over the PBE values and reproduc… ▽ More

    Submitted 20 May, 2015; originally announced May 2015.

    Comments: 6 figures, 8 tables

  44. arXiv:1503.06465  [pdf, other

    cs.CV

    Lifting Object Detection Datasets into 3D

    Authors: Joao Carreira, Sara Vicente, Lourdes Agapito, Jorge Batista

    Abstract: While data has certainly taken the center stage in computer vision in recent years, it can still be difficult to obtain in certain scenarios. In particular, acquiring ground truth 3D shapes of objects pictured in 2D images remains a challenging feat and this has hampered progress in recognition-based object reconstruction from a single image. Here we propose to bypass previous solutions such as 3D… ▽ More

    Submitted 31 July, 2016; v1 submitted 22 March, 2015; originally announced March 2015.

  45. arXiv:1406.3259  [pdf, other

    cond-mat.str-el cond-mat.mtrl-sci

    Reformulation of DFT+U as a pseudo-hybrid Hubbard density functional

    Authors: Luis A. Agapito, Stefano Curtarolo, Marco Buongiorno Nardelli

    Abstract: The accurate prediction of the electronic properties of materials at a low computational expense is a necessary conditions for the development of effective high-throughput quantum-mechanics (HTQM) frameworks for accelerated materials discovery. HTQM infrastructures rely on the predictive capability of Density Functional Theory (DFT), the method of choice for the first principles study of materials… ▽ More

    Submitted 21 October, 2014; v1 submitted 12 June, 2014; originally announced June 2014.

    Comments: 16 pages, 6 figures, 4 tables

  46. arXiv:1310.0060  [pdf, ps, other

    cond-mat.mes-hall

    Effective and accurate representation of extended Bloch states on finite Hilbert spaces

    Authors: Luis A. Agapito, Andrea Ferretti, Arrigo Calzolari, Stefano Curtarolo, Marco Buongiorno Nardelli

    Abstract: We present a straightforward, noniterative projection scheme that can represent the electronic ground state of a periodic system on a finite atomic-orbital-like basis, up to a predictable number of electronic states and with controllable accuracy. By co-filtering the projections of plane-wave Bloch states with high-kinetic-energy components, the richness of the finite space and thus the number of… ▽ More

    Submitted 30 September, 2013; originally announced October 2013.

  47. Strain-induced topological insulator phase transition in HgSe

    Authors: Lars Winterfeld, Luis A. Agapito, Jin Li, Nicholas Kioussis, Peter Blaha, Yong P. Chen

    Abstract: Using ab initio electronic structure calculations we investigate the change of the band structure and the nu_0 topological invariant in HgSe (non-centrosymmetric system) under two different type of uniaxial strain along the [001] and [110] directions, respectively. Both compressive [001] and [110] strain leads to the opening of a (crystal field) band gap (with a maximum value of about 37 meV) in t… ▽ More

    Submitted 17 February, 2013; originally announced February 2013.

    Comments: 8 pages, 8 figures

  48. Aviram-Ratner rectifying mechanism for DNA base pair sequencing through graphene nanogaps

    Authors: Luis A. Agapito, Jacob Gayles, Christian Wolowiec, Nicholas Kioussis

    Abstract: We demonstrate that biological molecules such as Watson-Crick DNA base pairs can behave as biological Aviram-Ratner electrical rectifiers because of the spatial separation and weak hydrogen bonding between the nucleobases. We have performed a parallel computational implementation of the ab-initio non-equilibrium Green's function (NEGF) theory to determine the electrical response of graphene---base… ▽ More

    Submitted 17 February, 2012; v1 submitted 30 December, 2011; originally announced January 2012.

    Journal ref: Nanotechnology, Vol 23, Page 135202, Year 2012

  49. arXiv:1105.5672  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Approaching the Intrinsic Bandgap in Suspended High-Mobility Graphene Nanoribbons

    Authors: Ming-Wei Lin, Cheng Ling, Luis A. Agapito, Nicholas Kioussis, Yiyang Zhang, Mark Ming-Cheng Cheng, Wei L. Wang, Efthimios Kaxiras, Zhixian Zhou

    Abstract: We report electrical transport measurements on a suspended ultra-low-disorder graphene nanoribbon(GNR) with nearly atomically smooth edges that reveal a high mobility exceeding 3000 cm2 V-1 s-1 and an intrinsic band gap. The experimentally derived bandgap is in quantitative agreement with the results of our electronic-structure calculations on chiral GNRs with comparable width taking into account… ▽ More

    Submitted 27 May, 2011; originally announced May 2011.

    Comments: 22 pages, 6 figures

  50. arXiv:1104.1599  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Room-Temperature High On/Off Ratio in Suspended Graphene Nanoribbon Field Effect Transistors

    Authors: Ming-Wei Lin, Cheng Ling, Yiyang Zhang, Hyeun Joong Yoon, Mark Ming-Cheng Cheng, Luis A. Agapito, Nicholas Kioussis, Noppi Widjaja, Zhixian Zhou

    Abstract: We have fabricated suspended few layer (1-3 layers) graphene nanoribbon field effect transistors from unzipped multiwall carbon nanotubes. Electrical transport measurements show that current-annealing effectively removes the impurities on the suspended graphene nanoribbons, uncovering the intrinsic ambipolar transfer characteristic of graphene. Further increasing the annealing current creates a na… ▽ More

    Submitted 8 April, 2011; originally announced April 2011.

    Comments: 19 pages, 6 figures, accepted for publication in Nanotechnology (2011)