Showing 1–13 of 13 results for author: Bazavan, E G

Searching in archive cs.
  1. arXiv:2406.07516  [pdf, other]

    cs.CV

    Instant 3D Human Avatar Generation using Image Diffusion Models

    Authors: Nikos Kolotouros, Thiemo Alldieck, Enric Corona, Eduard Gabriel Bazavan, Cristian Sminchisescu

    Abstract: We present AvatarPopUp, a method for fast, high quality 3D human avatar generation from different input modalities, such as images and text prompts and with control over the generated pose and shape. The common theme is the use of diffusion-based image generation networks that are specialized for each particular task, followed by a 3D lifting network. We purposefully decouple the generation from t…

    Submitted 12 July, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: Camera-ready version

  2. arXiv:2403.08764  [pdf, other]

    cs.CV

    VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis

    Authors: Enric Corona, Andrei Zanfir, Eduard Gabriel Bazavan, Nikos Kolotouros, Thiemo Alldieck, Cristian Sminchisescu

    Abstract: We propose VLOGGER, a method for audio-driven human video generation from a single input image of a person, which builds on the success of recent generative diffusion models. Our method consists of 1) a stochastic human-to-3d-motion diffusion model, and 2) a novel diffusion-based architecture that augments text-to-image models with both spatial and temporal controls. This supports the generation o…

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: Project web: https://enriccorona.github.io/vlogger/

  3. arXiv:2311.02461  [pdf, other]

    cs.CV

    SPHEAR: Spherical Head Registration for Complete Statistical 3D Modeling

    Authors: Eduard Gabriel Bazavan, Andrei Zanfir, Thiemo Alldieck, Teodor Alexandru Szente, Mihai Zanfir, Cristian Sminchisescu

    Abstract: We present SPHEAR, an accurate, differentiable parametric statistical 3D human head model, enabled by a novel 3D registration method based on spherical embeddings. We shift the paradigm away from the classical Non-Rigid Registration methods, which operate under various surface priors, increasing reconstruction fidelity and minimizing required human intervention. Additionally, SPHEAR is a \e…

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: To be published at the International Conference on 3D Vision 2024

  4. arXiv:2309.05782  [pdf, other]

    cs.CV

    Blendshapes GHUM: Real-time Monocular Facial Blendshape Prediction

    Authors: Ivan Grishchenko, Geng Yan, Eduard Gabriel Bazavan, Andrei Zanfir, Nikolai Chinaev, Karthik Raveendran, Matthias Grundmann, Cristian Sminchisescu

    Abstract: We present Blendshapes GHUM, an on-device ML pipeline that predicts 52 facial blendshape coefficients at 30+ FPS on modern mobile phones from a single monocular RGB image, and enables facial motion capture applications like virtual avatars. Our main contributions are: i) an annotation-free offline method for obtaining blendshape coefficients from real-world human scans, ii) a lightweight real-time…

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: 4 pages, 3 figures

  5. arXiv:2306.09329  [pdf, other]

    cs.CV

    DreamHuman: Animatable 3D Avatars from Text

    Authors: Nikos Kolotouros, Thiemo Alldieck, Andrei Zanfir, Eduard Gabriel Bazavan, Mihai Fieraru, Cristian Sminchisescu

    Abstract: We present DreamHuman, a method to generate realistic animatable 3D human avatar models solely from textual descriptions. Recent text-to-3D methods have made considerable strides in generation, but are still lacking in important aspects. Control and often spatial resolution remain limited, existing methods produce fixed rather than animated 3D human models, and anthropometric consistency for compl…

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: Project website at https://dream-human.github.io/

  6. arXiv:2212.06820  [pdf, other]

    cs.CV

    Structured 3D Features for Reconstructing Controllable Avatars

    Authors: Enric Corona, Mihai Zanfir, Thiemo Alldieck, Eduard Gabriel Bazavan, Andrei Zanfir, Cristian Sminchisescu

    Abstract: We introduce Structured 3D Features, a model based on a novel implicit 3D representation that pools pixel-aligned image features onto dense 3D points sampled from a parametric, statistical human mesh surface. The 3D points have associated semantics and can move freely in 3D space. This allows for optimal coverage of the person of interest, beyond just the body shape, which in turn, additionally he…

    Submitted 15 April, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

    Comments: Accepted at CVPR 2023. Project page: https://enriccorona.github.io/s3f/, Video: https://www.youtube.com/watch?v=mcZGcQ6L-2s

  7. arXiv:2206.11678  [pdf, other]

    cs.CV

    BlazePose GHUM Holistic: Real-time 3D Human Landmarks and Pose Estimation

    Authors: Ivan Grishchenko, Valentin Bazarevsky, Andrei Zanfir, Eduard Gabriel Bazavan, Mihai Zanfir, Richard Yee, Karthik Raveendran, Matsvei Zhdanovich, Matthias Grundmann, Cristian Sminchisescu

    Abstract: We present BlazePose GHUM Holistic, a lightweight neural network pipeline for 3D human body landmarks and pose estimation, specifically tailored to real-time on-device inference. BlazePose GHUM Holistic enables motion capture from a single RGB image, with applications including avatar control, fitness tracking and AR/VR effects. Our main contributions include i) a novel method for 3D ground truth data acquisition, i…

    Submitted 23 June, 2022; originally announced June 2022.

    Comments: 4 pages, 4 figures; CVPR Workshop on Computer Vision for Augmented and Virtual Reality, New Orleans, LA, 2022

  8. arXiv:2112.12867  [pdf, other]

    cs.CV

    HSPACE: Synthetic Parametric Humans Animated in Complex Environments

    Authors: Eduard Gabriel Bazavan, Andrei Zanfir, Mihai Zanfir, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu

    Abstract: Advances in the state of the art for 3d human sensing are currently limited by the lack of visual datasets with 3d ground truth, including multiple people, in motion, operating in real-world environments, with complex illumination or occlusion, and potentially observed by a moving camera. Sophisticated scene understanding would require estimating human pose and shape as well as gestures, towards r…

    Submitted 6 January, 2022; v1 submitted 23 December, 2021; originally announced December 2021.

  9. arXiv:2111.00038  [pdf, other]

    cs.CV

    On-device Real-time Hand Gesture Recognition

    Authors: George Sung, Kanstantsin Sokal, Esha Uboweja, Valentin Bazarevsky, Jonathan Baccash, Eduard Gabriel Bazavan, Chuo-Ling Chang, Matthias Grundmann

    Abstract: We present an on-device real-time hand gesture recognition (HGR) system, which detects a set of predefined static gestures from a single RGB camera. The system consists of two parts: a hand skeleton tracker and a gesture classifier. We use MediaPipe Hands as the basis of the hand skeleton tracker, improve the keypoint accuracy, and add the estimation of 3D keypoints in a world metric space. We cre…

    Submitted 29 October, 2021; originally announced November 2021.

    Comments: 5 pages, 6 figures; ICCV Workshop on Computer Vision for Augmented and Virtual Reality, Montreal, Canada, 2021

  10. arXiv:2106.09336  [pdf, other]

    cs.CV

    THUNDR: Transformer-based 3D HUmaN Reconstruction with Markers

    Authors: Mihai Zanfir, Andrei Zanfir, Eduard Gabriel Bazavan, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu

    Abstract: We present THUNDR, a transformer-based deep neural network methodology to reconstruct the 3d pose and shape of people, given monocular RGB images. Key to our methodology is an intermediate 3d marker representation, where we aim to combine the predictive power of model-free output architectures and the regularizing, anthropometrically-preserving properties of a statistical human surface model like…

    Submitted 17 June, 2021; originally announced June 2021.

  11. arXiv:2008.06910  [pdf, other]

    cs.CV

    Neural Descent for Visual 3D Human Pose and Shape

    Authors: Andrei Zanfir, Eduard Gabriel Bazavan, Mihai Zanfir, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu

    Abstract: We present a deep neural network methodology to reconstruct the 3d pose and shape of people, given an input RGB image. We rely on a recently introduced, expressive full body statistical 3d human model, GHUM, trained end-to-end, and learn to reconstruct its pose and shape state in a self-supervised regime. Central to our methodology is a learning to learn and optimize approach, referred to as HUmanNe…

    Submitted 14 June, 2021; v1 submitted 16 August, 2020; originally announced August 2020.

    Comments: CVPR 2021

  12. arXiv:2003.10350  [pdf, other]

    cs.CV

    Weakly Supervised 3D Human Pose and Shape Reconstruction with Normalizing Flows

    Authors: Andrei Zanfir, Eduard Gabriel Bazavan, Hongyi Xu, Bill Freeman, Rahul Sukthankar, Cristian Sminchisescu

    Abstract: Monocular 3D human pose and shape estimation is challenging due to the many degrees of freedom of the human body and the difficulty of acquiring training data for large-scale supervised learning in complex visual scenes. In this paper we present practical semi-supervised and self-supervised models that support training and good generalization in real-world images and video. Our formulation is based o…

    Submitted 22 August, 2020; v1 submitted 23 March, 2020; originally announced March 2020.

    Journal ref: ECCV 2020

  13. arXiv:1203.1483  [pdf, other]

    cs.CV cs.LG

    Learning Random Kernel Approximations for Object Recognition

    Authors: Eduard Gabriel Băzăvan, Fuxin Li, Cristian Sminchisescu

    Abstract: Approximations based on random Fourier features have recently emerged as an efficient and formally consistent methodology to design large-scale kernel machines. By expressing the kernel as a Fourier expansion, features are generated based on a finite set of random basis projections, sampled from the Fourier transform of the kernel, with inner products that are Monte Carlo approximations of the ori…

    Submitted 7 March, 2012; originally announced March 2012.