Free-viewpoint video of human actors using multiple handheld Kinects

IEEE Trans Cybern. 2013 Oct;43(5):1370-82. doi: 10.1109/TCYB.2013.2272321. Epub 2013 Jul 22.

Abstract

We present an algorithm for creating free-viewpoint video of interacting humans using three handheld Kinect cameras. Our method reconstructs deforming surface geometry and temporal varying texture of humans through estimation of human poses and camera poses for every time step of the RGBZ video. Skeletal configurations and camera poses are found by solving a joint energy minimization problem, which optimizes the alignment of RGBZ data from all cameras, as well as the alignment of human shape templates to the Kinect data. The energy function is based on a combination of geometric correspondence finding, implicit scene segmentation, and correspondence finding using image features. Finally, texture recovery is achieved through jointly optimization on spatio-temporal RGB data using matrix completion. As opposed to previous methods, our algorithm succeeds on free-viewpoint video of human actors under general uncontrolled indoor scenes with potentially dynamic background, and it succeeds even if the cameras are moving.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Actigraphy / instrumentation
  • Actigraphy / methods*
  • Algorithms
  • Artificial Intelligence*
  • Computer Peripherals*
  • Computer Simulation
  • Humans
  • Image Enhancement / instrumentation
  • Image Enhancement / methods
  • Imaging, Three-Dimensional / methods*
  • Pattern Recognition, Automated / methods*
  • Transducers
  • Video Games
  • Video Recording / methods*
  • Whole Body Imaging / instrumentation
  • Whole Body Imaging / methods*