Machine learning for automated and real-time two-dimensional to three-dimensional registration of the spine using a single radiograph

Neurosurg Focus. 2023 Jun;54(6):E16. doi: 10.3171/2023.3.FOCUS2345.

Abstract

Objective: The goal of this work was to methodically evaluate, optimize, and validate a self-supervised machine learning algorithm capable of real-time automatic registration and fluoroscopic localization of the spine using a single radiograph or fluoroscopic frame.

Methods: The authors propose a two-dimensional to three-dimensional (2D-3D) registration algorithm that maximizes an image similarity metric between radiographic images to identify the position of a C-arm relative to a 3D volume. This work utilizes digitally reconstructed radiographs (DRRs), which are synthetic radiographic images generated by simulating the x-ray projections as they would pass through a CT volume. To evaluate the algorithm, the authors used cone-beam CT data for 127 patients obtained from an open-source de-identified registry of cervical, thoracic, and lumbar scans. They systematically evaluated and tuned the algorithm, then quantified the convergence rate of the model by simulating C-arm registrations with 80 randomly simulated DRRs for each CT volume. The endpoints of this study were time to convergence, accuracy of convergence for each of the C-arm's degrees of freedom, and overall registration accuracy based on a voxel-by-voxel measurement.

Results: A total of 10,160 unique radiographic images were simulated from 127 CT scans. The algorithm successfully converged to the correct solution 82% of the time with an average of 1.96 seconds of computation. The radiographic images for which the algorithm converged to the solution demonstrated 99.9% registration accuracy despite utilizing only single-precision computation for speed. The algorithm was found to be optimized for convergence when the search space was limited to a ± 45° offset in the right anterior oblique/left anterior oblique, cranial/caudal, and receiver rotation angles with the radiographic isocenter contained within 8000 cm3 of the volumetric center of the CT volume.

Conclusions: The investigated machine learning algorithm has the potential to aid surgeons in level localization, surgical planning, and intraoperative navigation through a completely automated 2D-3D registration process. Future work will focus on algorithmic optimizations to improve the convergence rate and speed profile.

Keywords: DRR; digitally reconstructed radiograph; machine learning; registration; spine.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Humans
  • Imaging, Three-Dimensional* / methods
  • Machine Learning
  • Radiography
  • Spine / diagnostic imaging
  • Spine / surgery
  • Tomography, X-Ray Computed* / methods