Performance and scalability of Fourier domain optical coherence tomography acceleration using graphics processing units

Appl Opt. 2011 May 1;50(13):1832-8. doi: 10.1364/AO.50.001832.

Abstract

Fourier domain optical coherence tomography (FD-OCT) provides faster line rates, better resolution, and higher sensitivity for noninvasive, in vivo biomedical imaging compared to traditional time domain OCT (TD-OCT). However, because the signal processing for FD-OCT is computationally intensive, real-time FD-OCT applications demand powerful computing platforms to deliver acceptable performance. Graphics processing units (GPUs) have been used as coprocessors to accelerate FD-OCT by leveraging their relatively simple programming model to exploit thread-level parallelism. Unfortunately, GPUs do not "share" memory with their host processors, requiring additional data transfers between the GPU and CPU. In this paper, we implement a complete FD-OCT accelerator on a consumer grade GPU/CPU platform. Our data acquisition system uses spectrometer-based detection and a dual-arm interferometer topology with numerical dispersion compensation for retinal imaging. We demonstrate that the maximum line rate is dictated by the memory transfer time and not the processing time due to the GPU platform's memory model. Finally, we discuss how the performance trends of GPU-based accelerators compare to the expected future requirements of FD-OCT data rates.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acceleration
  • Algorithms
  • Computer Graphics
  • Computers
  • Equipment Design
  • Fourier Analysis
  • Humans
  • Image Processing, Computer-Assisted / methods*
  • Optics and Photonics
  • Reproducibility of Results
  • Retina / pathology
  • Signal Processing, Computer-Assisted
  • Software
  • Time Factors
  • Tomography, Optical Coherence / methods*