Zum Hauptinhalt springen

Showing 1–3 of 3 results for author: Takizawa, S

Searching in archive cs. Search in all archives.
.
  1. Performance Portable Back-projection Algorithms on CPUs: Agnostic Data Locality and Vectorization Optimizations

    Authors: Peng Chen, Mohamed Wahib, Xiao Wang, Shinichiro Takizawa, Takahiro Hirofuchi, Hirotaka Ogawa, Satoshi Matsuoka

    Abstract: Computed Tomography (CT) is a key 3D imaging technology that fundamentally relies on the compute-intense back-projection operation to generate 3D volumes. GPUs are typically used for back-projection in production CT devices. However, with the rise of power-constrained micro-CT devices, and also the emergence of CPUs comparable in performance to GPUs, back-projection for CPUs could become favorable… ▽ More

    Submitted 27 April, 2021; originally announced April 2021.

    Comments: ACM International Conference on Supercomputing 2021 (ICS'21)

  2. iFDK: A Scalable Framework for Instant High-resolution Image Reconstruction

    Authors: Peng Chen, Mohamed Wahib, Shinichiro Takizawa, Ryousei Takano, Satoshi Matsuoka

    Abstract: Computed Tomography (CT) is a widely used technology that requires compute-intense algorithms for image reconstruction. We propose a novel back-projection algorithm that reduces the projection computation cost to 1/6 of the standard algorithm. We also propose an efficient implementation that takes advantage of the heterogeneity of GPU-accelerated systems by overlapping the filtering and back-proje… ▽ More

    Submitted 6 September, 2019; originally announced September 2019.

    Comments: ACM/IEEE Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'19)

  3. A Versatile Software Systolic Execution Model for GPU Memory-Bound Kernels

    Authors: Peng Chen, Mohamed Wahib, Shinichiro Takizawa, Ryousei Takano, Satoshi Matsuoka

    Abstract: This paper proposes a versatile high-performance execution model, inspired by systolic arrays, for memory-bound regular kernels running on CUDA-enabled GPUs. We formulate a systolic model that shifts partial sums by CUDA warp primitives for the computation. We also employ register files as a cache resource in order to operate the entire model efficiently. We demonstrate the effectiveness and versa… ▽ More

    Submitted 6 September, 2019; v1 submitted 13 July, 2019; originally announced July 2019.

    Comments: ACM/IEEE Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'19)