Search | arXiv e-print repository

Iterative CT Reconstruction via Latent Variable Optimization of Shallow Diffusion Models

Authors: Sho Ozaki, Shizuo Kaji, Toshikazu Imae, Kanabu Nawa, Hideomi Yamashita, Keiichi Nakagawa

Abstract: Image generative AI has garnered significant attention in recent years. In particular, the diffusion model, a core component of recent generative AI, produces high-quality images with rich diversity. In this study, we propose a novel CT reconstruction method by combining the denoising diffusion probabilistic model with iterative CT reconstruction. In sharp contrast to previous studies, we optimize the fidelity loss of CT reconstruction with respect to the latent variable of the diffusion model, instead of the image and model parameters. To suppress anatomical structure changes produced by the diffusion model, we shallow the diffusion and reverse processes, and fix a set of added noises in the reverse process to make it deterministic during inference. We demonstrate the effectiveness of the proposed method through sparse view CT reconstruction of 1/10 view projection data. Despite the simplicity of the implementation, the proposed method shows the capability of reconstructing high-quality images while preserving the patient's anatomical structure, and outperforms existing methods including iterative reconstruction, iterative reconstruction with total variation, and the diffusion model alone in terms of quantitative indices such as SSIM and PSNR. We also explore further sparse view CT using 1/20 view projection data with the same trained diffusion model. As the number of iterations increases, image quality improvement comparable to that of 1/10 sparse view CT reconstruction is achieved. In principle, the proposed method can be widely applied not only to CT but also to other imaging modalities such as MRI, PET, and SPECT.

Submitted 6 August, 2024; originally announced August 2024.

Comments: 19 pages, 9 figures

arXiv:2308.04977 [pdf, other]

An explicit construction of Kaleidocycles

Authors: Shizuo Kaji, Kenji Kajiwara, Shota Shigetomi

Abstract: We model a family of closed kinematic chains, known as Kaleidocycles, with the theory of discrete spatial curves. By leveraging the connection between the deformation of discrete curves and the semi-discrete integrable systems, we describe the motion of a Kaleidocycle by elliptic theta functions. This study showcases an interesting example in which an integrable system generates an orbit in the space of the real solutions of polynomial equations defined by geometric constraints.

Submitted 9 August, 2023; originally announced August 2023.

MSC Class: 53A04; 53A17; 70B15; 37K25; 37K10; 35Q53

arXiv:2107.05238 [pdf, other]

doi 10.1002/mp.15626

Training of deep cross-modality conversion models with a small dataset, and their application in megavoltage CT to kilovoltage CT conversion

Authors: Sho Ozaki, Shizuo Kaji, Kanabu Nawa, Toshikazu Imae, Atsushi Aoki, Takahiro Nakamoto, Takeshi Ohta, Yuki Nozawa, Hideomi Yamashita, Akihiro Haga, Keiichi Nakagawa

Abstract: In recent years, deep-learning-based image processing has emerged as a valuable tool for medical imaging owing to its high performance. However, the quality of deep-learning-based methods heavily relies on the amount of training data; the high cost of acquiring a large dataset is a limitation to their utilization in medical fields. Herein, based on deep learning, we developed a computed tomography (CT) modality conversion method requiring only a few unsupervised images. The proposed method is based on CycleGAN with several extensions tailored for CT images, which aims at preserving the structure in the processed images and reducing the amount of training data. This method was applied to realize the conversion of megavoltage computed tomography (MVCT) to kilovoltage computed tomography (kVCT) images. Training was conducted using several datasets acquired from patients with head and neck cancer. The size of the datasets ranged from 16 slices (two patients) to 2745 slices (137 patients) for MVCT and 2824 slices (98 patients) for kVCT. The required size of the training data was found to be as small as a few hundred slices. By statistical and visual evaluations, the quality improvement and structure preservation of the MVCT images converted by the proposed model were investigated. As a clinical benefit, it was observed by medical doctors that the converted images enhanced the precision of contouring. We developed an MVCT to kVCT conversion model based on deep learning, which can be trained using only a few hundred unpaired images. The stability of the model against changes in data size was demonstrated. This study promotes the reliable use of deep learning in clinical medicine by partially answering commonly asked questions, such as "Is our data sufficient?" and "How much data should we acquire?"

Submitted 5 April, 2022; v1 submitted 12 July, 2021; originally announced July 2021.

Comments: 3+27 pages, 13 figures, version published in Medical Physics

arXiv:2103.07584 [pdf, other]

Free-form Design of Discrete Architectural Surfaces by use of Circle Packing

Authors: Shizuo Kaji, Jingyao Zhang

Abstract: This paper presents an efficient approach for the conceptual design of architectural surfaces which are composed of triangular panels. In the free-form design of discrete architectural surfaces, the Gaussian curvature plays an important role not only aesthetically but also in terms of stiffness and constructability. However, designing a surface manually with specific Gaussian curvatures can be a time-consuming task. We propose a method to find a triangulated surface with user-specified Gaussian curvatures (not limited to constant Gaussian curvatures) and boundary vertex positions. In addition, the conformal class of the final design can be specified; that is, the user has control over the shape (the corner angles) of each triangular panel. The panels could be encouraged to form a regular tessellation or kept close to those of the initial design. The controllability of the conformal class suppresses possible distortion of the panels, resulting in higher structural performance and aesthetics. Our method relies on the idea in computational conformal geometry called circle packing. In this line of research, the discrete Ricci flow has been widely used for surface modelling. However, it is not trivial to incorporate constraints such as boundary locations and convexity of the spanned surface, which are essential to architectural applications. We propose a perturbation of the discrete Ricci energy and develop a least-squares-based optimisation scheme to address these problems with an open-source implementation available online.

Submitted 11 May, 2022; v1 submitted 12 March, 2021; originally announced March 2021.

ACM Class: I.3.5; J.6

arXiv:2007.02007 [pdf, other]

Nested Subspace Arrangement for Representation of Relational Data

Authors: Nozomi Hata, Shizuo Kaji, Akihiro Yoshida, Katsuki Fujisawa

Abstract: Studies on acquiring appropriate continuous representations of discrete objects, such as graphs and knowledge base data, have been conducted by many researchers in the field of machine learning. In this study, we introduce Nested SubSpace (NSS) arrangement, a comprehensive framework for representation learning. We show that existing embedding techniques can be regarded as special cases of the NSS arrangement. Based on the concept of the NSS arrangement, we implement a Disk-ANChor ARrangement (DANCAR), a representation learning method specialized to reproducing general graphs. Numerical experiments have shown that DANCAR has successfully embedded WordNet in ${\mathbb R}^{20}$ with an F1 score of 0.993 in the reconstruction task. DANCAR is also suitable for visualization in understanding the characteristics of graphs.

Submitted 4 July, 2020; originally announced July 2020.

Comments: 11 pages, 13 figures, ICML 2020

MSC Class: 68T30 ACM Class: I.2.4

arXiv:2005.12692 [pdf, other]

Cubical Ripser: Software for computing persistent homology of image and volume data

Authors: Shizuo Kaji, Takeki Sudo, Kazushi Ahara

Abstract: We introduce Cubical Ripser for computing persistent homology of image and volume data (more precisely, weighted cubical complexes). To our best knowledge, Cubical Ripser is currently the fastest and the most memory-efficient program for computing persistent homology of weighted cubical complexes. We demonstrate our software with an example of image analysis in which persistent homology and convolutional neural networks are successfully combined. Our open-source implementation is available online.

Submitted 12 June, 2020; v1 submitted 23 May, 2020; originally announced May 2020.

MSC Class: 55N31 (primary); 68R01 (secondary)

arXiv:1901.05773 [pdf, other]

doi 10.1002/mp.13963

Visual enhancement of Cone-beam CT by use of CycleGAN

Authors: S. Kida, S. Kaji, K. Nawa, T. Imae, T. Nakamoto, S. Ozaki, T. Ohta, Y. Nozawa, K. Nakagawa

Abstract: Cone-beam computed tomography (CBCT) offers advantages over conventional fan-beam CT in that it requires a shorter time and less exposure to obtain images. CBCT has found a wide variety of applications in patient positioning for image-guided radiation therapy, extracting radiomic information for designing patient-specific treatment, and computing fractional dose distributions for adaptive radiation therapy. However, CBCT images suffer from low soft-tissue contrast, noise, and artifacts compared to conventional fan-beam CT images. Therefore, it is essential to improve the image quality of CBCT. In this paper, we propose a synthetic approach to translate CBCT images with deep neural networks. Our method requires only unpaired and unaligned CBCT images and planning fan-beam CT (PlanCT) images for training. Once trained, 3D reconstructed CBCT images can be directly translated to high-quality PlanCT-like images. We demonstrate the effectiveness of our method with images obtained from 24 prostate patients, and we provide a statistical and visual comparison. The image quality of the translated images shows substantial improvement in voxel values, spatial uniformity, and artifact suppression compared to those of the original CBCT. The anatomical structures of the original CBCT images were also well preserved in the translated images. Our method enables more accurate adaptive radiation therapy, and opens up new applications for CBCT that hinge on high-quality images.

Submitted 25 November, 2019; v1 submitted 17 January, 2019; originally announced January 2019.

arXiv:1607.06138 [pdf, other]

doi 10.1007/978-981-13-2850-3

Dappled tiling

Authors: Shizuo Kaji, Alexandre Derouet-Jourdan, Hiroyuki Ochiai

Abstract: We consider a certain tiling problem of a planar region in which there are no long horizontal or vertical strips consisting of copies of the same tile. Intuitively speaking, we would like to create a dappled pattern with two or more kinds of tiles. We give an efficient algorithm to turn any tiling into one satisfying the condition, and discuss its applications in texturing.

Submitted 2 February, 2017; v1 submitted 20 July, 2016; originally announced July 2016.

Comments: minor errors fixed, more pictures added

MSC Class: 52C20; 68U05 ACM Class: G.2.3; I.3.3

Journal ref: Mathematical Insights into Advanced Computer Graphics Techniques, pp 59--72, Springer Singapore, 2019

arXiv:1603.04292 [pdf, other]

A linear algorithm for Brick Wang tiling

Authors: Alexandre Derouet-Jourdan, Shizuo Kaji, Yoshihiro Mizoguchi

Abstract: The Wang tiling is a classical problem in combinatorics. A major theoretical question is to find a (small) set of tiles which tiles the plane only aperiodically. In this case, resulting tilings are rather restrictive. On the other hand, Wang tiles are used as a tool to generate textures and patterns in computer graphics. In these applications, a set of tiles is normally chosen so that it tiles the plane or its sub-regions easily in many different ways. With computer graphics applications in mind, we introduce a class of such tileset, which we call sequentially permissive tilesets, and consider tiling problems with constrained boundary. We apply our methodology to a special set of Wang tiles, called Brick Wang tiles, introduced by Derouet-Jourdan et al. in 2015 to model wall patterns. We generalise their result by providing a linear algorithm to decide and solve the tiling problem for arbitrary planar regions with holes.

Submitted 8 May, 2017; v1 submitted 14 March, 2016; originally announced March 2016.

MSC Class: 05B45; 52C20; 68R10 ACM Class: I.3.3; I.3.6

arXiv:1601.04816 [pdf, other]

Tetrisation of triangular meshes and its application in shape blending

Authors: Shizuo Kaji

Abstract: The As-Rigid-As-Possible (ARAP) shape deformation framework is a versatile technique for morphing, surface modelling, and mesh editing. We discuss an improvement of the ARAP framework in a few aspects: 1. Given a triangular mesh in 3D space, we introduce a method to associate a tetrahedral structure, which encodes the geometry of the original mesh. 2. We use a Lie algebra based method to interpolate local transformation, which provides better handling of rotation with large angle. 3. We propose a new error function to compile local transformations into a global piecewise linear map, which is rotation invariant and easy to minimise. We implemented a shape blender based on our algorithm and its MIT licensed source code is available online.

Submitted 19 January, 2016; originally announced January 2016.

ACM Class: I.3.5; I.3.7

arXiv:1601.01754 [pdf, other]

Anti-commutative Dual Complex Numbers and 2D Rigid Transformation

Authors: Genki Matsuda, Shizuo Kaji, Hiroyuki Ochiai

Abstract: We introduce a new presentation of the two dimensional rigid transformation which is more concise and efficient than the standard matrix presentation. By modifying the ordinary dual number construction for the complex numbers, we define the ring of the anti-commutative dual complex numbers, which parametrizes two dimensional rotation and translation all together. With this presentation, one can easily interpolate or blend two or more rigid transformations at a low computational cost. We developed a library for C++ with the MIT-licensed source code and demonstrate its facility by an interactive deformation tool developed for iPad.

Submitted 7 January, 2016; originally announced January 2016.

ACM Class: I.3.5; I.3.3

arXiv:1507.05290 [pdf, other]

A concise parametrisation of affine transformation

Authors: Shizuo Kaji, Hiroyuki Ochiai

Abstract: Good parametrisations of affine transformations are essential to interpolation, deformation, and analysis of shape, motion, and animation. It has been one of the central research topics in computer graphics. However, there is no single perfect method and each one has both advantages and disadvantages. In this paper, we propose a novel parametrisation of affine transformations, which is a generalisation to or an improvement of existing methods. Our method adds yet another choice to the existing toolbox and shows better performance in some applications. A C++ implementation is available to make our framework ready to use in various applications.

Submitted 5 July, 2016; v1 submitted 19 July, 2015; originally announced July 2015.

Comments: errors corrected, a section on Frechet mean removed

MSC Class: 68U05; 65D18; 65F60; 15A16 ACM Class: I.3.5; I.3.7

arXiv:1506.02742 [pdf, ps, other]

Polynomial Expressions of Carries in p-ary Arithmetics

Authors: Shizuo Kaji, Toshiaki Maeno, Koji Nuida, Yasuhide Numata

Abstract: It is known that any $n$-variable function on a finite prime field of characteristic $p$ can be expressed as a polynomial over the same field with at most $p^n$ monomials. However, it is not obvious to determine the polynomial for a given concrete function. In this paper, we study the concrete polynomial expressions of the carries in addition and multiplication of $p$-ary integers. For the case of addition, our result gives a new family of symmetric polynomials, which generalizes the known result for the binary case $p = 2$ where the carries are given by elementary symmetric polynomials. On the other hand, for the case of multiplication of $n$ single-digit integers, we give a simple formula of the polynomial expression for the carry to the next digit using the Bernoulli numbers, and show that it has only $(n+1)(p-1)/2 + 1$ monomials, which is significantly fewer than the worst-case number $p^n$ of monomials for general functions. We also discuss applications of our results to cryptographic computation on encrypted data.

Submitted 18 February, 2016; v1 submitted 8 June, 2015; originally announced June 2015.

Comments: (v2) Improved results and new observations (v3) The authors are notified that our main theorem (Theorem 2) appears (by a different approach) in [C. Sturtivant, G. S. Frandsen: Theoretical Computer Science 112 (1993) 291-309]. The authors would like to keep this preprint online for reference purposes

MSC Class: 11T06 (primary); 05E05; 68R05; 94A60

arXiv:1206.0069 [pdf, ps, other]

doi 10.1142/S0129054115500100

A mathematical problem for security analysis of hash functions and pseudorandom generators

Authors: Koji Nuida, Takuro Abe, Shizuo Kaji, Toshiaki Maeno, Yasuhide Numata

Abstract: In this paper, we specify a class of mathematical problems, which we refer to as "Function Density Problems" (FDPs, in short), and point out novel connections of FDPs to the following two cryptographic topics; theoretical security evaluations of keyless hash functions (such as SHA-1), and constructions of provably secure pseudorandom generators (PRGs) with some enhanced security property introduced by Dubrov and Ishai [STOC 2006]. Our argument aims at proposing new theoretical frameworks for these topics (especially for the former) based on FDPs, rather than providing some concrete and practical results on the topics. We also give some examples of mathematical discussions on FDPs, which would be of independent interest from mathematical viewpoints. Finally, we discuss possible directions of future research on other cryptographic applications of FDPs and on mathematical studies on FDPs themselves.

Submitted 28 August, 2014; v1 submitted 31 May, 2012; originally announced June 2012.

Comments: 18 pages; (v2) 19 pages, to appear in International Journal of Foundations of Computer Science

MSC Class: 94A60 (Primary); 68R05; 52C99 (Secondary)

Journal ref: International Journal of Foundations of Computer Science, vol.26, no.2 (2015) 169--194

Showing 1–14 of 14 results for author: Kaji, S