Noise reduction and mask removal neural network for X-ray single-particle imaging

J Appl Crystallogr. 2022 Feb 1;55(Pt 1):122-132. doi: 10.1107/S1600576721012371.

Abstract

Free-electron lasers could enable X-ray imaging of single biological macromolecules and the study of protein dynamics, paving the way for a powerful new imaging tool in structural biology, but a low signal-to-noise ratio and missing regions in the detectors, colloquially termed 'masks', affect data collection and hamper real-time evaluation of experimental data. In this article, the challenges posed by noise and masks are tackled by introducing a neural network pipeline that aims to restore diffraction intensities. For training and testing of the model, a data set of diffraction patterns was simulated from 10 900 different proteins with molecular weights within the range of 10-100 kDa and collected at a photon energy of 8 keV. The method is compared with a simple low-pass filtering algorithm based on autocorrelation constraints. The results show an improvement in the mean-squared error of roughly two orders of magnitude in the presence of masks compared with the noisy data. The algorithm was also tested at increasing mask width, leading to the conclusion that demasking can achieve good results when the mask is smaller than half of the central speckle of the pattern. The results highlight the competitiveness of this model for data processing and the feasibility of restoring diffraction intensities from unknown structures in real time using deep learning methods. Finally, an example is shown of this preprocessing making orientation recovery more reliable, especially for data sets containing very few patterns, using the expansion-maximization-compression algorithm.

Keywords: XFELs; coherent X-ray diffractive imaging (CXDI); diffract-then-destroy; free-electron lasers; imaging; protein structures; single particles.

Grants and funding

This work was funded by Stiftelsen för Strategisk Forskning grant ITM17-0455; Vetenskapsrådet grants 2017-05336 and 2018-00234.