NuFold: end-to-end approach for RNA tertiary structure prediction with flexible nucleobase center representation

Nat Commun. 2025 Jan 21;16(1):881. doi: 10.1038/s41467-025-56261-7.

Abstract

RNA plays a crucial role not only in information transfer as messenger RNA during gene expression but also in various biological functions as non-coding RNAs. Understanding mechanical mechanisms of function needs tertiary structure information; however, experimental determination of three-dimensional RNA structures is costly and time-consuming, leading to a substantial gap between RNA sequence and structural data. To address this challenge, we developed NuFold, a novel computational approach that leverages state-of-the-art deep learning architecture to accurately predict RNA tertiary structures. NuFold is a deep neural network trained end-to-end for the output structure from the input sequence. NuFold incorporates a nucleobase center representation, which enables flexible conformation of ribose rings. Benchmark study showed that NuFold clearly outperformed energy-based methods and demonstrated comparable results with existing state-of-the-art deep-learning-based methods. NuFold exhibited a particular advantage in building correct local geometries of RNA. Analyses of individual components in the NuFold pipeline indicated that the performance improved by utilizing metagenome sequences for multiple sequence alignment and increasing the number of recycling. NuFold is also capable of predicting multimer complex structures of RNA by linking the input sequences.

MeSH terms

  • Computational Biology / methods
  • Deep Learning
  • Neural Networks, Computer
  • Nucleic Acid Conformation*
  • RNA* / chemistry
  • RNA* / genetics
  • Sequence Alignment / methods
  • Software

Substances

  • RNA