Triple coding in human SRD5A1 mRNA

Res Sq [Preprint]. 2024 Dec 19:rs.3.rs-5390104. doi: 10.21203/rs.3.rs-5390104/v1.

Abstract

Background: Nucleotide sequence can be translated in three reading frames from 5' to 3' producing distinct protein products. Many examples of RNA translation in two reading frames (dual coding) have been identified so far. Results: We report simultaneous translation of mRNA transcripts derived from SRD5A1 locus in all three reading frames that result in the synthesis of long proteins. This occurs due to initiation at three nearby AUG codons occurring in all three-reading frame. Only one of the three proteoforms contains the conserved catalytical domain of SDRD5A1 produced either from the second or the third AUG codon depending on the transcript. Paradoxically, ribosome profiling data and expression reporters indicate that the most efficient translation produces catalytically inactive proteoforms. While phylogenetic analysis suggests that the long triple decoding region is specific to primates, occurrence of nearby AUGs in all three reading frames is ancestral to placental mammals. This suggests that their evolutionary significance belongs to regulation of translation rather than biological role of their products. By analysing multiple publicly available ribosome profiling data and with gene expression assays carried out in different cellular environments, we show that relative expression of these proteoforms is mutually dependent and vary across environments supporting this conjecture. A remarkable feature of triple decoding is its resistance to indel mutations with apparent implications to clinical interpretation of genomic variants. Conclusion: We argue for the importance of identification, characterisation and annotation of productive RNA translation irrespective of the presumed biological roles of the products of this translation.

Publication types

  • Preprint