Genome sequencing of Colletotrichum gloeosporioides ES026 reveals plausible pathway of HupA

Mol Biol Rep. 2022 Dec;49(12):11611-11622. doi: 10.1007/s11033-022-07850-y. Epub 2022 Sep 26.

Abstract

Background: Colletotrichum gloeosporioides ES026, isolated as an endophytic fungal strain, was found to produce the important medicinal compound HuperzineA (HupA). In a genetic context, ES026 showed potential in elucidating the biosynthetic pathway of HupA.

Methods and results: The ES026 strain was sequenced using de-novo Illumina sequencing methods in this study. Assembling the cleaned data resulted in 58,594,804bp, consisting of 404 scaffolds. The G + C mol % content of this genome was 52.53%. The genome progressive-alignment with other 4 Colletotrichum strains revealed that ES026 showed closer relation with 030206, SMCG1#C and Nara gc5. More than 60 putative biosynthetic clusters were predicted with the fungal version antiSMASH4.0 program. More than 33 types I polyketide-related biosynthetic gene clusters were distributed, containing PKS and PKS-NRPS (polyketide-nonribosomal peptides) hybrid gene clusters. Another 8 NRPS biosynthetic gene clusters were distributed among the genome of ES026. The prenyltransferases, probably involved in aromatic prenyl-compounds and terpenoid biosynthesis, were analyzed using bioinformatics tools like MEGA.

Conclusion: We predicted a new possible biosynthetic pathway for the HupA from the pipecolic acid, based on the published HupA biosynthesis proposed pathway, the biosynthesis and pipecolic acid-derived compounds. We hypothesize that a hybrid PKS-NRPS mega-enzyme was probably involved in the biosynthesis of HupA with the pipecolic acid, the building block of rapamycin, as a HupA precursor. The rapamycin is produced from a polyketide biosynthesis pathway, and the domain incorporating the pipecolic acid is studied.

Keywords: De-novo Illumina sequencing; Endophytic fungi; HupA biosynthetic pathway; Medicinal plants; Polyketides; Secondary metabolites diversity.

MeSH terms

  • Base Sequence
  • Colletotrichum* / genetics
  • Multigene Family
  • Polyketides* / metabolism
  • Sirolimus

Substances

  • Polyketides
  • Sirolimus

Supplementary concepts

  • Colletotrichum gloeosporioides