Caused by both eukaryotic dinoflagellates and prokaryotic cyanobacteria, harmful algal blooms are events of severe ecological, economic, and public health consequence, and their incidence has become more common of late. Despite coordinated research efforts to identify and characterize the genomes of harmful algal bloom-causing organisms, the genomic basis and evolutionary origins of paralytic shellfish toxins produced by harmful algal blooms remain at best incomplete. The paralytic shellfish toxin saxitoxin has an especially complex genomic architecture and enigmatic phylogenetic distribution, spanning dinoflagellates and multiple cyanobacterial genera. Using filtration and extraction techniques to target the desired cyanobacteria from nonaxenic culture, coupled with a combination of short- and long-read sequencing, we generated a reference-quality hybrid genome assembly for Heteroscytonema crispum UTEX LB 1556, a freshwater, paralytic shellfish toxin-producing cyanobacterium thought to have the largest known genome in its phylum. We report a complete, novel biosynthetic gene cluster for the paralytic shellfish toxin saxitoxin. Leveraging this biosynthetic gene cluster, we find support for the hypothesis that paralytic shellfish toxin production has appeared in divergent Cyanobacteria lineages through widespread and repeated horizontal gene transfer. This work demonstrates the utility of long-read sequencing and metagenomic assembly toward advancing our understanding of paralytic shellfish toxin biosynthetic gene cluster diversity and suggests a mechanism for the origin of paralytic shellfish toxin biosynthetic genes.
Keywords: biosynthetic gene cluster; cyanobacteria; horizontal gene transfer; saxitoxin; selection.
Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution 2024.