Annotated genome and transcriptome of the endangered Caribbean mountainous star coral (Orbicella faveolata) using PacBio long-read sequencing

BMC Genomics. 2024 Feb 29;25(1):226. doi: 10.1186/s12864-024-10092-w.

Abstract

Long-read sequencing is revolutionizing de-novo genome assemblies, with continued advancements making it more readily available for previously understudied, non-model organisms. Stony corals are one such example, with long-read de-novo genome assemblies now starting to be publicly available, opening the door for a wide array of 'omics-based research. Here we present a new de-novo genome assembly for the endangered Caribbean star coral, Orbicella faveolata, using PacBio circular consensus reads. Our genome assembly improved the contiguity (51 versus 1,933 contigs) and complete and single copy BUSCO orthologs (93.6% versus 85.3%, database metazoa_odb10), compared to the currently available reference genome generated using short-read methodologies. Our new de-novo assembled genome also showed comparable quality metrics to other coral long-read genomes. Telomeric repeat analysis identified putative chromosomes in our scaffolded assembly, with these repeats at either one, or both ends, of scaffolded contigs. We identified 32,172 protein coding genes in our assembly through use of long-read RNA sequencing (ISO-seq) of additional O. faveolata fragments exposed to a range of abiotic and biotic treatments, and publicly available short-read RNA-seq data. With anthropogenic influences heavily affecting O. faveolata, as well as its increasing incorporation into reef restoration activities, this updated genome resource can be used for population genomics and other 'omics analyses to aid in the conservation of this species.

Keywords: Orbicella faveolata; De-novo genome assembly; HiFi reads; ISO-seq; Long-read sequencing; PacBio circular consensus sequencing; Stony coral Scleractinia.

MeSH terms

  • Animals
  • Anthozoa* / genetics
  • Caribbean Region
  • Genome
  • High-Throughput Nucleotide Sequencing / methods
  • Sequence Analysis, DNA / methods
  • Transcriptome*