Assessing the de novo assemblers: a metaviromic study of apple and first report of citrus concave gum-associated virus, apple rubbery wood virus 1 and 2 infecting apple in India

BMC Genomics. 2024 Nov 8;25(1):1057. doi: 10.1186/s12864-024-10968-x.

Abstract

Background: The choice of de novo assembler for high-throughput sequencing (HTS) data remains a pivotal factor in the HTS-based discovery of viral pathogens. This study assessed de novo assemblers, namely Trinity, SPAdes, and MEGAHIT for HTS datasets generated on the Illumina platform from 23 apple samples, representing 15 exotic and indigenous apple varieties and a rootstock. The assemblers were compared based on assembly quality metrics, including the largest contig, total assembly length, genome coverage, and N50.

Results: MEGAHIT was most efficient assembler according to the metrics evaluated in this study. By using multiple assemblers, near-complete genome sequences of citrus concave gum-associated virus (CCGaV), apple rubbery wood virus 1 (ARWV-1), ARWV-2, apple necrotic mosaic virus (ApNMV), apple mosaic virus, apple stem pitting virus, apple stem grooving virus, apple chlorotic leaf spot virus, apple hammerhead viroid and apple scar skin viroid were reconstructed. These viruses were further confirmed through Sanger sequencing in different apple cultivars. Among them, CCGaV, ARWV-1 and ARWV-2 were recorded from apples in India for the first time. The analysis of virus richness revealed that ApNMV was dominant, followed by ARWV-1 and CCGaV. Moreover, MEGAHIT identified novel single-nucleotide variants.

Conclusions: Our analyses highlight the crucial role of assembly methods in reconstructing near-complete apple virus genomes from the Illumina reads. This study emphasizes the significance of employing multiple assemblers for de novo virus genome assembly in vegetatively propagated perennial fruit crops.

Keywords: Apple rubbery wood virus 1; Apple virome; Citrus concave gum-associated virus; De novo assemblers; High-throughput sequencing; Single-nucleotide variants.

MeSH terms

  • Genome, Viral
  • High-Throughput Nucleotide Sequencing*
  • India
  • Malus* / virology
  • Phylogeny
  • Plant Diseases* / virology
  • Plant Viruses / genetics
  • Plant Viruses / isolation & purification