A Standardized Pipeline for Assembly and Annotation of African Swine Fever Virus Genome

Edward Spinard; Mark Dinhobl; Cassidy N G Erdelyan; James O'Dwyer; Jacob Fenster; Hillary Birtley; Nicolas Tesler; Sten Calvelage; Mikael Leijon; Lucilla Steinaa; Vivian O'Donnell; Sandra Blome; Armanda Bastos; Elizabeth Ramirez-Medina; Anna Lacasta; Karl Ståhl; Huaji Qiu; Dachrit Nilubol; Chandana Tennakoon; Charles Maesembe; Bonto Faburay; Aruna Ambagala; David Williams; Paolo Ribeca; Manuel V Borca; Douglas P Gladue

doi:10.3390/v16081293

Viruses. 2024 Aug 13;16(8):1293. doi: 10.3390/v16081293.

Authors

Edward Spinard¹,², Mark Dinhobl¹,², Cassidy N G Erdelyan³, James O'Dwyer⁴, Jacob Fenster⁵, Hillary Birtley⁵, Nicolas Tesler⁵, Sten Calvelage⁶, Mikael Leijon⁷, Lucilla Steinaa⁸, Vivian O'Donnell⁹, Sandra Blome⁶, Armanda Bastos¹⁰, Elizabeth Ramirez-Medina¹,², Anna Lacasta⁸, Karl Ståhl¹¹, Huaji Qiu¹², Dachrit Nilubol¹³, Chandana Tennakoon¹⁴, Charles Maesembe¹⁵, Bonto Faburay⁹, Aruna Ambagala⁴, David Williams³, Paolo Ribeca¹⁶,¹⁷, Manuel V Borca¹,², Douglas P Gladue¹,²

Affiliations

¹ U.S. Department of Agriculture, Agricultural Research Service, Foreign Animal Disease Research Unit, Plum Island Animal Disease Center (PIADC), P.O. Box 848, Greenport, NY 11944, USA.
² U.S. Department of Agriculture, Agricultural Research Service, Foreign Animal Disease Research Unit, National Bio and Agro-Defense Facility, Manhattan, KS 66502, USA.
³ CSIRO, Australian Centre for Disease Preparedness, Geelong, VIC 3220, Australia.
⁴ National Centre for Foreign Animal Disease, Canadian Food Inspection Agency, Winnipeg, MB R3E 3M4, Canada.
⁵ Oak Ridge Institute for Science and Education (ORISE), Oak Ridge, TN 37830, USA.
⁶ Friedrich-Loeffler-Institut, Federal Research Institute for Animal Health, Südufer 10, 17493 Greifswald-Insel Riems, Germany.
⁷ Department of Microbiology, Swedish Veterinary Agency, SE-751 89 Uppsala, Sweden.
⁸ Animal and Human Health Program, International Livestock Research Institute, Nairobi 00100, Kenya.
⁹ U.S. Department of Agriculture, Animal and Plant Health Inspection Service, Plum Island Animal Disease Center, Greenport, NY 11944, USA.
¹⁰ Department of Veterinary Tropical Diseases, Faculty of Veterinary Science, University of Pretoria, Onderstepoort 0110, South Africa.
¹¹ Department of Epidemiology, Surveillance and Risk assessment, Swedish Veterinary Agency, SE-751 89 Uppsala, Sweden.
¹² State Key Laboratory for Animal Disease Control and Prevention, National African Swine Fever Para-Reference Laboratory, National High Containment Facilities for Animal Diseases Control and Prevention, Harbin Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Harbin 100081, China.
¹³ Swine Viral Evolution and Vaccine Development Research Unit, Department of Veterinary Microbiology, Faculty of Veterinary Science, Chulalongkorn University, Henry Dunant Road, Pathumwan, Bangkok 10330, Thailand.
¹⁴ The Pirbright Institute, Ash Road, Pirbright, Woking GU24 0NF, UK.
¹⁵ Department of Zoology, Entomology and Fisheries Sciences, School of Biosciences, College of Natural Sciences, Makerere University, Kampala P.O. Box 7062, Uganda.
¹⁶ UK Health Security Agency, London E14 4PU, UK.
¹⁷ Biomathematics and Statistics Scotland, Edinburgh EH9 3FD, UK.

Abstract

Obtaining a complete, good-quality sequence and annotation for the long double-stranded DNA genome of the African swine fever virus (ASFV) from next-generation sequencing (NGS) data has proven difficult, despite the increasing availability of reference genome sequences and the increasing affordability of NGS. A gap analysis conducted by Global African Swine Fever Research Alliance (GARA) partners identified an urgent need for a standardized, automated pipeline for NGS analysis, particularly for new outbreak strains. Although several diagnostic and research labs worldwide collect ASFV isolates from outbreaks, many lack the capability to analyze, annotate, and format the resulting NGS data for submission to NCBI, and some publicly available ASFV genomes have missing or incorrect annotations. We developed an automated, standardized pipeline for the analysis of NGS reads that directly provides users with assemblies and annotations formatted for submission to NCBI. The pipeline is freely available on GitHub and was tested by the GARA partners on two previously sequenced ASFV genomes. This study also aimed to assess the accuracy and limitations of the two strategies present within the pipeline: reference-based assembly (Illumina reads) and de novo assembly (Illumina and Nanopore reads).

Keywords: ASF; ASFV; African swine fever; African swine fever virus; next-generation sequencing; pipeline.

MeSH terms

African Swine Fever Virus* / classification
African Swine Fever Virus* / genetics
African Swine Fever Virus* / isolation & purification
African Swine Fever* / virology
Animals
Computational Biology / methods
Genome, Viral*
High-Throughput Nucleotide Sequencing* / methods
Molecular Sequence Annotation*
Sequence Analysis, DNA / methods
Swine
