Exploring the genome and transcriptome of the cave nectar bat Eonycteris spelaea with PacBio long-read sequencing

Gigascience. 2018 Oct 1;7(10):giy116. doi: 10.1093/gigascience/giy116.

Abstract

Background: In the past two decades, bats have emerged as an important model system to study host-pathogen interactions. More recently, it has been shown that bats may also serve as a new and excellent model to study aging, inflammation, and cancer, among other important biological processes. The cave nectar bat or lesser dawn bat (Eonycteris spelaea) is known to be a reservoir for several viruses and intracellular bacteria. It is widely distributed throughout the tropics and subtropics from India to Southeast Asia and pollinates several plant species, including the culturally and economically important durian in the region. Here, we report the whole-genome and transcriptome sequencing, followed by subsequent de novo assembly, of the E. spelaea genome solely using the Pacific Biosciences (PacBio) long-read sequencing platform.

Findings: The newly assembled E. spelaea genome is 1.97 Gb in length and consists of 4,470 sequences with a contig N50 of 8.0 Mb. Identified repeat elements covered 34.65% of the genome, and 20,640 unique protein-coding genes with 39,526 transcripts were annotated.

Conclusions: We demonstrated that the PacBio long-read sequencing platform alone is sufficient to generate a comprehensive de novo assembled genome and transcriptome of an important bat species. These results will provide useful insights and act as a resource to expand our understanding of bat evolution, ecology, physiology, immunology, viral infection, and transmission dynamics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing
  • Animals
  • Chiroptera / classification
  • Chiroptera / genetics*
  • Computational Biology / methods
  • Evolution, Molecular
  • Female
  • Genome*
  • Genomics* / methods
  • High-Throughput Nucleotide Sequencing
  • Molecular Sequence Annotation
  • Phylogeny
  • Transcriptome*