Genome report: First whole genome sequence of Triatoma sanguisuga (Le Conte, 1855), vector of Chagas disease

bioRxiv [Preprint]. 2024 Oct 8:2024.10.07.611472. doi: 10.1101/2024.10.07.611472.

Abstract

Triatoma sanguisuga is the most widespread triatomine bug species in the United States (US). The species vectors the human parasite Trypanosoma cruzi , which causes Chagas disease. Vector-borne Chagas disease is rarely diagnosed in the US, but T. sanguisuga has been implicated in a handful of cases. Despite its public health importance, little is known about the genomics or population genetics of T. sanguisuga . Here, we used long-read sequencing to assemble the first whole genome sequence for T. sanguisuga using DNA extracted from one adult specimen from Delaware. The final size of the genome was 1.162 Gbp with 77.7x coverage. The assembly consisted of 183 contigs with an N50 size of 94.97 Kb. The Benchmarking Universal Single-Copy Ortholog (BUSCO) complete score was 99.1%, suggesting a very complete assembly. Genome-wide GC level was 33.56%, and DNA methylation was 18.84%. The genome consists of 61.4% repetitive DNA and 17,799 predicted coding genes. The assembled T. sanguisuga genome was slightly larger than that of Triatominae species Triatoma infestans and Rhodnius prolixus (949 Mbp with 90.4% BUSCO score and 706 Mbp with 96.5% BUSCO score, respectively). The T. sanguisuga genome is the first North American triatomine species genome to be sequenced, and it is the most complete genome yet for any Triatominae species. The T. sangsuiga genome will allow for deeper investigations into epidemiologically relevant aspects of this important vector species, including blood feeding, host seeking, and parasite competence, thus providing potential vector-borne disease management targets and strengthening public health preparedness.

Publication types

  • Preprint