A high-quality chromosome-level genome of wild Rosa rugosa

Fengqi Zang; Yan Ma; Xiaolong Tu; Ping Huang; Qichao Wu; Zhimin Li; Tao Liu; Furong Lin; Surui Pei; Dekui Zang; Xuemei Zhang; Yongqi Zheng; Yunyan Yu

doi:10.1093/dnares/dsab017

DNA Res. 2021 Sep 13;28(5):dsab017. doi: 10.1093/dnares/dsab017.

Authors

Fengqi Zang¹, Yan Ma², Xiaolong Tu³, Ping Huang¹, Qichao Wu², Zhimin Li³, Tao Liu³, Furong Lin¹, Surui Pei³, Dekui Zang², Xuemei Zhang³, Yongqi Zheng¹, Yunyan Yu²

Affiliations

¹ State Key Laboratory of Tree Genetics and Breeding; Key Laboratory of Forest Silviculture and Tree Cultivation, National Forestry and Grassland Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing 100091, P. R. China.
² Key Laboratory of State Forestry Administration for Silviculture of the Lower Yellow River, College of Forestry, Shandong Agricultural University, Tai'an 271018, P. R. China.
³ Annoroad Gene Technology (Beijing) Co., Ltd, Beijing 100176, P. R. China.

Abstract

Rosa rugosa is an important shrub with economic, ecological, and pharmaceutical value. A high-quality chromosome-scale genome of R. rugosa was assembled using PacBio and Hi-C technologies. The final assembly was about 407.1 Mb in size, with a contig N50 of 2.85 Mb and a scaffold N50 of 56.6 Mb. More than 98% of the assembled genome sequence (402.9 Mb) was anchored to seven pseudochromosomes. The genome contained 37,512 protein-coding genes, of which 37,016 (98.68%) were functionally annotated, and 206.67 Mb (50.76%) of the assembled sequence was repetitive. Phylogenetic analyses indicated that R. rugosa diverged from Rosa chinensis ∼6.6 million years ago and that no lineage-specific whole-genome duplication event occurred after this divergence. Chromosome synteny analysis demonstrated highly conserved synteny between R. rugosa and R. chinensis, as well as between R. rugosa and Prunus persica. Comparative genome and transcriptome analyses revealed genes related to colour, scent, and environmental adaptation. The chromosome-level reference genome provides an important genomic resource for molecular-assisted breeding and horticultural comparative genomics research.

Keywords: Rosa rugosa; Hi-C assembly; chromosome synteny; genome annotation; genome sequencing.

MeSH terms

Chromosomes
Genome
Genomics
Phylogeny
Rosa* / genetics