Identification of stress-related genes by co-expression network analysis based on the improved turbot genome

Sci Data. 2022 Jun 29;9(1):374. doi: 10.1038/s41597-022-01458-4.

Abstract

Turbot (Scophthalmus maximus), a commercially important flatfish species, is widely cultivated in Europe and China. As intensive farming continues to expand, turbot is exposed to various stresses, which greatly impedes the healthy development of the turbot industry. Here, we present an improved, high-quality, chromosome-scale genome assembly of turbot generated with a combination of PacBio long-read and Illumina short-read sequencing technologies. The genome assembly spans 538.22 Mb and comprises 27 contigs with a contig N50 of 25.76 Mb. Annotation of the genome assembly identified 104.45 Mb of repetitive sequences, 22,442 protein-coding genes and 3,345 ncRNAs. Moreover, a total of 345 stress-responsive candidate genes were identified by gene co-expression network analysis of 14 published stress-related RNA-seq datasets comprising 165 samples. The significantly improved genome assembly and the pool of stress-related candidate genes will provide valuable resources for further research on turbot functional genomics and stress response mechanisms, as well as theoretical support for the development of molecular breeding technology for resistant turbot varieties.
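
For readers unfamiliar with the contig N50 statistic quoted above, the following is a minimal sketch of how N50 is conventionally computed from a list of contig lengths. It is not the authors' pipeline; the function name `contig_n50` and the toy lengths are hypothetical and chosen only for illustration.

```python
def contig_n50(lengths):
    """Return the N50 of a set of contig lengths.

    N50 is the largest length L such that contigs of length >= L
    together cover at least half of the total assembly length.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    total = sum(lengths)
    running = 0
    # Walk contigs from longest to shortest, accumulating their lengths.
    for length in sorted(lengths, reverse=True):
        running += length
        # Stop at the first contig that brings the sum to half the total.
        if 2 * running >= total:
            return length


# Hypothetical toy assembly (arbitrary units), not data from the paper.
toy_lengths = [2, 3, 4, 5, 6]
n50 = contig_n50(toy_lengths)
```

In this toy case the total length is 20, and the two longest contigs (6 and 5) already sum to 11, so the N50 is 5. A high N50 relative to the assembly size, as reported here, indicates that most of the genome sits in a small number of long contigs.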

Publication types

  • Dataset
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Flatfishes* / genetics
  • Gene Expression Regulation
  • Genome
  • Genomics
  • High-Throughput Nucleotide Sequencing
  • Molecular Sequence Annotation
  • Sequence Analysis, DNA
  • Stress, Physiological* / genetics