pSATdb 2.0: a database of organellar common, polymorphic, and unique microsatellites

Funct Integr Genomics. 2024 Nov 15;24(6):213. doi: 10.1007/s10142-024-01498-6.

Abstract

Microsatellites, or simple sequence repeats (SSRs), are repetitive DNA sequences typically composed of 1-6 nucleotides. These repetitive sequences are found in almost all genomes, including chloroplasts and mitochondria, and are widely distributed throughout the genomes. Microsatellites are highly polymorphic, and their length may differ from species to species. Consequently, microsatellites are widely used as molecular markers and play pivotal roles in various biological research. However, comprehensive information about the length variation of microsatellites in various organellar genome sequences is not available. Therefore, to provide mined information and explore the variability in the length of microsatellites across species, we developed a comprehensive resource named pSATdb 2.0 (polymorphic microSATellites database; https://bioinfo.icgeb.res.in/psatdb/ ). This upgraded version of its predecessor pSATdb provides comprehensive information on the frequency and distribution of 348,894 microsatellites identified in organellar genome sequences. These sequences originate from 15,681 organisms spanning 3252 genera within Metazoa and Viridiplantae. Remarkably, pSATdb 2.0 is the only database that offers information on common and polymorphic microsatellites detected between organisms, along with unique microsatellites specific to each genus. Furthermore, this database features unrestricted access and includes pioneer functionalities such as Advanced Search, BLAST, and JBrowse, which facilitate user-specific microsatellite search and its visualization within the database. The pSATdb holds immense potential for the research community to support diverse studies, including genetic diversity, genetic mapping, marker-assisted selection, and comparative population investigations.

Keywords: Chloroplast; Database; Microsatellites; Mitochondria.