Elucidating the process of SNPs identification in non-reference genome crops

J Biomol Struct Dyn. 2023;41(24):15682-15690. doi: 10.1080/07391102.2023.2194002. Epub 2023 Apr 5.

Abstract

Advances in the next generation sequencing technologies, genome reduction techniques and bioinformatics tools have given a big impetus to the identification of genome-wide single nucleotide polymorphisms (SNPs) in crops. NGS technologies can make available a large amount of sequence data in a short span of time. The huge data requires detailed bioinformatics analysis steps, including preprocessing, mapping, and identification of sequence variants. A plethora of available software meant for sequence analysis is used for different sequence analysis steps. However, SNPs identification is far more challenging for orphaned crops or non-reference genome crops. The current article reports different steps for in silico SNPs identification in a sequential manner and proposes some mapping approaches using CLC Genomics software that could provide an alternative method for SNPs identification in orphan crops having no reference genome. The three mapping approaches: Common reference map from progenitor genomes (CRMPG), step-wise use of progenitor genomes (SWPG) and de novo assembly of sequence read (DASR) were validated with the dd-RAD sequenced data of two genotypes from Brassica juncea.Communicated by Ramaswamy H. Sarma.

Keywords: Next-generation sequencing; mapping approaches; orphan crops; single nucleotide polymorphisms.

Publication types

  • Review

MeSH terms

  • Genome, Plant* / genetics
  • Genomics / methods
  • Genotype
  • High-Throughput Nucleotide Sequencing / methods
  • Polymorphism, Single Nucleotide* / genetics
  • Sequence Analysis, DNA / methods