Uncovering patterns of the evolution of genomic sequence entropy and complexity

Mol Genet Genomics. 2021 Mar;296(2):289-298. doi: 10.1007/s00438-020-01729-y. Epub 2020 Oct 21.

Abstract

The lack of consensus on the biological meaning of genomic entropy and complexity, together with the different ways of assessing them, hampers conclusions about the causes of genomic entropy variation among species. This study evaluates the entropy and complexity of the genomic sequences of several species without using homologies, in order to assess the relationships among these variables and non-molecular data (e.g., the number of individuals) and to seek a trigger of interspecific genomic entropy variation. The results indicate a relationship among genomic entropy, genome size, genomic complexity, and the number of individuals: species with a small number of individuals harbor large genomes and, hence, low entropy but higher complexity. We defined the complexity of a genome as relying on the entropy of each DNA segment within the genome. Thus, the entropy and complexity of a genome reflect solely its organization. Exons of vertebrates harbor smaller entropies than non-exon regions (likely because of the repeats that accumulated from duplications), whereas other taxonomic groups do not present this pattern. Our findings suggest that a small initial population might have defined current genomic entropy and complexity: present-day genomes are less complex than ancestral ones. In addition, our data disagree with the previously established relationship between phenotype and genomic entropy. Finally, by establishing the relationship of genomic entropy and complexity with the number of individuals and genome size from an evolutionary perspective, new ideas concerning genomic variability may emerge.
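
As an illustration of the per-segment entropy the abstract refers to, the following is a minimal sketch, not the authors' method. It assumes Shannon entropy over k-mer frequencies and fixed, non-overlapping windows; the function names, the window size, and the choice of k are hypothetical, and the abstract does not state how complexity is derived from the segment entropies, so that step is not shown.

```python
import math
from collections import Counter


def shannon_entropy(seq: str, k: int = 1) -> float:
    """Shannon entropy, in bits, of the k-mer distribution of a DNA string.

    Assumption: k-mers containing anything other than A, C, G, T are skipped.
    """
    seq = seq.upper()
    kmers = [seq[i:i + k] for i in range(len(seq) - k + 1)]
    kmers = [m for m in kmers if set(m) <= set("ACGT")]
    if not kmers:
        return 0.0
    total = len(kmers)
    counts = Counter(kmers)
    # H = -sum(p * log2(p)) over the observed k-mers
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def segment_entropies(seq: str, window: int, k: int = 1) -> list[float]:
    """Entropy of each non-overlapping window (hypothetical segmentation)."""
    return [
        shannon_entropy(seq[i:i + window], k)
        for i in range(0, len(seq) - window + 1, window)
    ]


if __name__ == "__main__":
    # A repetitive segment followed by a maximally mixed one.
    print(segment_entropies("AAAAACGT", window=4))
```

With k = 1 the entropy of a segment ranges from 0 bits (a single repeated base) to 2 bits (all four bases equally frequent), so repeat-rich segments score lower than mixed ones.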

Keywords: Biological complexity; Comparative genomics; Genomic complexity; Genomic evolution; Shannon entropy of genomes.

MeSH terms

  • Animals
  • Entropy
  • Evolution, Molecular
  • Genetic Variation*
  • Genome
  • Humans
  • Models, Genetic
  • Sequence Analysis, DNA / methods*
  • Vertebrates / growth & development*