Background. Studies of ancestry are difficult in the tomato because it crosses with many wild relatives and with species in the tomato clade that diverged very recently. As a result, its phylogeny in relation to its closest relatives remains uncertain. Using the coding sequence from Solanum lycopersicum, S. galapagense, S. pimpinellifolium, S. corneliomuelleri, and S. tuberosum, together with the genomic sequence from S. lycopersicum 'Heinz'; an heirloom line, S. lycopersicum 'Yellow Pear'; and two of cultivated tomato's closest relatives, S. galapagense and S. pimpinellifolium, we aimed to resolve the phylogenies of these closely related species and to identify phylogenetic discordance in the reference cultivated tomato.
Results. Divergence date estimates suggest that S. lycopersicum, S. galapagense, and S. pimpinellifolium diverged less than 0.5 MYA. Phylogenies based on 8,857 coding sequences support grouping of S. lycopersicum and S. galapagense, although two secondary trees are also highly represented. A total of 25 genes in our analysis had sites with evidence of positive selection along the S. lycopersicum lineage. Whole genome phylogenies showed that, while incongruence is prevalent in genomic comparisons between these genotypes, likely as a result of introgression and incomplete lineage sorting, a primary phylogenetic history was strongly supported.
Conclusions. Based on analysis of these genotypes, S. galapagense appears to be closely related to S. lycopersicum, suggesting that they shared a common ancestor before an S. galapagense ancestor arrived on the Galápagos Islands, but after divergence of the sequenced S. pimpinellifolium. Genes showing selection along the S. lycopersicum lineage may be important in domestication or in selection occurring post-domestication. Further analysis of intraspecific data in these species will help to establish the evolutionary history of cultivated tomato.
The use of an heirloom line is helpful in deducing true phylogenetic information about S. lycopersicum and in identifying regions of introgression from wild species.
Keywords: Genome; Incomplete lineage sorting; Introgression; Phylogeny; Selection; Solanum; Tomato.