Deep learning has previously shown success in automatically generating morphological traits which carry a phylogenetic signal. In this paper we explore combining molecular data with deep learning derived morphological traits from images of pinned insects to generate total-evidence phylogenies and we reveal challenges. Deep learning derived morphological traits, while informative, underperform when used in isolation compared to molecular analyses. However, they can improve molecular results in total evidence settings. We use a dataset of rove beetle images to compare the effect of different dataset splits and deep metric loss functions on morphological and total evidence results. We find a slight preference for the cladistic dataset split and contrastive loss function. Additionally, we explore the effect of varying the number of genes used in inference and find that different gene combinations provide the best results when used on their own vs in total evidence analysis. Despite the promising nature of integrating deep learning techniques with molecular data, challenges remain regarding the strength of the phylogenetic signal and the resource demands of data acquisition. We suggest that future work focus on improved trait extraction and the development of disentangled networks to better interpret the derived traits, thus expanding the applicability of these methods in phylogenetic studies.
Keywords: cladistics; continuous morphological traits; deep metric learning; image analysis; neural networks; phylogenetics; total evidence analysis.
© The Author(s) 2025. Published by Oxford University Press on behalf of the Society of Systematic Biologists.