Reflectance spectroscopy and machine learning as a tool for the categorization of twin species based on the example of the Diachrysia genus

Spectrochim Acta A Mol Biomol Spectrosc. 2022 May 15:273:121058. doi: 10.1016/j.saa.2022.121058. Epub 2022 Feb 17.

Abstract

In our work we used noninvasive point reflectance spectroscopy in the range from 400 to 2100 nm coupled with machine learning to study scales on the brown and golden iridescent areas on the dorsal side of the forewing of Diachrysia chrysitis and D. stenochrysis. We used our approach to distinguish between these species of moths. The basis for the study was a statistically significant collection of 95 specimens identified based on morphological feature and gathered during 23 years in Poland. The numerical part of an experiment included two independent discriminant analyses: stochastic and deterministic. The more sensitive stochastic approach achieved average compliance with the species identification made by entomologists at the level of 99-100%. It demonstrated high stability against the different configurations of training and validation sets, hence strong predictors of Diachrysia siblings distinctiveness. Both methods resulted in the same small set of relevant features, where minimal fully discriminating subsets of wavelengths were three for glass scales on the golden area and four for the brown. The differences between species in scales primarily concern their major components and ultrastructure. In melanin-absent glass scales, this is mainly chitin configuration, while in melanin-present brown scales, melanin reveals as an additional factor.

Keywords: Chemometry; LDA; Lepidoptera; Noctuidae; Random Forest.

MeSH terms

  • Animals
  • Machine Learning*
  • Moths*
  • Spectrum Analysis