The successes and pitfalls: Deep-learning effectiveness in a Chernobyl field camera trap application

Ecol Evol. 2023 Sep 5;13(9):e10454. doi: 10.1002/ece3.10454. eCollection 2023 Sep.

Abstract

Camera traps have become in situ sensors for collecting information on animal abundance and occupancy estimates. When deployed over a large landscape, camera traps have become ideal for measuring the health of ecosystems, particularly in unstable habitats where it can be dangerous or even impossible to observe using conventional methods. However, manual processing of imagery is extremely time and labor intensive. Because of the associated expense, many studies have started to employ machine-learning tools, such as convolutional neural networks (CNNs). One drawback for the majority of networks is that a large number of images (millions) are necessary to devise an effective identification or classification model. This study examines specific factors pertinent to camera trap placement in the field that may influence the accuracy metrics of a deep-learning model that has been trained with a small set of images. False negatives and false positives may occur due to a variety of environmental factors that make it difficult for even a human observer to classify, including local weather patterns and daylight. We transfer-trained a CNN to detect 16 different object classes (14 animal species, humans, and fires) across 9576 images taken from camera traps placed in the Chernobyl Exclusion Zone. After analyzing wind speed, cloud cover, temperature, image contrast, and precipitation, there was not a significant correlation between CNN success and ambient conditions. However, a possible positive relationship between temperature and CNN success was noted. Furthermore, we found that the model was more successful when images were taken during the day as well as when precipitation was not present. This study suggests that while qualitative site-specific factors may confuse quantitative classification algorithms such as CNNs, training with a dynamic training set can account for ambient conditions so that they do not have a significant impact on CNN success.

Keywords: Chernobyl; animal abundance and occupancy; artificial intelligence; camera trap imagery; convolutional neural network; machine learning.