Experts fail to reliably detect AI-generated histological data

Sci Rep. 2024 Nov 19;14(1):28677. doi: 10.1038/s41598-024-73913-8.

Abstract

AI-based methods to generate images have seen unprecedented advances in recent years challenging both image forensic and human perceptual capabilities. Accordingly, these methods are expected to play an increasingly important role in the fraudulent fabrication of data. This includes images with complicated intrinsic structures such as histological tissue samples, which are harder to forge manually. Here, we use stable diffusion, one of the most recent generative algorithms, to create such a set of artificial histological samples. In a large study with over 800 participants, we study the ability of human subjects to discriminate between these artificial and genuine histological images. Although they perform better than naive participants, we find that even experts fail to reliably identify fabricated data. While participant performance depends on the amount of training data used, even low quantities are sufficient to create convincing images, necessitating methods and policies to detect fabricated data in scientific publications.

Keywords: Artificial intelligence; Fraud; Histology; Misconduct; Publishing; Stable diffusion.

MeSH terms

  • Algorithms*
  • Artificial Intelligence*
  • Humans
  • Image Processing, Computer-Assisted* / methods