Prioritizing cases from a multi-institutional cohort for a dataset of pathologist annotations

Victor Garcia; Emma Gardecki; Stephanie Jou; Xiaoxian Li; Kenneth R Shroyer; Joel Saltz; Balazs Acs; Katherine Elfer; Jochen Lennerz; Roberto Salgado; Brandon D Gallas

doi:10.1016/j.jpi.2024.100411

Prioritizing cases from a multi-institutional cohort for a dataset of pathologist annotations

J Pathol Inform. 2024 Nov 16:16:100411. doi: 10.1016/j.jpi.2024.100411. eCollection 2025 Jan.

Authors

Victor Garcia¹, Emma Gardecki¹, Stephanie Jou², Xiaoxian Li², Kenneth R Shroyer³, Joel Saltz³, Balazs Acs^{4

5}, Katherine Elfer^{1

6}, Jochen Lennerz⁷, Roberto Salgado^{8

9}, Brandon D Gallas¹

Affiliations

¹ U.S. Food and Drug Administration, Center for Devices and Radiological Health, Office of Science and Engineering Laboratories, Division of Imaging, Diagnostics, and Software Reliability, Silver Spring, MD, United States of America.
² Department of Pathology and Laboratory Medicine, Emory University, Atlanta, GA, United States of America.
³ Department of Pathology, Renaissance School of Medicine, Stony Brook University, Stony Brook, NY, United States of America.
⁴ Department of Oncology and Pathology, Cancer Centre Karolinska (CCK), Karolinska Institutet, Stockholm, Sweden.
⁵ Department of Clinical Pathology and Cancer Diagnostics, Karolinska University Hospital, Stockholm, Sweden.
⁶ Division of Cancer Prevention, National Cancer Institute, National Institute of Health, Shady Grove, MD, United States of America.
⁷ BostonGene, Waltham, MA, USA.
⁸ Division of Research, Peter Mac Callum Cancer Centre, Melbourne, Australia.
⁹ Department of Pathology, ZAS Hospitals, Antwerp, Belgium.

Abstract

Objective: With the increasing energy surrounding the development of artificial intelligence and machine learning (AI/ML) models, the use of the same external validation dataset by various developers allows for a direct comparison of model performance. Through our High Throughput Truthing project, we are creating a validation dataset for AI/ML models trained in the assessment of stromal tumor-infiltrating lymphocytes (sTILs) in triple negative breast cancer (TNBC).

Materials and methods: We obtained clinical metadata for hematoxylin and eosin-stained glass slides and corresponding scanned whole slide images (WSIs) of TNBC core biopsies from two US academic medical centers. We selected regions of interest (ROIs) from the WSIs to target regions with various tissue morphologies and sTILs densities. Given the selected ROIs, we implemented a hierarchical rank-sort method for case prioritization.

Results: We received 122 glass slides and clinical metadata on 105 unique patients with TNBC. All received cases were female, and the mean age was 63.44 years. 60% of all cases were White patients, and 38.1% were Black or African American. After case prioritization, the skewness of the sTILs density distribution improved from 0.60 to 0.46 with a corresponding increase in the entropy of the sTILs density bins from 1.20 to 1.24. We retained cases with less prevalent metadata elements.

Conclusion: This method allows us to prioritize underrepresented subgroups based on important clinical factors. In this manuscript, we discuss how we sourced the clinical metadata, selected ROIs, and developed our approach to prioritizing cases for inclusion in our pivotal study.

Keywords: Data; Prioritization; Sampling; Validation.