Background: Gene expression studies have identified molecular subtypes of breast cancer with implications to chemotherapy recommendations. For distinction of these types, a combination of immunohistochemistry (IHC) markers, including proliferative activity of tumor cells, estimated by Ki67 labeling index is used. Clinical studies are frequently based on IHC performed on tissue microarrays (TMA) with variable tissue sampling. This raises the need for evidence-based sampling criteria for individual IHC biomarker studies. We present a novel tissue sampling simulation model and demonstrate its application on Ki67 assessment in breast cancer tissue taking intratumoral heterogeneity into account.
Methods: Whole slide images (WSI) of 297 breast cancer sections, immunohistochemically stained for Ki67, were subjected to digital image analysis (DIA). Percentage of tumor cells stained for Ki67 was computed for hexagonal tiles super-imposed on the WSI. From this, intratumoral Ki67 heterogeneity indicators (Haralick's entropy values) were extracted and used to dichotomize the tumors into homogeneous and heterogeneous subsets. Simulations with random selection of hexagons, equivalent to 0.75 mm circular diameter TMA cores, were performed. The tissue sampling requirements were investigated in relation to tumor heterogeneity using linear regression and extended error analysis.
Results: The sampling requirements were dependent on the heterogeneity of the biomarker expression. To achieve a coefficient error of 10 %, 5-6 cores were needed for homogeneous cases, 11-12 cores for heterogeneous cases; in mixed tumor population 8 TMA cores were required. Similarly, to achieve the same accuracy, approximately 4,000 nuclei must be counted when the intratumor heterogeneity is mixed/unknown. Tumors of low proliferative activity would require larger sampling (10-12 TMA cores, or 6,250 nuclei) to achieve the same error measurement results as for highly proliferative tumors.
Conclusions: Our data show that optimal tissue sampling for IHC biomarker evaluation is dependent on the heterogeneity of the tissue under study and needs to be determined on a per use basis. We propose a method that can be applied to determine the sampling strategy for specific biomarkers, tissues and study targets. In addition, our findings highlight the benefit of high-capacity computer-based IHC measurement techniques to improve accuracy of the testing.
Keywords: Breast cancer; Digital image analysis; Ki67; TMA; Tissue microarrays; Tissue sampling; Tumor heterogeneity.