Current World Health Organization classification of endometrial hyperplasia is problematic because of poor diagnostic reproducibility. We sought to determine factors that cause diagnostic disagreement in a review of 2601 endometrial specimens. Blinded random specimens of normal endometrium, hyperplasias, and carcinoma were reviewed by 2 pathologists, with review by a third pathologist in cases with disagreement. All cases of endometrial hyperplasia or carcinoma were scored for degree of glandular crowding, architectural complexity, and cytologic atypia. Sample adequacy, hyperplasia volume, presence of metaplasia, or endometrial polyp were also scored. The overall kappa for agreement was 0.71, with a lower kappa of 0.36 when cases called "no hyperplasia" were excluded. The percent specific agreement was 90.3% for no hyperplasia, 31.1% for simple hyperplasia, 51.1% for complex hyperplasia, 49.8% for atypical hyperplasia, and 57.5% for adenocarcinoma. Cases categorized as "low volume hyperplasia" had more diagnostic disagreement than "high volume," (62% vs. 39%, P=0.003). Similarly, cases called "scant" had more diagnostic disagreement than "not scant" (65% vs. 57%, P=0.013). The histologic feature associated with the most diagnostic disagreement was cytologic atypia (P<0.0001). Architectural crowding, architectural complexity, or the presence of a polyp were all associated with diagnostic disagreement (P<0.0001). High diagnostic disagreement in endometrial hyperplasia is related to both sample adequacy and interpretation of histologic features present. Although obtaining additional tissue may increase diagnostic reproducibility, differences in interpretation of key histologic features like cytologic atypia remain major factors contributing to diagnostic disagreement.