In the problem of large-scale multiple testing the p-plot is a graphically based competitor to the notoriously weak Bonferroni method. The p-plot is less stringent and more revealing in that it gives a gauge of how many hypotheses are decidedly false. The method is elucidated and extended here: the bootstrap reveals bias and sampling error in the usual point estimates, a bootstrap-based confidence interval for the gauge is given, as well as two acceptably powerful blanket tests of significance. Guidelines for use are given, and interpretational pitfalls pointed out, in the discussion of a case study linking premortem neuropsychological and postmortem neuropathologic data in an HIV cohort study.