Profiling candidate therapeutics with limited cancer models during preclinical development hinders the prediction of clinical efficacy and the identification of factors that underlie heterogeneous patient responses and could inform patient-selection strategies. We established ∼1,000 patient-derived tumor xenograft models (PDXs) with a diverse set of driver mutations. With these PDXs, we performed in vivo compound screens using a 1 × 1 × 1 experimental design (PDX clinical trial, or PCT) to assess the population responses to 62 treatments across six indications. We demonstrate both the reproducibility and the clinical translatability of this approach by identifying both associations between genotype and drug response and established mechanisms of resistance. In addition, our results suggest that PCTs may represent a more accurate approach than cell line models for assessing the clinical potential of some therapeutic modalities. We therefore propose that this experimental paradigm could improve preclinical evaluation of treatment modalities and enhance our ability to predict clinical trial responses.