Because of the central role of the general practice in the delivery of British primary care, intervention trials in primary care often use the practice as the unit of randomization. The creation of primary care groups (PCGs) in April 1999 changed the organization of primary care and the commissioning of secondary care services. PCGs will directly affect the organization and delivery of primary, secondary and social care services. The PCG therefore becomes an appropriate target for organizational and educational interventions. Trials testing these interventions should involve randomization by PCG. This paper discusses the sample size required for a trial in primary care assessing the effect of a falls prevention programme among older people. In this trial PCGs will be randomized. The sample size calculations involve estimating intra-PCG correlation in primary outcome: fractured femur rate for those 65 years and over. No data on fractured femur rate were available at PCG level. PCGs are, however, similar in size and often coterminous with local authorities. Therefore, intra-PCG correlation in fractured femur rate was estimated from the intra-local authority correlation calculated from routine data. Three alternative trial designs are considered. In the first design, PCGs are selected for inclusion in the trial from the total population of England (eight regions). In the second design, PCGs are selected from two regions only. The third design is similar to the second except that PCGs are stratified by region and baseline value of fracture rate. Intracluster correlation is estimated for each of these designs using two methods: an approximation which assumes cluster sizes are equal and an alternative method which takes account of the fact that cluster sizes vary. Estimates of sample size required vary between 26 and 7 PCGs in each intervention group, depending on the trial design and the method used to calculate sample size. Not unexpectedly, stratification by baseline value of the outcome variable decreases the sample size required. In our analyses, geographic restriction of the population to be sampled reduces between-cluster variability in the primary outcome. This leads to an increase in precision. When allowance for variable cluster size is made, the increase in precision is not as great as would be expected with equal cluster sizes. This paper highlights the usefulness of routine data in work of this kind, and establishes one of the essential prerequisites for our proposed trial and other trials using primary outcomes with similar between-PCG variation: a feasible sample size.
Copyright 2001 John Wiley & Sons, Ltd.