Three common mutations in the CARD15 (NOD2) gene are known to be associated with susceptibility to Crohn disease (CD), and genetic data suggest a gene dosage model with an increased risk of 2-4-fold in heterozygotes and 20-40-fold in homozygotes. However, the discovery of numerous rare variants of CARD15 indicates that some heterozygotes for the common mutations have a rare mutation on the other CARD15 allele, which would support a recessive model for CD. We addressed this issue by screening CARD15 for mutations in 100 CD patients who were heterozygous for one of the three common mutations. We also developed a strategy for evaluating potential disease susceptibility alleles (DSAs) that involves assessing the degree of evolutionary conservation of involved residues, predicted effects on protein structure and function, and genotyping in a large sample of cases and controls. The evolutionary analysis was aided by sequencing the entire coding region of CARD15 in three primates (chimp, gibbon, and tamarin) and aligning the human sequence with these and orthologs from other species. We found that 11 of the 100 CD patients screened had a second potential pathogenic mutation within the exonic and periexonic sequences examined. Assuming that there are no additional pathogenic mutations in noncoding regions, our study suggests that most carriers of the common DSAs are true heterozygotes, and supports previous evidence for a gene dosage model. Four novel nonsynonymous mutations were detected, one of which would produce premature termination of translation c.2686C>T (p.Arg896X). Two potential DSAs--c.2107C>T (p.Arg703Cys) and g.2238T>A (c.74-7T>A)--were significantly associated with CD in the case control sample. Analysis of the evolution of CARD15 revealed strong conservation of the encoded protein, with identity to the human sequence ranging from 99.1% in the chimp to 44.5% in fugu. Higher primates possess an open reading frame (ORF) upstream of the putative initiation site in other species that encodes a further 27 N-terminal amino acids, while four regions of high conservation are observed outside of the known domains of CARD15, indicative of additional residues of functional importance. The strategy developed here may have general application to the assessment of mutation pathogenicity and genetic models in other complex disorders.
2005 Wiley-Liss, Inc.