The structure of a genomic DNA fragment encoding mouse cathepsin B was characterized. The genomic insert spans 15 kbp and contains 9 exons that encode the 339 amino acid residues of mouse preprocathepsin B. Intron break-points do not coincide with the junctions between the pre-peptide, pro-peptide and mature enzyme. As in other cysteine proteinase genes, the region around the cysteinyl active site is split by an intron, but in contrast with cathepsins L and H, the intron break-point is located immediately after the active site.