CRISPR-Cas systems provide heritable immunity against viruses by capturing short invader DNA sequences, termed spacers, and incorporating them into the CRISPR loci of the prokaryotic host genome. Here, we investigate DNA elements that control accurate spacer uptake in the type II-A CRISPR locus of Streptococcus thermophilus. We show that purified Cas1 and Cas2 proteins catalyze spacer integration with high specificity for CRISPR repeat junctions, and that 10 bp of the CRISPR leader sequence is critical for stimulating polarized integration preferentially at the repeat proximal to the leader. Spacer integration proceeds through a two-step transesterification reaction in which the 3' hydroxyl groups of the spacer target both repeat borders on opposite strands. The leader-proximal end of the repeat is preferentially targeted as the first site of integration through recognition of sequences spanning the leader-repeat junction. Subsequently, second-site integration at the leader-distal end of the repeat is specified by multiple determinants, including a length-defining mechanism that relies on a repeat element proximal to the second site of integration. Our results highlight the intrinsic ability of type II Cas1/Cas2 proteins to coordinate directional and site-specific spacer integration into the CRISPR locus, ensuring the precise duplication of the repeat that is required for CRISPR immunity.
© The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.