R2 retrotransposons are site-specific eukaryotic non-long terminal repeat retrotransposons that copy and paste into gene loci encoding ribosomal RNAs. Recently, we demonstrated that avian A-clade R2 proteins achieve efficient and precise insertion of transgenes into their native safe-harbor loci in human cells. The features of A-clade R2 proteins that support gene insertion are not well characterized. Here, we report high-resolution cryo-electron microscopy structures of two vertebrate A-clade R2 proteins at the initiation of target-primed reverse transcription and after cDNA synthesis and second-strand nicking. Using biochemical and cellular assays, we illuminate the basis for high selectivity of template use ... More
R2 retrotransposons are site-specific eukaryotic non-long terminal repeat retrotransposons that copy and paste into gene loci encoding ribosomal RNAs. Recently, we demonstrated that avian A-clade R2 proteins achieve efficient and precise insertion of transgenes into their native safe-harbor loci in human cells. The features of A-clade R2 proteins that support gene insertion are not well characterized. Here, we report high-resolution cryo-electron microscopy structures of two vertebrate A-clade R2 proteins at the initiation of target-primed reverse transcription and after cDNA synthesis and second-strand nicking. Using biochemical and cellular assays, we illuminate the basis for high selectivity of template use and unique roles for each of the three zinc-finger domains in nucleic acid recognition. Reverse transcriptase active site architecture is reinforced by an unanticipated insertion motif specific to vertebrate A-clade R2 proteins. Our work provides the first insights into A-clade R2 protein structure during gene insertion and may enable future improvement and adaptation of R2-based systems for precise transgene insertion.