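To date, GenScript's services and products have been cited nearly ten thousand times in more than 1,300 biomedical journals, including Cell, Nature, Science, and PNAS, placing the company among the leaders in its industry. About 400 renowned institutions worldwide, including NIH, Harvard, Yale, Stanford, Princeton, and Duke University, have used GenScript's gene synthesis, peptide, antibody, and protein services to publish their research, further demonstrating GenScript's ability to help scientists in the field "Make Research Easy".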

Discovery of CRISPR-Cas12a clades using a large language model

Nature Communications. 2025-08
Yuanyuan Feng, Junchao Shi, Zhanwei Li, Yongqian Li, Jiaxi Yang, Shisheng Huang, Jinfang Zheng, Wei Han, Yunbo Qiao, Jun Zhang, Qi Liu, Yao Yang, Chunyi Hu, Lina Wu, Xiaokang Zhang, Jin Tang, Xingxu Huang, Peixiang Ma
Research Center for Life Sciences computing, Zhejiang Lab
Products/Services Used: Synthetic Guide RNA
Details: The crRNAs were synthesized by GenScript (Nanjing, China), and sequences are listed in Supplementary Table 7.

Abstract

CRISPR-Cas systems revolutionize life science. Metagenomes contain millions of unknown Cas proteins. Traditional mining relies on protein sequence alignments. In this work, we employ an evolutionary scale language model (ESM) to learn the information beyond sequences. Trained with CRISPR-Cas data, ESM accurately identifies Cas proteins without alignment. Limited experimental data restricts feature prediction, but integrating with machine learning enables trans-cleavage activity prediction of uncharacterized Cas12a. We discover 7 undocumented Cas12a subtypes with unique CRISPR loci. Structural analyses reveal 8 subtypes of Cas1, Cas2, and Cas4. Cas12a subtypes display distinct 3D-folds. CryoEM analyses unveil un...

Keywords