Computational design of novel Cas9 PAM-interacting domains using evolution-based modelling and structural quality assessment

Open archive

Malbranke, Cyril | Rostain, William | Depardieu, Florence | Cocco, Simona | Monasson, Rémi | Bikard, David

Published by CCSD; PLOS

International audience. We present here an approach to protein design that combines (i) scarce functional information such as experimental data, (ii) evolutionary information learned from natural sequence variants, and (iii) physics-grounded modeling. Using a Restricted Boltzmann Machine (RBM), we learn a sequence model of a protein family. We use semi-supervision to leverage available functional information during the RBM training. We then propose a strategy to explore the protein representation space that can be informed by external models such as an empirical force-field method (FoldX). Our approach is applied to a domain of the Cas9 protein responsible for recognition of a short DNA motif. We experimentally assess the functionality of 71 variants generated to explore a range of RBM and FoldX energies. Sequences with as many as 50 differences (20% of the protein domain) from the wild type retained functionality. Overall, 21/71 sequences designed with our method were functional. Interestingly, 6/71 sequences showed improved activity in comparison with the original wild-type protein sequence. These results demonstrate the value of further exploring the synergies between machine learning of protein sequence representations and physics-grounded modeling strategies informed by structural information.

