Functional effects of mutations in proteins can be predicted and interpreted by guided selection of sequence covariation information

Open archive

Cocco, Simona | Posani, Lorenzo | Monasson, Rémi

Published by CCSD; National Academy of Sciences

International audience. Predicting the effects of one or more mutations on the in vivo or in vitro properties of a wild-type protein is a major computational challenge because of epistasis, that is, interactions between amino acids in the sequence. We introduce a computationally efficient procedure to build minimal epistatic models that predict mutational effects by combining evolutionary (homologous sequence) data with a small amount of mutational-scan data. Mutagenesis measurements guide the selection of links in a sparse graphical model, while the parameters on the nodes and edges are inferred from sequence data. We show, on 10 mutational scans, that our pipeline performs comparably to state-of-the-art deep networks trained on much more data, while requiring far fewer parameters and hence being more interpretable. In particular, the identified interactions adapt to the wild-type protein and to the fitness or biochemical property measured experimentally, mostly focus on key functional sites, and are not necessarily related to structural contacts. Our method is therefore able to extract the information relevant to a single mutational experiment from homologous sequence data that reflect the multitude of structural and functional constraints acting on proteins throughout evolution.
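The idea of letting mutagenesis measurements choose the links while sequence data set the parameters can be illustrated with a toy sketch. This is not the authors' procedure: the greedy Pearson-correlation criterion, the pseudocounted log-frequency fields, the pairwise log-odds couplings, and every function name below are assumptions made for illustration only. Sequences are assumed to be integer-encoded NumPy arrays with symbols in `0..q-1`.

```python
import numpy as np
from itertools import combinations

def fields(msa, q, pc=1.0):
    """Node parameters: per-site log-frequencies from the alignment (pseudocounted)."""
    length = msa.shape[1]
    counts = np.full((length, q), pc)
    for i in range(length):
        counts[i] += np.bincount(msa[:, i], minlength=q)
    return np.log(counts / counts.sum(axis=1, keepdims=True))

def coupling(msa, i, j, q, pc=1.0):
    """Edge parameters (assumed form): log-odds of the joint vs. independent frequencies."""
    joint = np.full((q, q), pc / q)
    np.add.at(joint, (msa[:, i], msa[:, j]), 1.0)
    joint /= joint.sum()
    fi = joint.sum(axis=1, keepdims=True)
    fj = joint.sum(axis=0, keepdims=True)
    return np.log(joint / (fi * fj))

def score(seq, h, J):
    """Log-score of one sequence: node terms plus terms on the selected edges only."""
    total = h[np.arange(len(seq)), seq].sum()
    for (i, j), Jij in J.items():
        total += Jij[seq[i], seq[j]]
    return total

def predict(mutants, wild_type, h, J):
    """Predicted effect of each mutant: score difference with respect to the wild type."""
    wt = score(wild_type, h, J)
    return np.array([score(m, h, J) - wt for m in mutants])

def corr(x, y):
    """Pearson correlation, returning 0.0 when either input is constant."""
    if np.std(x) == 0 or np.std(y) == 0:
        return 0.0
    return float(np.corrcoef(x, y)[0, 1])

def select_edges(msa, wild_type, mutants, measured, q, max_edges=5):
    """Greedily add the edge that most improves agreement with the measurements."""
    h = fields(msa, q)
    J = {}
    best = corr(predict(mutants, wild_type, h, J), measured)
    candidates = list(combinations(range(msa.shape[1]), 2))
    for _ in range(max_edges):
        trials = []
        for e in candidates:
            if e in J:
                continue
            trial = dict(J)
            trial[e] = coupling(msa, e[0], e[1], q)
            trials.append((corr(predict(mutants, wild_type, h, trial), measured), e))
        if not trials:
            break
        r, e = max(trials)
        if r <= best:  # stop when no remaining edge improves the fit
            break
        J[e] = coupling(msa, e[0], e[1], q)
        best = r
    return h, J, best
```

In this sketch the measurements never change the parameter values themselves; they only decide which edges enter the model, so the number of parameters stays small and each retained edge can be inspected directly.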

Suggestions

By the same author

Infer global, predict local: Quantity-relevance trade-off in protein fitness predictions from sequence data

Open archive | Posani, Lorenzo | CCSD

International audience. Predicting the effects of mutations on protein function is an important issue in evolutionary biology and biomedical applications. Computational approaches, ranging from graphical models to d...

A synaptic signal for novelty processing in the hippocampus

Open archive | Gómez-Ocádiz, Ruy | CCSD

International audience. Episodic memory formation and recall are complementary processes that rely on opposing neuronal computations in the hippocampus. How this conflict is resolved in hippocampal circuits...

Integration and multiplexing of positional and contextual information by the hippocampal network

Open archive | Posani, Lorenzo | CCSD

International audience. The hippocampus is known to store cognitive representations, or maps, that encode both positional and contextual information, critical for episodic memories and functional behavior. How path ...
