Unsupervised Inference of Protein Fitness Landscape from Deep Mutational Scan

Open archive

Fernandez-De-Cossio-Diaz, Jorge | Uguzzoni, Guido | Pagnani, Andrea

Published by CCSD; Oxford University Press (OUP)

International audience. Abstract: The recent technological advances underlying the screening of large combinatorial libraries in high-throughput mutational scans deepen our understanding of adaptive protein evolution and boost its applications in protein design. Nevertheless, the large number of possible genotypes requires suitable computational methods for data analysis, the prediction of mutational effects, and the generation of optimized sequences. We describe a computational method that, trained on sequencing samples from multiple rounds of a screening experiment, provides a model of the genotype–fitness relationship. We tested the method on five large-scale mutational scans, yielding accurate predictions of the mutational effects on fitness. The inferred fitness landscape is robust to experimental and sampling noise and exhibits high generalization power in terms of broader sequence space exploration and higher fitness variant predictions. We investigate the role of epistasis and show that the inferred model provides structural information about the 3D contacts in the molecular fold.

Suggestions

By the same author

Unsupervised modeling of mutational landscapes of adeno-associated viruses viability

Open archive | de Leonardis, Matteo | CCSD

International audience. Adeno-associated viruses 2 (AAV2) are minute viruses renowned for their capacity to infect human cells and akin organisms. They have recently emerged as prominent candidates in the field of g...

Inference of annealed protein fitness landscapes with AnnealDCA

Open archive | Sesta, Luca | CCSD

International audience. The design of proteins with specific tasks is a major challenge in molecular biology with important diagnostic and therapeutic applications. High-throughput screening methods have been develo...

AMaLa: Analysis of Directed Evolution Experiments via Annealed Mutational Approximated Landscape

Open archive | Sesta, Luca | CCSD

International audience. We present Annealed Mutational approximated Landscape (AMaLa), a new method to infer fitness landscapes from Directed Evolution experiments sequencing data. Such experiments typically start f...
