Inference of annealed protein fitness landscapes with AnnealDCA

Archive ouverte

Sesta, Luca | Pagnani, Andrea | Fernandez-De-Cossio-Diaz, Jorge | Uguzzoni, Guido

Edité par CCSD ; PLOS -

International audience. The design of proteins with specific tasks is a major challenge in molecular biology with important diagnostic and therapeutic applications. High-throughput screening methods have been developed to systematically evaluate protein activity, but only a small fraction of possible protein variants can be tested using these techniques. Computational models that explore the sequence space in-silico to identify the fittest molecules for a given function are needed to overcome this limitation. In this article, we propose AnnealDCA, a machine-learning framework to learn the protein fitness landscape from sequencing data derived from a broad range of experiments that use selection and sequencing to quantify protein activity. We demonstrate the effectiveness of our method by applying it to antibody Rep-Seq data of immunized mice and screening experiments, assessing the quality of the fitness landscape reconstructions. Our method can be applied to several experimental cases where a population of protein variants undergoes various rounds of selection and sequencing, without relying on the computation of variants enrichment ratios, and thus can be used even in cases of disjoint sequence samples.

Suggestions

Du même auteur

AMaLa: Analysis of Directed Evolution Experiments via Annealed Mutational Approximated Landscape

Archive ouverte | Sesta, Luca | CCSD

International audience. We present Annealed Mutational approximated Landscape (AMaLa), a new method to infer fitness landscapes from Directed Evolution experiments sequencing data. Such experiments typically start f...

Unsupervised modeling of mutational landscapes of adeno-associated viruses viability

Archive ouverte | de Leonardis, Matteo | CCSD

International audience. Adeno-associated viruses 2 (AAV2) are minute viruses renowned for their capacity to infect human cells and akin organisms. They have recently emerged as prominent candidates in the field of g...

Unsupervised Inference of Protein Fitness Landscape from Deep Mutational Scan

Archive ouverte | Fernandez-De-Cossio-Diaz, Jorge | CCSD

International audience. Abstract The recent technological advances underlying the screening of large combinatorial libraries in high-throughput mutational scans deepen our understanding of adaptive protein evolution...

Chargement des enrichissements...