Unsupervised modeling of mutational landscapes of adeno-associated viruses viability

Open archive

de Leonardis, Matteo | Fernandez-De-Cossio-Diaz, Jorge | Uguzzoni, Guido | Pagnani, Andrea

Published by CCSD; BioMed Central

International audience. Adeno-associated viruses 2 (AAV2) are small viruses known for their capacity to infect cells of humans and related organisms. They have recently emerged as prominent candidates for gene therapy, primarily because they are non-pathogenic in humans and safe to manipulate. The efficacy of AAV2 as gene therapy vectors hinges on their ability to enter host cells, which depends on assembling a capsid capable of reaching the nucleus of the target cell. To enhance their infection potential, researchers have studied combinatorial libraries obtained by introducing mutations into the capsid. High-throughput experimental techniques such as deep mutational scanning (DMS) have made it feasible to assess experimentally the fitness of these libraries for their intended purpose, and machine learning is starting to show its potential for predicting the mutational landscape from sequence data. In this context, we introduce a biophysically inspired model designed to predict the viability of genetic variants in DMS experiments. The model is tailored to a specific segment of the CAP region within AAV2's capsid protein. To evaluate its effectiveness, we train the model on diverse datasets, each exploring different aspects of the mutational landscape shaped by the selection process. Our assessment of the biophysical model centers on two objectives: (i) providing quantitative predictions of the log-selectivity of variants and (ii) using it as a binary classifier to separate viable from non-viable sequences.
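
As a rough illustration of the two objectives named in the abstract, the sketch below computes a log-selectivity from sequencing counts and thresholds it into viable and non-viable classes. This is not the authors' model. The abstract does not define log-selectivity, so the definition used here (log ratio of a variant's frequency after selection to its frequency before selection) is an assumption, as are the pseudocount, the threshold of zero, and every function and variable name.

```python
import math


def log_selectivity(pre_count, post_count, pre_total, post_total, pseudocount=1.0):
    """Assumed definition: log of (post-selection frequency / pre-selection frequency).

    The pseudocount is a hypothetical regularizer that avoids log(0)
    for variants that vanish after selection.
    """
    pre_freq = (pre_count + pseudocount) / (pre_total + pseudocount)
    post_freq = (post_count + pseudocount) / (post_total + pseudocount)
    return math.log(post_freq / pre_freq)


def is_viable(log_sel, threshold=0.0):
    """Binary call: viable if the variant is enriched beyond the (assumed) threshold."""
    return log_sel > threshold


# Hypothetical counts: (reads before selection, reads after selection).
counts = {"variant_a": (100, 400), "variant_b": (100, 5)}
pre_total = sum(pre for pre, _ in counts.values())
post_total = sum(post for _, post in counts.values())

scores = {
    name: log_selectivity(pre, post, pre_total, post_total)
    for name, (pre, post) in counts.items()
}
labels = {name: is_viable(score) for name, score in scores.items()}
```

With these made-up counts, the enriched variant receives a positive score and the depleted one a negative score. A model of the kind described in the abstract would predict such scores from sequence alone, which the counts-based calculation here does not attempt.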

Suggestions

By the same author

Inference of annealed protein fitness landscapes with AnnealDCA

Open archive | Sesta, Luca | CCSD

International audience. The design of proteins with specific tasks is a major challenge in molecular biology with important diagnostic and therapeutic applications. High-throughput screening methods have been develo...

Unsupervised Inference of Protein Fitness Landscape from Deep Mutational Scan

Open archive | Fernandez-De-Cossio-Diaz, Jorge | CCSD

International audience. Abstract The recent technological advances underlying the screening of large combinatorial libraries in high-throughput mutational scans deepen our understanding of adaptive protein evolution...

AMaLa: Analysis of Directed Evolution Experiments via Annealed Mutational Approximated Landscape

Open archive | Sesta, Luca | CCSD

International audience. We present Annealed Mutational approximated Landscape (AMaLa), a new method to infer fitness landscapes from Directed Evolution experiments sequencing data. Such experiments typically start f...
