ContaTester: Fast cross-contamination estimation and identification for large human sequencing cohorts.

Open archive

Delafoy, Damien | Mercier, Jonathan | Larsonneur, Elise | Wiart, Nicolas | Sandron, Florian | Mejean, Thomas | Meslage, Stéphane | Daian, Delphine | Olaso, Robert | Boland, Anne | Deleuze, Jean-François | Meyer, Vincent

Published by CCSD

International audience. Background: Interest in genomic medicine for human health studies and clinical applications is rapidly increasing. Clinical applications require contamination-free samples to avoid misleading results and provide a sound basis for diagnosis. Results: Here we present ContaTester, a tool that requires only allele balance information gathered from a VCF file to detect cross-contamination in germline human DNA samples. Based on a regression model of the allele balance distribution, ContaTester allows fast checking of contamination levels for single samples or large cohorts (less than two minutes per sample). We demonstrate the efficiency of ContaTester using experimental validations: ContaTester shows similar results to methods requiring alignment data, but with a significantly reduced storage footprint and less computation time. Additionally, for contamination levels above 5%, ContaTester can identify contaminants across a cohort, providing important clues for troubleshooting and quality assessment. Conclusions: ContaTester estimates contamination levels from VCF files generated from whole-genome sequencing of normal samples and provides reliable contaminant identification for cohorts or experimental batches.
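To make "allele balance information gathered from a VCF file" concrete, here is a minimal Python sketch. It is not ContaTester's code and does not reproduce its regression model. It only illustrates one common way to derive a per-site allele balance, assuming the VCF carries the standard AD (per-allele read depth) FORMAT field and that allele balance is taken as alternate reads divided by reference plus alternate reads. The function name `allele_balances` and the `sample_index` parameter are hypothetical.

```python
def allele_balances(vcf_lines, sample_index=0):
    """Yield (chrom, pos, allele_balance) for biallelic sites.

    Assumption: allele balance = alt_depth / (ref_depth + alt_depth),
    read from the AD FORMAT field. This is an illustration only, not
    the method implemented in ContaTester.
    """
    for line in vcf_lines:
        if line.startswith("#"):
            continue  # skip meta-information and header lines
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 10 + sample_index:
            continue  # no genotype column for the requested sample
        chrom, pos, alt = fields[0], int(fields[1]), fields[4]
        if "," in alt:
            continue  # keep the sketch simple: biallelic sites only
        keys = fields[8].split(":")
        values = fields[9 + sample_index].split(":")
        if "AD" not in keys or keys.index("AD") >= len(values):
            continue  # AD missing for this record or sample
        try:
            ref_depth, alt_depth = (int(x) for x in values[keys.index("AD")].split(","))
        except ValueError:
            continue  # missing ('.') or malformed depths
        total = ref_depth + alt_depth
        if total == 0:
            continue  # no reads, allele balance undefined
        yield chrom, pos, alt_depth / total


example = [
    "##fileformat=VCFv4.2",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tS1",
    "chr1\t100\t.\tA\tG\t50\tPASS\t.\tGT:AD\t0/1:12,8",
    "chr1\t200\t.\tC\tT\t50\tPASS\t.\tGT:AD\t1/1:0,30",
    "chr1\t300\t.\tG\tA,T\t50\tPASS\t.\tGT:AD\t1/2:0,10,9",
    "chr1\t400\t.\tT\tC\t50\tPASS\t.\tGT:AD\t./.:.,.",
]
result = list(allele_balances(example))
```

In a clean germline sample, heterozygous sites are expected to cluster near 0.5 and homozygous alternate sites near 1.0. Any estimate of contamination from deviations in that distribution would need a fitted model such as the one the abstract describes.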

Suggestions

By the same author

A comparison of high-throughput SARS-CoV-2 sequencing methods from nasopharyngeal samples

Open archive | Gerber, Zuzana | CCSD

International audience. The COVID-19 pandemic caused by the new Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) continues to threaten public health and burden healthcare systems worldwide. Whole SARS-Co...

SURFBAT: a surrogate family based association test building on large imputation reference panels

Open archive | Herzig, Anthony | CCSD

International audience. Abstract Genotype–phenotype association tests are typically adjusted for population stratification using principal components that are estimated genome-wide. This lacks resolution when analyz...

How local reference panels improve imputation in French populations

Open archive | Herzig, Anthony | CCSD

International audience. Imputation servers offer the exclusive possibility to harness the largest public reference panels, which have been shown to deliver very high precision in the imputation of European genomes. Ma...
