CroCoDeEL: accurate detection of cross-sample contaminations in metagenomic data

Archive ouverte

Goulet, Lindsay | Onate, Florian Plaza | Prifti, Edi | Belda, Eugeni | Le Chatelier, Emmanuelle | Gautreau, Guillaume

Edité par CCSD -

International audience. Metagenomic sequencing provides deep insights into microbial communities but is subject to various experimental biases such as cross-sample contamination where microbial contents from simultaneously processed samples are accidentally mixed. Although a critical issue that can potentially lead to erroneous conclusions, such contamination remains understudied. A few methods have already been proposed to detect it, but with multiple limitations, notably the lack of sensitivity. Here, we introduce CroCoDeEL, a tool based on a supervised pre-trained model that identifies specific patterns in taxonomic profiles associated with cross-sample contamination. Benchmarks across three public cohorts comprehensively curated by the authors revealed that CroCoDeEL identifies with high accuracy not only the contaminated samples but also their respective contamination sources, even at low rates (< 0.1%) if the sequencing depth allows it. Our work underlines the urgency to acknowledge and systematically address this phenomenon to ensure the robustness of studies based on metagenomic data.

Consulter en ligne

Suggestions

Du même auteur

CroCoDeEL: accurate control-free detection of cross-sample contamination in metagenomic data

Archive ouverte | Goulet, Lindsay | CCSD

Metagenomic sequencing provides profound insights into microbial communities, but it is often compromised by technical biases, including cross-sample contamination. This underexplored phenomenon arises when microbial content is in...

Towards a machine learning approach for automated detection of well-to-well contamination in metagenomics data

Archive ouverte | Goulet, Lindsay | CCSD

International audience. Combining advances in high-throughput sequencing and Big Data, metagenomic sequencing has revolutionized our vision of the microscopic world and microbiology by allowing the characterization,...

Towards a machine learning approach for automated detection of well-to-well contamination in metagenomic data

Archive ouverte | Goulet, Lindsay | CCSD

International audience. Samples subjected to metagenomic sequencing can be accidentally contaminated during wet lab steps (DNA extraction, library preparation) by DNA from an external source (e.g.: lab reagents) or ...

Chargement des enrichissements...