PaSiT: a novel approach based on short-oligonucleotide frequencies for efficient bacterial identification and typing

Open archive

Goussarov, Gleb | Cleenwerck, Ilse | Mysara, Mohamed | Leys, Natalie | Monsieurs, Pieter | Tahon, Guillaume | Carlier, Aurélien | Vandamme, Peter | van Houdt, Rob

Published by CCSD; Oxford University Press (OUP)

International audience.

Motivation: One of the most widespread methods used in taxonomic studies to distinguish between strains or taxa is the calculation of average nucleotide identity. It requires a computationally expensive alignment step and is therefore not suitable for large-scale comparisons. Short oligonucleotide-based methods offer a faster alternative, but at the expense of accuracy. Here, we aim to address this shortcoming by providing software that implements a novel method based on short-oligonucleotide frequencies to compute inter-genomic distances.

Results: Our tetranucleotide and hexanucleotide implementations, which were optimized on a taxonomically well-defined set of over 200 newly sequenced bacterial genomes, are as accurate as the short oligonucleotide-based method TETRA and average nucleotide identity for identifying bacterial species and strains, respectively. Moreover, the lightweight nature of this method makes it applicable to large-scale analyses.

Availability and implementation: The method introduced here was implemented, together with other existing methods, in GenDisCal, a dependency-free program written in C and available as source code from https://github.com/LM-UGent/GenDisCal. The software supports multithreading and has been tested on Windows and Linux (CentOS). In addition, a Java-based graphical user interface that acts as a wrapper for the software is available.
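To illustrate the general idea of comparing genomes by short-oligonucleotide frequencies, here is a minimal Python sketch. It is not the PaSiT method, the TETRA statistic, or any part of GenDisCal: the function names `kmer_frequencies` and `frequency_distance` are hypothetical, and the plain Euclidean distance on single-strand relative frequencies is an assumption chosen only for simplicity.

```python
from itertools import product
from math import sqrt


def kmer_frequencies(seq, k=4):
    """Relative frequency of every DNA k-mer in seq (single strand only).

    Illustrative assumption: windows containing anything other than
    A, C, G or T (for example N) are skipped.
    """
    seq = seq.upper()
    counts = {"".join(p): 0 for p in product("ACGT", repeat=k)}
    total = 0
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if kmer in counts:
            counts[kmer] += 1
            total += 1
    if total == 0:
        # Sequence shorter than k, or no valid window at all.
        return {kmer: 0.0 for kmer in counts}
    return {kmer: c / total for kmer, c in counts.items()}


def frequency_distance(seq_a, seq_b, k=4):
    """Euclidean distance between two k-mer frequency profiles.

    A stand-in distance for illustration; the published method
    uses its own, different formulation.
    """
    fa = kmer_frequencies(seq_a, k)
    fb = kmer_frequencies(seq_b, k)
    return sqrt(sum((fa[m] - fb[m]) ** 2 for m in fa))
```

No alignment is involved: each sequence is scanned once to build a fixed-size profile (256 entries for tetranucleotides, 4096 for hexanucleotides with `k=6`), which is why approaches of this kind scale to large numbers of genomes.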

Suggestions

By the same author

The Complete Genome Sequence of Cupriavidus metallidurans Strain CH34, a Master Survivalist in Harsh and Anthropogenic Environments

Open archive | Janssen, Paul J. | CCSD

Used the MicroScope Platform. International audience. Many bacteria in the environment have adapted to the presence of toxic heavy metals. Over the last 30 years, this heavy metal tolerance was the subject of exten...

Cyclical Patterns Affect Microbial Dynamics in the Water Basin of a Nuclear Research Reactor

Open archive | van Eesbeeck, Valérie | CCSD

International audience. The BR2 nuclear research reactor in Mol, Belgium, runs in successive phases of operation (cycles) and shutdown, whereby a water basin surrounding the reactor vessel undergoes periodic changes...

Introducing SPeDE: High-Throughput Dereplication and Accurate Determination of Microbial Diversity from Matrix-Assisted Laser Desorption–Ionization Time of Flight Mass Spectrometry Data

Open archive | Dumolin, Charles | CCSD

International audience. The isolation of microorganisms from microbial community samples often yields a large number of conspecific isolates. Increasing the diversity covered by an isolate collection entails the imp...
