Novel methods included in SpolLineages tool for fast and precise prediction of Mycobacterium tuberculosis complex spoligotype families

Archive ouverte

Couvin, David | Segretier, Wilfried | Stattner, Erick | Rastogi, Nalin

Edité par CCSD ; Oxford University Press -

International audience. Bioinformatic tools are currently being developed to better understand the Mycobacterium tuberculosis complex (MTBC). Several approaches already exist for the identification of MTBC lineages using classical genotyping methods such as mycobacterial interspersed repetitive units-variable number of tandem DNA repeats and spoligotyping-based families. In the recently released SITVIT2 proprietary database of the Institut Pasteur de la Guadeloupe, a large number of spoligotype families were assigned by either manual curation/expertise or using an in-house algorithm. In this study, we present two complementary data-driven approaches allowing fast and precise family prediction from spoligotyping patterns. The first one is based on data transformation and the use of decision tree classifiers. In contrast, the second one searches for a set of simple rules using binary masks through a specifically designed evolutionary algorithm. The comparison with the three main approaches in the field highlighted the good performances of our contributions and the significant runtime gain. Finally, we propose the 'SpolLineages' software tool (https://github.com/dcouvin/SpolLineages), which implements these approaches for MTBC spoligotype families' identification.

Suggestions

Du même auteur

KaruBioNet: a network and discussion group for a better collaboration and structuring of bioinformatics in Guadeloupe (French West Indies)

Archive ouverte | Couvin, David | CCSD

International audience. Sequencing and other biological data are now more frequently available and at a lower price. Mutual tools and strategies are needed to analyze the huge amount of heterogeneous data generated ...

Molecular epidemiology and evolutionary genetics of Mycobacterium tuberculosis isolated from different parts of India

Archive ouverte | Singh, Sarman | CCSD

Open Access since 1 January 2015 (Open Access funded by Asian-African Society for Mycobacteriology). International audience. Background: The evolutionary changes in mycobacterium tuberculosis (MTB) have been phenoty...

Clonal expansion across the seas as seen through CPLP-TB database: A joint effort in cataloguing Mycobacterium tuberculosis genetic diversity in Portuguese-speaking countries

Archive ouverte | Perdigão, João | CCSD

International audience. Tuberculosis (TB) remains a major health problem within the Community of Portuguese Language Speaking Countries (CPLP). Despite the marked variation in TB incidence across its member-states a...

Chargement des enrichissements...