Protein similarity search with subset seeds on a dedicated reconfigurable hardware

Archive ouverte

Peterlongo, Pierre | Noé, Laurent | Lavenier, Dominique | Georges, Gilles | Jacques, Julien | Kucherov, Gregory | Giraud, Mathieu

Edité par CCSD -

International audience. Genome sequencing of numerous species raises the need of complete genome comparison with precise and fast similarity searches. Today, advanced seed-based techniques (spaced seeds, multiple seeds, subset seeds) provide better sensitivity/specificity ratios. We present an implementation of such a seed-based technique onto parallel specialized hardware embedding reconfigurable architecture (FPGA), where the FPGA is tightly connected to large capacity Flash memories. This parallel system allows large databases to be fully indexed and rapidly accessed. Compared to traditional approaches like the Blastp software, we obtain both significant speed-up and better results. As our knowledge, this is the first attempt to exploit modern seed features for parallelizing similarity search.

Suggestions

Du même auteur

Optimal neighborhood indexing for protein similarity search

Archive ouverte | Peterlongo, Pierre | CCSD

International audience. Similarity inference, one of the main bioinformatics tasks, has to face an exponential growth of the biological data. A classical approach used to cope with this data flow involves heuristics...

Recherches de motifs et de similarités en bioinformatique : modélisations, solutions logicielles et matérielles

Archive ouverte | Giraud, Mathieu | CCSD

Ce tutoriel expose certains problèmes fondamentaux en algorithmique du texte pour la bioinformatique, leurs solutions actuelles ainsi que quelques perspectives de recherche. Après une introduction expliquant pourquoi la bioinforma...

Seed-based Genomic Sequence Comparison using a FPGA/FLASH Accelerator

Archive ouverte | Lavenier, Dominique | CCSD

International audience. This paper presents a parallel architecture for computing genomic sequence alignments using seed-based algorithms. Originality comes from the simultaneous use of FPGA components and FLASH mem...

Chargement des enrichissements...