Designing Efficient Spaced Seeds for SOLiD Read Mapping.

Archive ouverte

Noé, Laurent | Gîrdea, Marta | Kucherov, Gregory

Edité par CCSD ; Hindawi Publishing Corporation -

International audience. The advent of high-throughput sequencing technologies constituted a major advance in genomic studies, offering new prospects in a wide range of applications.We propose a rigorous and flexible algorithmic solution to mapping SOLiD color-space reads to a reference genome. The solution relies on an advanced method of seed design that uses a faithful probabilistic model of read matches and, on the other hand, a novel seeding principle especially adapted to read mapping. Our method can handle both lossy and lossless frameworks and is able to distinguish, at the level of seed design, between SNPs and reading errors. We illustrate our approach by several seed designs and demonstrate their efficiency.

Consulter en ligne

Suggestions

Du même auteur

Back-translation for discovering distant protein homologies in the presence of frameshift mutations

Archive ouverte | Gîrdea, Marta | CCSD

International audience. BackgroundFrameshift mutations in protein-coding DNA sequences produce a drastic change in the resulting protein sequence, which prevents classic protein alignment methods from revealing the ...

Protein similarity search with subset seeds on a dedicated reconfigurable hardware

Archive ouverte | Peterlongo, Pierre | CCSD

International audience. Genome sequencing of numerous species raises the need of complete genome comparison with precise and fast similarity searches. Today, advanced seed-based techniques (spaced seeds, multiple se...

Improved hit criteria for DNA local alignment.

Archive ouverte | Noé, Laurent | CCSD

International audience. BACKGROUND: The hit criterion is a key component of heuristic local alignment algorithms. It specifies a class of patterns assumed to witness a potential similarity, and this choice is decisi...

Chargement des enrichissements...