Evaluating the usefulness of alignment filtering methods to reduce the impact of errors on evolutionary inferences

Archive ouverte

Di Franco, Arnaud | Poujol, Raphaël | Baurain, Denis | Philippe, Herve

Edité par CCSD ; BioMed Central -

International audience. Background: Multiple Sequence Alignments (MSAs) are the starting point of molecular evolutionary analyses. Errors in MSAs generate a non-historical signal that can lead to incorrect inferences. Therefore, numerous efforts have been made to reduce the impact of alignment errors, by improving alignment algorithms and by developing methods to filter out poorly aligned regions. However, MSAs do not only contain alignment errors, but also primary sequence errors. Such errors may originate from sequencing errors, from assembly errors, or from erroneous structural annotations (such as incorrect intron/exon boundaries). Even though their existence is acknowledged, the impact of primary sequence errors on evolutionary inference is poorly characterized.

Suggestions

Du même auteur

Decontamination, pooling and dereplication of the 678 samples of the Marine Microbial Eukaryote Transcriptome Sequencing Project

Archive ouverte | van Vlierberghe, Mick | CCSD

International audience. Objectives: Complex algae are photosynthetic organisms resulting from eukaryote-to-eukaryote endosymbioticlike interactions. Yet the specific lineages and mechanisms are still under debate. T...

Lower statistical support with larger datasets: insights from the Ochrophyta radiation

Archive ouverte | Di Franco, Arnaud | CCSD

International audience. It is commonly assumed that increasing the number of characters has the potential to resolve evolutionary radiations. Here, we studied photosynthetic stramenopiles (Ochrophyta) using alignmen...

A Large and Consistent Phylogenomic Dataset Supports Sponges as the Sister Group to All Other Animals

Archive ouverte | Simion, Paul | CCSD

International audience. Resolving the early diversification of animal lineages has proven difficult, even using genome-scale datasets. Several phylogenomic studies have supported the classical scenario in which spon...

Chargement des enrichissements...