Complementarity of assembly-first and mapping-first approaches for alternative splicing annotation and differential analysis from RNAseq data

Archive ouverte

Benoit-Pilven, Clara | Marchet, Camille | Chautard, Emilie | Lima, Leandro | Lambert, Marie-Pierre | Sacomoto, Gustavo | Rey, Amandine | Cologne, Audric | Terrone, Sophie | Dulaurier, Louis | Claude, Jean-Baptiste | Bourgeois, Cyril | Auboeuf, Didier | Lacroix, Vincent

Edité par CCSD ; Nature Publishing Group -

International audience. Genome-wide analyses estimate that more than 90% of multi exonic human genes produce at least two transcripts through alternative splicing (AS). Various bioinformatics methods are available to analyze AS from RNAseq data. Most methods start by mapping the reads to an annotated reference genome, but some start by a de novo assembly of the reads. In this paper, we present a systematic comparison of a mapping-first approach (FARLINE) and an assembly-first approach (KISSPLICE). We applied these methods to two independent RNAseq datasets and found that the predictions of the two pipelines overlapped (70% of exon skipping events were common), but with noticeable differences. The assembly-first approach allowed to find more novel variants, including novel unannotated exons and splice sites. It also predicted AS in recently duplicated genes. The mapping-first approach allowed to find more lowly expressed splicing variants, and splice variants overlapping repeats. This work demonstrates that annotating AS with a single approach leads to missing out a large number of candidates, many of which are differentially regulated across conditions and can be validated experimentally. We therefore advocate for the combined use of both mapping-first and assembly-first approaches for the annotation and differential analysis of AS from RNAseq datasets.

Suggestions

Du même auteur

Annotation and differential analysis of alternative splicing using de novo assembly of RNAseq data

Archive ouverte | Benoit-Pilven, Clara | CCSD

Genome-wide analyses reveal that more than 90% of multi exonichuman genes produce at least two transcripts through alternative splicing (AS). Various bioinformatics methods are available to analyze ASfrom RNAseq data. Most method...

The RNA helicase DDX17 controls the transcriptional activity of REST and the expression of proneural microRNAs in neuronal differentiation

Archive ouverte | Lambert, Marie-Pierre | CCSD

International audience. The Repressor Element 1-silencing transcription factor (REST) represses a number of neuronal genes in non-neuronal cells or in undifferentiated neural progenitors. Here, we report that the DE...

Playing hide and seek with repeats in local and global de novo transcriptome assembly of short RNA-seq reads

Archive ouverte | Lima, Leandro | CCSD

International audience. AbstractBackground The main challenge in de novo genome assembly of DNA-seq data is certainly to deal with repeats that are longer than the reads. In de novo transcriptome assembly of RNA-seq...

Chargement des enrichissements...