Comparison of normalization methods for differential gene expression analysis in RNA-Seq experiments: A matter of relative size of studied transcriptomes. Comparison of normalization methods for differential gene expression analysis in RNA-Seq experiments: A matter of relative size of studied transcriptomes: A matter of relative size of studied transcriptomes

Archive ouverte

Maza, Elie | Frasse, Pierre | Senin, Pavel | Bouzayen, Mondher | Zouine, Mohamed

Edité par CCSD ; Taylor & Francis Open -

International audience. In recent years, RNA-Seq technologies became a powerful tool for transcriptome studies. However, computational methods dedicated to the analysis of high-throughput sequencing data are yet to be standardized. In particular, it is known that the choice of a normalization procedure leads to a great variability in results of differential gene expression analysis. The present study compares the most widespread normalization procedures and proposes a novel one aiming at removing an inherent bias of studied transcriptomes related to their relative size. Comparisons of the normalization procedures are performed on real and simulated data sets. Real RNA-Seq data sets analyses, performed with all the different normalization methods, show that only 50% of significantly differentially expressed genes are common. This result highlights the influence of the normalization step on the differential expression analysis. Real and simulated data sets analyses give similar results showing 3 different groups of procedures having the same behavior. The group including the novel method named "Median Ratio Normalization" (MR N) gives the lower number of false discoveries. Within this group the MR N method is less sensitive to the modification of parameters related to the relative size of transcriptomes such as the number of down- and upregulated genes and the gene expression levels. The newly proposed MR N method efficiently deals with intrinsic bias resulting from relative size of studied transcriptomes. Validation with real and simulated data sets confirmed that MR N is more consistent and robust than existing methods.

Suggestions

Du même auteur

About stable combinations of non-stable genes as reference genes for RT-qPCR data normalization

Archive ouverte | Djari, Anis | CCSD

International audience. Gene expression profiling is of key importance in all domains of life sciences, as medicine, environment, and plants, for both basic and applied research. In spite of the emergence of microar...

Comprehensive Profiling of Ethylene Response Factor Expression Identifies Ripening-Associated ERF Genes and Their Link to Key Regulators of Fruit Ripening in Tomato

Archive ouverte | Liu, Mingchun | CCSD

International audience. Our knowledge of the factors mediating ethylene-dependent ripening of climacteric fruit remains limited. The transcription of ethylene-regulated genes is mediated by ethylene response factors...

Overexpression of the class D MADS-box gene Sl-AGL11 impacts fleshy tissue differentiation and structure in tomato fruits

Archive ouverte | Huang, Baowen | CCSD

International audience. MADS-box transcription factors are key elements of the genetic networks controlling flower and fruit development. Among these, the class D clade gathers AGAMOUS-like genes which are involved ...

Chargement des enrichissements...