Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut

Archive ouverte

Armero Villanueva, Alix | Baudouin, Luc | Bocs, Stéphanie | This, Dominique

Edité par CCSD ; Public Library of Science -

AGAP : équipe ID. The palms are a family of tropical origin and one of the main constituents of the ecosystems of these regions around the world. The two main species of palm represent different challenges: coconut (Cocos nucifera L.) is a source of multiple goods and services in tropical communities, while oil palm (Elaeis guineensis Jacq) is the main protagonist of the oil market. In this study, we present a workflow that exploits the comparative genomics between a target species (coconut) and a reference species (oil palm) to improve the transcriptomic data, providing a proteome useful to answer functional or evolutionary questions. This workflow reduces redundancy and fragmentation, two inherent problems of transcriptomic data, while preserving the functional representation of the target species. Our approach was validated in Arabidopsis thaliana using Arabidopsis lyrata and Capsella rubella as references species. This analysis showed the high sensitivity and specificity of our strategy, relatively independent of the reference proteome. The workflow increased the length of proteins products in A. thaliana by 13%, allowing, often, to recover 100% of the protein sequence length. In addition redundancy was reduced by a factor greater than 3. In coconut, the approach generated 29,366 proteins, 1,246 of these proteins deriving from new contigs obtained with the BRANCH software. The coconut proteome presented a functional profile similar to that observed in rice and an important number of metabolic pathways related to secondary metabolism. The new sequences found with BRANCH software were enriched in functions related to biotic stress. Our strategy can be used as a complementary step to de novo transcriptome assembly to get a representative proteome of a target species. The results of the current analysis are available on the website PalmComparomics (http://palm-comparomics.southgreen.fr/).

Suggestions

Du même auteur

Reconstructing the genome of the most recent common ancestor of flowering plants

Archive ouverte | Murat, Florent | CCSD

We describe here the reconstruction of the genome of the most recent common ancestor (MRCA) of modern monocots and eudicots, accounting for 95% of extant angiosperms, with its potential repertoire of 22,899 ancestral genes conserv...

Understanding Brassicaceae evolution through ancestral genome reconstruction

Archive ouverte | Murat, Florent | CCSD

Brassicaceae is a family of green plants of high scientific and economic interest, including thale cress (Arabidopsis thaliana), cruciferous vegetables (cabbages) and rapeseed.We reconstruct an evolutionary framework of Brassicace...

Coconut genome assembly enables evolutionary analysis of palms and highlights signaling pathways involved in salt tolerance

Archive ouverte | Yang, Yaodong | CCSD

International audience. Coconut (Cocos nucifera) is the emblematic palm of tropical coastal areas all around the globe. It provides vital resources to millions of farmers. In an effort to better understand its evolu...

Chargement des enrichissements...