The Protist Ribosomal Reference database (PR2): a catalog of unicellular eukaryote Small Sub-Unit rRNA sequences with curated taxonomy

Archive ouverte

Guillou, Laure | Bachar, Dipankar | Audic, Stéphane | Bass, David | Berney, Cédric | Bittner, Lucie | Boutte, Christophe | Burgaud, Gaëtan | de Vargas, Colomban | Decelle, Johan | Campo, Javier, Del | Dolan, John R. | Dunthorn, Micah | Edvardsen, Bente | Holzmann, Maria | Kooistra, Wiebe H. C. F. | Lara, Enrique | Le Bescot, Noan | Logares, Ramiro | Mahé, Frédéric | Massana, Ramon | Montresor, Marina | Morard, Raphaël | Not, Fabrice | Pawlowski, Jan | Probert, Ian | Sauvadet, Anne-Laure | Siano, Raffaele | Stoeck, Thorsten | Vaulot, Daniel | Zimmermann, Pascal | Christen, Richard

Edité par CCSD ; Oxford University Press -

International audience. The interrogation of genetic markers in environmental meta-barcoding studies is currently seriously hindered by the lack of taxonomically curated reference data sets for the targeted genes. The Protist Ribosomal Reference database (PR2, http://ssu-rrna.org/) provides a unique access to eukaryotic small sub-unit (SSU) ribosomal RNA and DNA sequences, with curated taxonomy. The database mainly consists of nuclear-encoded protistan sequences. However, metazoans, land plants, macrosporic fungi and eukaryotic organelles (mitochondrion, plastid and others) are also included because they are useful for the analysis of high-troughput sequencing data sets. Introns and putative chimeric sequences have been also carefully checked. Taxonomic assignation of sequences consists of eight unique taxonomic fields. In total, 136 866 sequences are nuclear encoded, 45 708 (36 501 mitochondrial and 9657 chloroplastic) are from organelles, the remaining being putative chimeric sequences. The website allows the users to download sequences from the entire and partial databases (including representative sequences after clustering at a given level of similarity). Different web tools also allow searches by sequence similarity. The presence of both rRNA and rDNA sequences, taking into account introns (crucial for eukaryotic sequences), a normalized eight terms ranked-taxonomy and updates of new GenBank releases were made possible by a long-term collaboration between experts in taxonomy and computer scientists.

Suggestions

Du même auteur

Marine protist diversity in European coastal waters and sediments as revealed by high-throughput sequencing

Archive ouverte | Massana, Ramon | CCSD

International audience. Although protists are critical components of marine ecosystems, they are still poorly characterized. Here we analysed the taxonomic diversity of planktonic and benthic protist communities col...

Benthic protists: the under-charted majority

Archive ouverte | Forster, Dominik | CCSD

International audience. Marine protist diversity inventories have largely focused on planktonic environments, while benthic protists have received relatively little attention. We therefore hypothesize that current d...

Patterns of Rare and Abundant Marine Microbial Eukaryotes

Archive ouverte | Logares, Ramiro | CCSD

International audience. Background : Biological communities are normally composed of a few abundant and many rare species. This pattern is particularly prominent in microbial communities, in which most constituent t...

Chargement des enrichissements...