RegulonDB version 9.0: high-level integration of gene regulation, coexpression, motif clustering and beyond

Archive ouverte

Gama-Castro, Socorro | Salgado, Heladia | Santos-Zavaleta, Alberto | Ledezma-Tejeida, Daniela | Muniz-Rascado, Luis | Santiago Garcia-Sotelo, Jair | Alquicira-Hernandez, Kevin | Martinez-Flores, Irma | Pannier, Lucia | Castro-Mondragon, Jaime Abraham | Medina-Rivera, Alejandra | Solano-Lira, Hilda | Bonavides-Martinez, Cesar | Perez-Rueda, Ernesto | Alquicira-Hernandez, Shirley | Porron-Sotelo, Liliana | Lopez-Fuentes, Alejandra | Hernandez-Koutoucheva, Anastasia | del Moral-Chavez, Victor | Rinaldi, Fabio | Collado-Vides, Julio

Edité par CCSD ; Oxford University Press -

International audience. RegulonDB (http://regulondb.ccg.unam.mx) is one of the most useful and important resources on bacterial gene regulation, as it integrates the scattered scientific knowledge of the best-characterized organism, Escherichia coli K-12, in a database that organizes large amounts of data. Its electronic format enables researchers to compare their results with the legacy of previous knowledge and supports bioinformatics tools and model building. Here, we summarize our progress with RegulonDB since our last Nucleic Acids Research publication describing RegulonDB, in 2013. In addition to maintaining curation up-to-date, we report a collection of 232 interactions with small RNAs affecting 192 genes, and the complete repertoire of 189 Elementary Genetic Sensory-Response units (GENSOR units), integrating the signal, regulatory interactions, and metabolic pathways they govern. These additions represent major progress to a higher level of understanding of regulated processes. We have updated the computationally predicted transcription factors, which total 304 (184 with experimental evidence and 120 from computational predictions); we updated our position-weight matrices and have included tools for clustering them in evolutionary families. We describe our semiautomatic strategy to accelerate curation, including datasets from high-throughput experiments, a novel coexpression distance to search for `neighborhood' genes to known operons and regulons, and computational developments.

Consulter en ligne

Suggestions

Du même auteur

Theoretical and empirical quality assessment of transcription factor-binding motifs

Archive ouverte | Medina-Rivera, Alejandra | CCSD

International audience. Position-specific scoring matrices (PSSMs) are routinely used to predict transcription factor (TF)-binding sites in genome sequences. However, their reliability to predict novel binding sites...

Overview of the interactive task in BioCreative V

Archive ouverte | Wang, Qinghua | CCSD

International audience. Fully automated text mining (TM) systems promote efficient literature searching, retrieval, and review but are not sufficient to produce ready-to-consume curated documents. These systems are ...

Pf7: an open dataset of Plasmodium falciparum genome variation in 20,000 worldwide samples

Archive ouverte | Abdel Hamid, Muzamil Mahdi | CCSD

International audience. We describe the MalariaGEN Pf7 data resource, the seventh release of Plasmodium falciparum genome variation data from the MalariaGEN network. It comprises over 20,000 samples from 82 partner ...

Chargement des enrichissements...