Eoulsan: a cloud computing-based framework facilitating high throughput sequencing analyses.

Open archive

Jourdren, Laurent | Bernard, Maria | Dillies, Marie-Agnès | Le Crom, Stéphane

Published by CCSD; Oxford University Press (OUP)

International audience. We developed Eoulsan, a modular and scalable framework dedicated to high-throughput sequencing data analysis and based on the Hadoop implementation of the MapReduce algorithm. Eoulsan allows users to easily set up a cloud computing cluster and automate the analysis of several samples at once using a variety of available software solutions. Our tests with Amazon Web Services demonstrated that the computation cost scales linearly with the number of instances booked, as does the running time with increasing amounts of data.

Suggestions

By the same author

Teolenn: an efficient and customizable workflow to design high-quality probes for microarray experiments.

Open archive | Jourdren, Laurent | CCSD

International audience. Despite the development of new high-throughput sequencing techniques, microarrays are still attractive tools to study small genome organisms, thanks to sample multiplexing and high-feature de...

Comparative Transcriptomics Highlights New Features of the Iron Starvation Response in the Human Pathogen Candida glabrata

Open archive | Benchouaia, Médine | CCSD

International audience. In this work, we used comparative transcriptomics to identify regulatory outliers (ROs) in the human pathogen Candida glabrata. ROs are genes that have very different expression patterns comp...

Aozan: an automated post-sequencing data-processing pipeline

Open archive | Perrin, Sandrine | CCSD

International audience. Motivation: Data management and quality control of output from Illumina sequencers is a disk space- and time-consuming task. Thus, we developed Aozan to automatically handle data transfer, de...