Effective normalization for copy number variation in Hi-C data

Open archive

Servant, Nicolas | Varoquaux, Nelle | Heard, Edith | Barillot, Emmanuel | Vert, Jean-Philippe

Published by CCSD; BioMed Central

International audience. Background: Normalization is essential to ensure accurate analysis and proper interpretation of sequencing data, and chromosome conformation capture data such as Hi-C pose particular challenges. Although several methods have been proposed, the most widely used normalization of Hi-C data casts the estimation of unwanted effects as a matrix balancing problem, relying on the assumption that all genomic regions interact equally with each other. Results: To explore the effect of copy-number variations on Hi-C data normalization, we first propose a simulation model that predicts the effects of large copy-number changes on a diploid Hi-C contact map. We then show that the standard approaches relying on equal visibility fail to correct for unwanted effects in the presence of copy-number variations. We thus propose a simple extension to matrix balancing methods that models these effects. Our approach can either retain the copy-number variation effects (LOIC) or remove them (CAIC). We show that this leads to better downstream analysis of the three-dimensional organization of rearranged genomes. Conclusions: Taken together, our results highlight the importance of using dedicated methods for the analysis of Hi-C cancer data. Both the CAIC and LOIC methods perform well on simulated and real Hi-C data sets, each fulfilling different needs.
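To make the "matrix balancing" idea in the abstract concrete, here is a minimal sketch of standard iterative-correction-style balancing under the equal-visibility assumption. It is not the LOIC or CAIC method, whose details are not given in this record. The function name `balance`, its parameters, and the toy matrix are hypothetical and chosen only for illustration.

```python
def balance(matrix, n_iter=200, tol=1e-10):
    """Balance a symmetric contact matrix so every bin has the same total.

    Returns (balanced, bias), where
    balanced[i][j] = matrix[i][j] / (bias[i] * bias[j]).
    This encodes the equal-visibility assumption; it does not model
    copy number (hypothetical illustration, not LOIC/CAIC).
    """
    n = len(matrix)
    bias = [1.0] * n
    balanced = [[float(v) for v in row] for row in matrix]
    for _ in range(n_iter):
        sums = [sum(row) for row in balanced]
        nonzero = [s for s in sums if s > 0]
        mean = sum(nonzero) / len(nonzero)
        # Relative visibility of each bin; empty bins are left untouched.
        scale = [s / mean if s > 0 else 1.0 for s in sums]
        for i in range(n):
            for j in range(n):
                balanced[i][j] /= scale[i] * scale[j]
        bias = [b * s for b, s in zip(bias, scale)]
        if max(abs(s - 1.0) for s in scale) < tol:
            break
    return balanced, bias


# Toy example: a matrix with equal row sums, distorted by per-bin biases.
true_map = [[4, 2, 2], [2, 4, 2], [2, 2, 4]]
true_bias = [1.0, 2.0, 0.5]
observed = [[true_bias[i] * true_bias[j] * true_map[i][j] for j in range(3)]
            for i in range(3)]
balanced, bias = balance(observed)
row_sums = [sum(row) for row in balanced]
```

In this toy case the balanced rows all sum to the same value and the estimated biases are proportional to `true_bias`. When a region is amplified or deleted, its higher or lower contact total is real signal mixed with bias, which is the situation the abstract says equal-visibility balancing fails to handle.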

Suggestions

By the same author

HiC-Pro: An optimized and flexible pipeline for Hi-C data processing

Open archive | Servant, Nicolas | CCSD

International audience. HiC-Pro is an optimized and flexible pipeline for processing Hi-C data from raw reads to normalized contact maps. HiC-Pro maps reads, detects valid ligation products, performs quality control...

Contribution of epigenetic landscapes and transcription factors to X-chromosome reactivation in the inner cell mass

Open archive | Borensztein, Maud | CCSD

International audience. X-chromosome inactivation is established during early development. In mice, transcriptional repression of the paternal X-chromosome (Xp) and enrichment in epigenetic marks such as H3K27me3 is...
