Chromosome-level quality scaffolding of brown algal genomes using InstaGRAAL, a proximity ligation-based scaffolder

Archive ouverte

Baudry, Lyam | Marbouty, Martial | Marie-Nelly, Hervé | Cormier, Alexandre | Guiglielmoni, Nadège | Avia, Komlan | Mie, Yann Loe | Godfroy, Olivier | Sterck, Lieven | Cock, J. Mark | Zimmer, Christophe | Coelho, Susana M. | Koszul, Romain

Edité par CCSD ; BioRxiv -

Posté sur BioRxiv le 23 décembre 2019. International audience. Hi-C has become a popular technique in recent genome assembly projects. Hi-C exploits contact frequencies between pairs of loci to bridge and order contigs in draft genomes, resulting in chromosome-level assemblies. However, application of this approach is currently hampered by a lack of robust programs that are capable of effectively treating this type of data, particularly open source programs. We developed instaGRAAL, a complete overhaul of the GRAAL program, which has adapted the latter to allow efficient assembly of large genomes. Both GRAAL, and instaGRAAL use a Markov Chain Monte Carlo algorithm to perform Hi-C scaffolding, but instaGRAAL features a number of improvements including a modular polishing approach that optionally integrates independent data. To validate the program, we used it to generate chromosome-level assemblies for two brown algae, Desmarestia herbacea and the model Ectocarpus sp., and quantified improvements compared to the initial draft for the latter. Overall, instaGRAAL is a program able to generate, using default parameters with minimal human intervention, near-complete assemblies.

Suggestions

Du même auteur

instaGRAAL: chromosome-level quality scaffolding of genomes using a proximity ligation-based scaffolder

Archive ouverte | Baudry, Lyam | CCSD

International audience. Hi-C exploits contact frequencies between pairs of loci to bridge and order contigs during genome assembly, resulting in chromosome-level assemblies. Because few robust programs are available...

Re-annotation, improved large-scale assembly and establishment of a catalogue of noncoding loci for the genome of the model brown alga Ectocarpus

Archive ouverte | Cormier, Alexandre | CCSD

International audience. The genome of the filamentous brown alga Ectocarpus was the first to be completely sequenced from within the brown algal group and has served as a key reference genome both for this lineage a...

Chromosome-level genome assembly reveals homologous chromosomes and recombination in asexual rotifer Adineta vaga

Archive ouverte | Simion, Paul | CCSD

International audience. Bdelloid rotifers are notorious as a speciose ancient clade comprising only asexual lineages. Thanks to their ability to repair highly fragmented DNA, most bdelloid species also withstand com...

Chargement des enrichissements...