LocoGSE, a sequence-based genome size estimator for plants

Archive ouverte

Guenzi-Tiberi, Pierre | Istace, Benjamin | Alsos, Inger Greve | Coissac, Eric | Lavergne, Sébastien | Aury, Jean-Marc | Denoeud, France

Edité par CCSD ; Frontiers -

International audience. Extensive research has focused on exploring the range of genome sizes in eukaryotes, with a particular emphasis on land plants, where significant variability has been observed. Accurate estimation of genome size is essential for various research purposes, but existing sequence-based methods have limitations, particularly for low-coverage datasets. In this study, we introduce LocoGSE, a novel genome size estimator designed specifically for low-coverage datasets generated by genome skimming approaches. LocoGSE relies on mapping the reads on single copy consensus proteins without the need for a reference genome assembly. We calibrated LocoGSE using 430 low-coverage Angiosperm genome skimming datasets and compared its performance against other estimators. Our results demonstrate that LocoGSE accurately predicts monoploid genome size even at very low depth of coverage (<1X) and on highly heterozygous samples. Additionally, LocoGSE provides stable estimates across individuals with varying ploidy levels. LocoGSE fills a gap in sequence-based plant genome size estimation by offering a user-friendly and reliable tool that does not rely on high coverage or reference assemblies. We anticipate that LocoGSE will facilitate plant genome size analysis and contribute to evolutionary and ecological studies in the field. Furthermore, at the cost of an initial calibration, LocoGSE can be used in other lineages.

Consulter en ligne

Suggestions

Du même auteur

Corrigendum: LocoGSE, a sequence-based genome size estimator for plants

Archive ouverte | Guenzi-Tiberi, Pierre | CCSD

International audience

High resolution ancient sedimentary DNA shows that alpine plant diversity is associated with human land use and climate change

Archive ouverte | Garcés-Pastor, Sandra | CCSD

International audience. Abstract The European Alps are highly rich in species, but their future may be threatened by ongoing changes in human land use and climate. Here, we reconstructed vegetation, temperature, hum...

ORTHOSKIM: In silico sequence capture from genomic and transcriptomic libraries for phylogenomic and barcoding applications

Archive ouverte | Pouchon, Charles | CCSD

International audience

Chargement des enrichissements...