Extending the IICR to multiple genomes and identification of limitations of some demographic inferential methods

Archive ouverte

Chikhi, Lounès | Rodríguez, Willy | Paris, Cyriel | Ha-Shan, Marine | Jouniaux, Alexane | Arredondo, Armando | Noûs, Camille | Grusea, Simona | Corujo, Josué | Lourenço, Inês | Boitard, Simon | Mazet, Olivier

Edité par CCSD -

Reconstructing the demographic history of populations and species is one of the greatest challenges facing population geneticists. [50] introduced, for a sample of size k = 2 haploid genomes, a time- and sample-dependent parameter which they called the IICR (inverse instantaneous coalescence rate). Here we extend their work to larger sample sizes and focus on T k , the time to the first coalescence event in a haploid sample of size k where k ≥ 2. We define the IICR k as the Inverse Instantaneous Coalescence Rate among k lineages. We show that (i) under a panmictic population is equivalent to N e , (ii) the IICR k can be obtained by either simulating T k values or by using the Q -matrix approach of [61] and we provide the corresponding Python and R scripts. We then study the properties of the under a limited set of n -island and stepping-stone models. We show that (iii) in structured models the is dependent on the sample size and on the sampling scheme, even when the genomes are sampled in the same deme. For instance, we find that plots for individuals sampled in the same deme will be shifted towards recent times with a lower plateau as k increases. We thus show that (iv) the cannot be used to represent “the demographic history” in a general sense, (v) the can be estimated from real or simulated genomic data using the PSMC/MSMC methods [44, 65] (vi) the MSMC2 method produces smoother curves that infer something that is not the , but are close to the in the recent past when all samples are obtained from the same deme. Altogether we argue that the PSMC, MSMC and MSMC2 plots are not expected to be identical even when the genomes are sampled from the same deme, that none can be said to represent the “demographic history of populations” and that they should be interpreted with care. We suggest that the PSMC, MSMC and MSMC2 could be used together with the to identify the signature of population structure, and to develop new strategies for model choice.

Suggestions

Du même auteur

The IICR and the non-stationary structured coalescent: towards demographic inference with arbitrary changes in population structure

Archive ouverte | Rodríguez, Willy | CCSD

Corrections available at https://doi.org/10.1038/s41437-021-00414-z. International audience. In the last years, a wide range of methods allowing to reconstruct past population size changes from genome-wide data have...

Inferring number of populations and changes in connectivity under the n-island model

Archive ouverte | Arredondo, Armando | CCSD

Correction to: Heredity https://doi.org/10.1038/s41437-021-00426-9, published online 12 April 2021.. International audience. Inferring the demographic history of species is one of the greatest challenges in populati...

Coalescence times for three genes provide sufficient information to distinguish population structure from population size changes

Archive ouverte | Grusea, Simona | CCSD

International audience. The increasing amount of genomic data currently available is expanding the horizons of population genetics inference. A wide range of methods have been published allowing to detect and date m...

Chargement des enrichissements...