The effect of rare alleles on estimated genomic relationships from whole genome sequence data

Open archive

Eynard, Sonia | Windig, Jack | Leroy, Grégoire | van Binsbergen, Rianne | Calus, Mario

Published by CCSD; BioMed Central

BACKGROUND: Relationships between individuals and inbreeding coefficients are commonly used for breeding decisions, but may be affected by the type of data used for their estimation. The proportion of variants with low Minor Allele Frequency (MAF) is larger in whole genome sequence (WGS) data compared to Single Nucleotide Polymorphism (SNP) chips. Therefore, WGS data provide true relationships between individuals and may influence breeding decisions and prioritisation for conservation of genetic diversity in livestock. This study identifies differences between relationships and inbreeding coefficients estimated using pedigree, SNP or WGS data for 118 Holstein bulls from the 1000 Bull genomes project. To determine the impact of rare alleles on the estimates we compared three scenarios of MAF restrictions: variants with a MAF higher than 5%, variants with a MAF higher than 1% and variants with a MAF between 1% and 5%.

RESULTS: We observed significant differences between estimated relationships and, although less significantly, inbreeding coefficients from pedigree, SNP or WGS data, and between MAF restriction scenarios. Computed correlations between pedigree and genomic relationships, within groups with similar relationships, ranged from negative to moderate for both estimated relationships and inbreeding coefficients, but were high between estimates from SNP and WGS (0.49 to 0.99). Estimated relationships from genomic information exhibited higher variation than from pedigree. Analysis of inbreeding coefficients showed that more complete pedigree records lead to higher correlation between inbreeding coefficients from pedigree and genomic data. Finally, estimates and correlations between additive genetic (A) and genomic (G) relationship matrices were lower, and variances of the relationships were larger when accounting for allele frequencies than without accounting for allele frequencies.

CONCLUSIONS: Using pedigree data or genomic information, and including or excluding variants with a MAF below 5% showed significant differences in relationship and inbreeding coefficient estimates. Estimated relationships and inbreeding coefficients are the basis for selection decisions. Therefore, it can be expected that using WGS instead of SNP can affect selection decisions. Inclusion of rare variants will give access to the variation they carry, which is of interest for conservation of genetic diversity.
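
To make the three MAF scenarios and the "with or without allele frequencies" comparison more concrete, the following is a minimal Python sketch, not the authors' pipeline. It assumes genotypes coded as 0/1/2 allele counts in a NumPy array, a VanRaden-style estimator for G (the abstract does not name the estimator used), a fixed frequency of 0.5 as the stand-in for "not accounting for allele frequencies", and strict lower and inclusive upper MAF bounds. All function names are hypothetical.

```python
import numpy as np


def minor_allele_frequency(genotypes):
    """Return the MAF of each variant.

    genotypes: (n_individuals, n_variants) array of allele counts coded 0/1/2
    (this coding is an assumption of the sketch).
    """
    p = np.asarray(genotypes, dtype=float).mean(axis=0) / 2.0
    return np.minimum(p, 1.0 - p)


def maf_mask(genotypes, lower=0.0, upper=0.5):
    """Boolean mask of variants with lower < MAF <= upper.

    Whether the bounds are inclusive is an assumption, not taken from the source.
    """
    maf = minor_allele_frequency(genotypes)
    return (maf > lower) & (maf <= upper)


def scenario_masks(genotypes):
    """Masks for the three MAF restriction scenarios named in the abstract."""
    return {
        "maf_gt_5pct": maf_mask(genotypes, lower=0.05),
        "maf_gt_1pct": maf_mask(genotypes, lower=0.01),
        "maf_1_to_5pct": maf_mask(genotypes, lower=0.01, upper=0.05),
    }


def genomic_relationship(genotypes, use_allele_frequencies=True):
    """VanRaden-style genomic relationship matrix (assumed estimator).

    With use_allele_frequencies=False every variant is centred and scaled
    as if its allele frequency were 0.5.
    """
    m = np.asarray(genotypes, dtype=float)
    if use_allele_frequencies:
        p = m.mean(axis=0) / 2.0           # observed allele frequencies
    else:
        p = np.full(m.shape[1], 0.5)       # fixed frequency of 0.5
    z = m - 2.0 * p                        # centre each variant
    scale = 2.0 * np.sum(p * (1.0 - p))    # zero if all variants are monomorphic
    return z @ z.T / scale
```

Under these assumptions, a relationship matrix for one scenario would be obtained by selecting columns first, for example `genomic_relationship(genotypes[:, scenario_masks(genotypes)["maf_gt_1pct"]])`. Filtering with a positive lower bound also removes monomorphic variants, which would otherwise contribute nothing to the numerator and could drive the scaling term to zero.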

Suggestions

By the same author

Whole-genome sequence data uncover loss of genetic diversity due to selection

Open archive | Eynard, Sonia | CCSD

International audience. Background: Whole-genome sequence (WGS) data give access to more complete structural genetic information of individuals, including rare variants, not fully covered by single nucleotide polymo...

The use of whole genome sequence data to estimate genetic relationships including rare alleles information

Open archive | Leroy, Grégoire | CCSD

International audience

The Use of Whole Genome Sequence Data to Estimate Genetic Relationships Including Rare Alleles Information

Open archive | Eynard, Sonia | CCSD

International audience
