Exploring the link between additive heritability and prediction accuracy from a ridge regression perspective

Archive ouverte

Frouin, Arthur | Dandine-Roulland, Claire | Pierre-Jean, Morgane | Deleuze, Jean-François | Ambroise, Christophe | Le Floch, Edith

Edité par CCSD ; Frontiers Media -

International audience. Genome-Wide Association Studies (GWAS) explain only a small fraction of heritability for most complex human phenotypes. Genomic heritability estimates the variance explained by the SNPs on the whole genome using mixed models and accounts for the many small contributions of SNPs in the explanation of a phenotype. This paper approaches heritability from a machine learning perspective, and examines the close link between mixed models and ridge regression. Our contribution is two-fold. First, we propose estimating genomic heritability using a predictive approach via ridge regression and Generalized Cross Validation (GCV). We show that this is consistent with classical mixed model based estimation. Second, we derive simple formulae that express prediction accuracy as a function of the ratio np , where n is the population size and p the total number of SNPs. These formulae clearly show that a high heritability does not imply an accurate prediction when p > n. Both the estimation of heritability via GCV and the prediction accuracy formulae are validated using simulated data and real data from UK Biobank.

Suggestions

Du même auteur

Genome-wide haplotype association study in imaging genetics using whole-brain sulcal openings of 16,304 UK Biobank subjects

Archive ouverte | Karkar, Slim | CCSD

International audience. Neuroimaging-genetics cohorts gather two types of data: brain imaging and genetic data. They allow the discovery of associations between genetic variants and brain imaging features. They are ...

Multivariate haplotype analysis of 96 sulci opening for 15,612 UK-Biobank subjects

Archive ouverte | Karkar, S. | CCSD

International audience. Imaging genetic studies of large control cohorts such as UK Biobank enable to assess the range of normal variations in brain structures. Previous studies by our group have shown that the widt...

Identification of risk loci for primary aldosteronism in genome-wide association studies

Archive ouverte | Le Floch, Edith | CCSD

International audience. Abstract Primary aldosteronism affects up to 10% of hypertensive patients and is responsible for treatment resistance and increased cardiovascular risk. Here we perform a genome-wide associat...

Chargement des enrichissements...