Machine learning applied to transcriptomic data to identify genes associated with feed efficiency in pigs

Open archive

Piles, Miriam | Fernandez-Lozano, Carlos | Velasco-Galilea, María | González-Rodríguez, Olga | Sánchez, Juan Pablo | Torrallardona, David | Ballester, Maria | Quintanilla, Raquel

Published by CCSD; BioMed Central

International audience.

Abstract

Background: To date, the molecular mechanisms that underlie residual feed intake (RFI) in pigs are unknown. Results from different genome-wide association studies and gene expression analyses are not always consistent. The aim of this research was to use machine learning to identify genes associated with feed efficiency (FE) using transcriptomic (RNA-Seq) data from pigs that are phenotypically extreme for RFI.

Methods: RFI was computed by considering within-sex regression on mean metabolic body weight, average daily gain, and average backfat gain. RNA-Seq analyses were performed on liver and duodenum tissue from 32 high and 33 low RFI pigs collected at 153 d of age. Machine-learning algorithms were used to predict RFI class based on gene expression levels in liver and duodenum after adjusting for batch effects. Genes were ranked according to their contribution to the classification using the permutation accuracy importance score in an unbiased random forest (RF) algorithm based on conditional inference. Support vector machine, RF, elastic net (ENET) and nearest shrunken centroid algorithms were tested using different subsets of the top rank genes. Nested resampling for hyperparameter tuning was implemented with tenfold cross-validation in the outer and inner loops.

Results: The best classification was obtained with ENET using the expression of 200 genes in liver [area under the receiver operating characteristic curve (AUROC): 0.85; accuracy: 0.78] and 100 genes in duodenum (AUROC: 0.76; accuracy: 0.69). Canonical pathways and candidate genes that were previously reported as associated with FE in several species were identified. The most remarkable pathways and genes identified were NRF2-mediated oxidative stress response and aldosterone signalling in epithelial cells, the DNAJC6, DNAJC1, MAPK8, PRKD3 genes in duodenum, and melatonin degradation II, PPARα/RXRα activation, and GPCR-mediated nutrient sensing in enteroendocrine cells and SMOX, IL4I1, PRKAR2B, CLOCK and CCK genes in liver.

Conclusions: ML algorithms and RNA-Seq expression data were found to provide good performance for classifying pigs into high or low RFI groups. Classification was better with gene expression data from liver than from duodenum. Genes associated with FE in liver and duodenum tissue that can be used as predictive biomarkers for this trait were identified.
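The nested resampling scheme mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline. The data are synthetic (32 versus 33 samples, mirroring only the group sizes), and the classifier is a plain nearest-centroid rule standing in for the algorithms named above. The tuned hyperparameter (number of genes kept) and every function name (`make_data`, `k_folds`, `fit_centroids`, `nested_cv`) are assumptions made for illustration. The inner loop selects the hyperparameter using training samples only, and the outer loop scores samples that played no part in that choice.

```python
import random


def make_data(seed=1, n_genes=50, n_informative=5, shift=1.5):
    """Synthetic expression matrix: 32 'high' (1) and 33 'low' (0) samples."""
    rng = random.Random(seed)
    y = [1] * 32 + [0] * 33
    X = [[rng.gauss(shift * label if j < n_informative else 0.0, 1.0)
          for j in range(n_genes)] for label in y]
    return X, y


def k_folds(indices, k, rng):
    """Shuffle the indices and split them into k nearly equal folds."""
    shuffled = list(indices)
    rng.shuffle(shuffled)
    return [shuffled[i::k] for i in range(k)]


def fit_centroids(X, y, n_genes):
    """Class centroids, restricted to the n_genes with the largest
    absolute difference between class means (illustrative ranking only)."""
    p = len(X[0])
    cent = {}
    for c in (0, 1):
        rows = [x for x, label in zip(X, y) if label == c]
        cent[c] = [sum(r[j] for r in rows) / len(rows) for j in range(p)]
    ranked = sorted(range(p), key=lambda j: -abs(cent[1][j] - cent[0][j]))
    return cent, ranked[:n_genes]


def score(model, x):
    """Positive when x is closer to the class-1 centroid than to class 0."""
    cent, genes = model
    d0 = sum((x[j] - cent[0][j]) ** 2 for j in genes)
    d1 = sum((x[j] - cent[1][j]) ** 2 for j in genes)
    return d0 - d1


def accuracy(scores, labels):
    hits = sum((s > 0) == (l == 1) for s, l in zip(scores, labels))
    return hits / len(labels)


def auroc(scores, labels):
    """Chance that a random positive outranks a random negative (ties 0.5)."""
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def nested_cv(X, y, grid, k_outer=10, k_inner=10, seed=0):
    rng = random.Random(seed)
    idx = list(range(len(y)))
    out_scores, out_labels = [], []
    for test in k_folds(idx, k_outer, rng):
        test_set = set(test)
        train = [i for i in idx if i not in test_set]
        inner = k_folds(train, k_inner, rng)

        def inner_accuracy(n_genes):
            # Mean validation accuracy over the inner folds.
            accs = []
            for val in inner:
                val_set = set(val)
                fit = [i for i in train if i not in val_set]
                m = fit_centroids([X[i] for i in fit], [y[i] for i in fit], n_genes)
                accs.append(accuracy([score(m, X[i]) for i in val],
                                     [y[i] for i in val]))
            return sum(accs) / len(accs)

        best = max(grid, key=inner_accuracy)
        # Refit on the whole outer training set with the chosen value.
        model = fit_centroids([X[i] for i in train], [y[i] for i in train], best)
        out_scores += [score(model, X[i]) for i in test]
        out_labels += [y[i] for i in test]
    # Held-out scores are pooled across outer folds before computing metrics.
    return auroc(out_scores, out_labels), accuracy(out_scores, out_labels)


if __name__ == "__main__":
    X, y = make_data()
    auc, acc = nested_cv(X, y, grid=[2, 5, 20])
    print(f"AUROC={auc:.2f} accuracy={acc:.2f}")
```

In this sketch, gene ranking is repeated inside every training split, so held-out samples never influence which genes are used. Pooling held-out scores across outer folds is a simplification; per-fold averaging is an equally common choice.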

Suggestions

By the same author

Disentangling the causal relationship between rabbit growth and cecal microbiota through structural equation models

Open archive | Mora, Mónica | CCSD

International audience. Abstract. Background: The effect of the cecal microbiome on growth of rabbits that were fed under different regimes has been studied previously. However, the term “effect” carries a causal meanin...

Use of Bayes factors to evaluate the effects of host genetics, litter and cage on the rabbit cecal microbiota

Open archive | Velasco-Galilea, María | CCSD

International audience. Abstract. Background: The rabbit cecum hosts and interacts with a complex microbial ecosystem that contributes to the variation of traits of economic interest. Although the influence of host gene...

Identification of transcriptional regulatory variants in pig duodenum, liver, and muscle tissues

Open archive | Crespo-Piazuelo, Daniel | CCSD

International audience. Background: In humans and livestock species, genome-wide association studies (GWAS) have been applied to study the association between variants distributed across the genome and a phenotype of...
