Treffer: Deep learning and genomic best linear unbiased prediction integration: An approach to identify potential nonlinear genetic relationships between traits

Title:
Deep learning and genomic best linear unbiased prediction integration: An approach to identify potential nonlinear genetic relationships between traits
Contributors:
Génétique Animale et Biologie Intégrative (GABI), AgroParisTech-Université Paris-Saclay-Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement (INRAE), Eliance, Mathématiques et Informatique Appliquées (MIA Paris-Saclay), Génétique Quantitative et Evolution - Le Moulon (Génétique Végétale) (GQE-Le Moulon), AgroParisTech-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement (INRAE)
Source:
Journal of Dairy Science. 108(6):6174-6189
Publisher Information:
CCSD, 2025.
Publication Year:
2025
Collection:
collection:AGROPARISTECH
collection:CNRS
collection:MIA-PARIS
collection:GQE
collection:UNIV-PARIS-SACLAY
collection:AGREENIUM
collection:INRAE
collection:UNIVERSITE-PARIS-SACLAY
collection:GENETIQUE_ANIMALE
collection:GS-MATHEMATIQUES
collection:GS-COMPUTER-SCIENCE
collection:GS-BIOSPHERA
collection:GS-LIFE-SCIENCES-HEALTH
collection:GABI
collection:MATHNUM
collection:RESEAU-EAU
collection:SAPS
collection:DIGIT-BIO
collection:BIOLOGIE_ET_AMELIORATION_DES_PLANTES
collection:TEST-MATHNUM
Original Identifier:
HAL: hal-05218863
Document Type:
Zeitschrift article<br />Journal articles
Language:
English
ISSN:
0022-0302
Relation:
info:eu-repo/semantics/altIdentifier/doi/10.3168/jds.2024-26057
DOI:
10.3168/jds.2024-26057
Rights:
info:eu-repo/semantics/OpenAccess
URL: http://creativecommons.org/licenses/by/
Accession Number:
edshal.hal.05218863v1
Database:
HAL

Weitere Informationen

Genomic prediction (GP) aims to predict the breeding values of multiple complex traits, usually assumed to be multivariate normally distributed by the largely used statistical methods, thus imposing linear genetic relationships between traits. Although these methods are valuable for GP, they do not account for potential nonlinear genetic relationships between traits in scenarios. For individual traits, this oversight may minimally affect prediction accuracy, but it can limit genetic progress when selection involves multiple traits. Deep learning (DL) offers a promising alternative for capturing nonlinear genetic relationships due to its ability to identify complex patterns without prior assumptions about the data structure. We proposed a novel hybrid model that that combines both DL and GBLUP (DLGBLUP), which uses the output of the traditional GBLUP, and enhances its predicted genetic values (PGV) by accounting for nonlinear genetic relationships between traits using DL. We simulated data with linear and nonlinear genetic relationships between traits in order to verify whether DLGBLUP was able to identify nonlinearity when present and avoid inducing it when absent. We found that DLGBLUP consistently provided more accurate PGV for traits simulated with strong nonlinear genetic relationships, accurately identifying these relationships. Over 7 generations of selection, a greater genetic progress was achieved with PGV that accounted for nonlinear relationships (DLGBLUP), compared with GBLUP. When applied to a real dataset from the French Holstein dairy cattle population, DLGBLUP detected nonlinear genetic relationships between pairs of traits, such as conception rate and protein content, and SCC and fat yield, although, no significant increase in prediction accuracy was observed. The integration of DL into GP enabled the modeling of nonlinear genetic relationships between traits, a possibility not previously discussed, given the linear nature of GBLUP. The detection of nonlinear genetic relationships between traits in the French Holstein population when using DLGBLUP indicates the presence of such relationships in real breeding data, suggesting that it may be relevant to further explore nonlinear relationships. This possibility of nonlinear genetic relationships between traits offers a different perspective into multitrait evaluations, with potential to further improve selection strategies in commercial livestock breeding programs. This is particularly relevant when integrating new traits into multitrait evaluations or incorporating new subpopulations, which may introduce different forms of nonlinearity. Finally, it is shown that DL can be used as a complement to the statistical methods deployed in routine genetic evaluations, rather than as an alternative, by enhancing their performance.