Genome-enabled prediction through machine learning... - BV FAPESP
Busca avançada
Ano de início
Entree
(Referência obtida automaticamente do Web of Science, por meio da informação sobre o financiamento pela FAPESP e o número do processo correspondente, incluída na publicação pelos autores.)

Genome-enabled prediction through machine learning methods considering different levels of trait complexity

Texto completo
Autor(es):
Barbosa, Ivan de Paiva [1] ; da Silva, Michele Jorge [1] ; da Costa, Weverton Gomes [1] ; de Castro Sant'Anna, Isabela [2] ; Nascimento, Moyses [3] ; Cruz, Cosme Damiao [1]
Número total de Autores: 6
Afiliação do(s) autor(es):
[1] Fed Univ Vicosa UFV, Dept Gen Biol, Bioinformat Lab, Vicosa, MG - Brazil
[2] Agron Inst IAC, Rubber Tree & Agroforestry Ctr, Votuporana, SP - Brazil
[3] Fed Univ Vicosa UFV, Dept Stat, Lab Computat Intelligence & Stat Learning, Vicosa, MG - Brazil
Número total de Afiliações: 3
Tipo de documento: Artigo Científico
Fonte: CROP SCIENCE; v. 61, n. 3, p. 1890-1902, MAY 2021.
Citações Web of Science: 1
Resumo

Genomic-wide selection (GWS) consists of the use of a large number of molecular markers for the prediction of genetic values and has been shown to be highly relevant for genetic improvement. The objective of this work was to evaluate and compare the predictive performance of statistical (ridge regression-best linear unbiased predictor {[}RR-BLUP] and BayesB) and machine learning methods through GWS in simulated populations with traits presenting different levels of heritability and quantitative trait loci (QTL) numbers in the presence of dominant and epistatic effects. The simulated genome of population F-2 was formed by 1,000 individuals and genotyped with 2,010 single nucleotide polymorphism (SNP) markers. Twenty-six traits were simulated considering QTL numbers ranging from two to 88 and heritabilities of .3 and .6. The selective and predictive performances were evaluated using the multilayer perceptron (MLP), radial basis function (RBF), decision trees (DT), bagging (BA), random forest (RF), and boosting (BO) machine learning models and the classical RR-BLUP and BayesB methods. A high effect of heritability was observed for the results of selective accuracy when compared to the increased QTL number. In addition, the selective accuracy based on the number of QTL demonstrates that the application of alternative machine learning models, such as RBF, BA, BO, and RF, can be suitable for the analysis according to QTL number. Machine learning methods are powerful tools for predicting genetic values with epistatic gene control in traits with different degrees of heritability and different numbers of controlling genes. (AU)

Processo FAPESP: 18/26408-0 - Diversidade genética e caracterização de descritores de seringueira
Beneficiário:Isabela de Castro Sant'Anna
Modalidade de apoio: Bolsas no Brasil - Pós-Doutorado