Busca avançada
Ano de início
Entree
(Referência obtida automaticamente do Web of Science, por meio da informação sobre o financiamento pela FAPESP e o número do processo correspondente, incluída na publicação pelos autores.)

An empirical test of the relationship between the bootstrap and likelihood ratio support in maximum likelihood phylogenetic analysis

Texto completo
Autor(es):
Machado, Denis Jacob [1, 2] ; de Luna Marques, Fernando Portella [3] ; Jimenez-Ferbans, Larry [4] ; Grant, Taran [3]
Número total de Autores: 4
Afiliação do(s) autor(es):
[1] Univ Sao Paulo, Programa Interunidades Posgrad Bioinformat, Rua Matao 1010, BR-05508090 Sao Paulo, SP - Brazil
[2] Univ N Carolina, Coll Comp & Informat, Dept Bioinformat & Genom, 9331 Robert D Snyder Rd, Charlotte, NC 28223 - USA
[3] Univ Sao Paulo, Inst Biociencias, Dept Zool, Tv 14, 101 Butanta, BR-05508090 Sao Paulo, SP - Brazil
[4] Univ Magdalena, Fac Ciencias Basicas, Santa Marta DTCH, Carrera 32 22-08, Magdalena 470004 - Colombia
Número total de Afiliações: 4
Tipo de documento: Artigo Científico
Fonte: CLADISTICS; DEC 2021.
Citações Web of Science: 0
Resumo

In maximum likelihood (ML), the support for a clade can be calculated directly as the likelihood ratio (LR) or log-likelihood difference (S, LLD) of the best trees with and without the clade of interest. However, bootstrap (BS) clade frequencies are more pervasive in ML phylogenetics and are almost universally interpreted as measuring support. In addition to theoretical arguments against that interpretation, BS has several undesirable attributes for a support measure. For example, it does not vary in proportion to optimality or identify clades that are rejected by the evidence and can be overestimated due to missing data. Nevertheless, if BS is a reliable predictor of S, then it might be an efficient indirect method of measuring support-an attractive possibility, given the speed of many BS implementations. To assess the relationship between S and BS, we analyzed 106 empirical datasets retrieved from TreeBASE. Also, to evaluate the degree to which S and BS are affected by the number of replicates during suboptimal tree searches for S and pseudoreplicates during BS estimation, we randomly selected 5 of the 106 datasets and analyzed them using variable numbers of replicates and pseudoreplicates, respectively. The correlation between S and BS was extremely weak in the datasets we analyzed. Increasing the number of replicates during tree search decreased the estimated values of S for most clades, but the magnitude of change was small. In contrast, although increasing pseudoreplicates affected BS values for only approximately 40% of clades, values both increased and decreased, and they did so at much greater magnitudes. Increasing replicates/pseudoreplicates affected the rank order of clades in each tree for both S and BS. Our findings show decisively that BS is not an efficient indirect method of measuring support and suggest that even quite superficial searches to calculate S provide better estimates of support. (AU)

Processo FAPESP: 18/15425-0 - Uma abordagem multidisciplinar para o estudo da diversificação dos anfíbios: fase 2
Beneficiário:Taran Grant
Modalidade de apoio: Auxílio à Pesquisa - Jovens Pesquisadores - Fase 2
Processo FAPESP: 15/18654-2 - Sequenciamento genômico do sapo pé-de-pá-oriental, Scaphiopus holbrookii (Amphibia: Anura: Scaphiopodidade) e do sapo flamenguinho, Melanophryniscus moreirae (Amphibia: Anura: Bufonidae)
Beneficiário:Denis Jacob Machado
Modalidade de apoio: Bolsas no Exterior - Estágio de Pesquisa - Doutorado
Processo FAPESP: 13/05958-8 - Genômica Comparada e a Evolução da Defesa Química em Anfíbios
Beneficiário:Denis Jacob Machado
Modalidade de apoio: Bolsas no Brasil - Doutorado
Processo FAPESP: 12/10000-5 - Uma abordagem multidisciplinar para o estudo da diversificação de anfíbios
Beneficiário:Taran Grant
Modalidade de apoio: Auxílio à Pesquisa - Jovens Pesquisadores