Advanced search
Start date
Betweenand
(Reference retrieved automatically from Web of Science through information on FAPESP grant and its corresponding number as mentioned in the publication by the authors.)

An empirical test of the relationship between the bootstrap and likelihood ratio support in maximum likelihood phylogenetic analysis

Full text
Author(s):
Machado, Denis Jacob [1, 2] ; de Luna Marques, Fernando Portella [3] ; Jimenez-Ferbans, Larry [4] ; Grant, Taran [3]
Total Authors: 4
Affiliation:
[1] Univ Sao Paulo, Programa Interunidades Posgrad Bioinformat, Rua Matao 1010, BR-05508090 Sao Paulo, SP - Brazil
[2] Univ N Carolina, Coll Comp & Informat, Dept Bioinformat & Genom, 9331 Robert D Snyder Rd, Charlotte, NC 28223 - USA
[3] Univ Sao Paulo, Inst Biociencias, Dept Zool, Tv 14, 101 Butanta, BR-05508090 Sao Paulo, SP - Brazil
[4] Univ Magdalena, Fac Ciencias Basicas, Santa Marta DTCH, Carrera 32 22-08, Magdalena 470004 - Colombia
Total Affiliations: 4
Document type: Journal article
Source: CLADISTICS; DEC 2021.
Web of Science Citations: 0
Abstract

In maximum likelihood (ML), the support for a clade can be calculated directly as the likelihood ratio (LR) or log-likelihood difference (S, LLD) of the best trees with and without the clade of interest. However, bootstrap (BS) clade frequencies are more pervasive in ML phylogenetics and are almost universally interpreted as measuring support. In addition to theoretical arguments against that interpretation, BS has several undesirable attributes for a support measure. For example, it does not vary in proportion to optimality or identify clades that are rejected by the evidence and can be overestimated due to missing data. Nevertheless, if BS is a reliable predictor of S, then it might be an efficient indirect method of measuring support-an attractive possibility, given the speed of many BS implementations. To assess the relationship between S and BS, we analyzed 106 empirical datasets retrieved from TreeBASE. Also, to evaluate the degree to which S and BS are affected by the number of replicates during suboptimal tree searches for S and pseudoreplicates during BS estimation, we randomly selected 5 of the 106 datasets and analyzed them using variable numbers of replicates and pseudoreplicates, respectively. The correlation between S and BS was extremely weak in the datasets we analyzed. Increasing the number of replicates during tree search decreased the estimated values of S for most clades, but the magnitude of change was small. In contrast, although increasing pseudoreplicates affected BS values for only approximately 40% of clades, values both increased and decreased, and they did so at much greater magnitudes. Increasing replicates/pseudoreplicates affected the rank order of clades in each tree for both S and BS. Our findings show decisively that BS is not an efficient indirect method of measuring support and suggest that even quite superficial searches to calculate S provide better estimates of support. (AU)

FAPESP's process: 18/15425-0 - A multi-disciplinary approach to the study of amphibian diversification: phase 2
Grantee:Taran Grant
Support Opportunities: Research Grants - Young Investigators Grants - Phase 2
FAPESP's process: 15/18654-2 - Whole-genome sequence of the eastern spadefoot toad, Scaphiopus holbrookii (Amphibia: Anura: Scaphiopodidade) and of the Maldonado redbelly toad, Melanophryniscus moreirae (Amphibia: Anura: Bufonidae)
Grantee:Denis Jacob Machado
Support Opportunities: Scholarships abroad - Research Internship - Doctorate
FAPESP's process: 13/05958-8 - Comparative Genomics and the Evolution of Amphibian Chemical Defense
Grantee:Denis Jacob Machado
Support Opportunities: Scholarships in Brazil - Doctorate
FAPESP's process: 12/10000-5 - A multi-disciplinary approach to the study of amphibian diversification
Grantee:Taran Grant
Support Opportunities: Research Grants - Young Investigators Grants