Reliable Confidence Intervals for RelTime Estimates of Evolutionary Divergence Times (original) (raw)

Journal Article

,

Institute for Genomics and Evolutionary Medicine, Temple University

, Philadelphia, PA

Department of Biology, Temple University

, Philadelphia, PA

Search for other works by this author on:

,

Department of Biological Sciences, Tokyo Metropolitan University

, Hachioji, Tokyo,

Japan

Research Center for Genomics and Bioinformatics, Tokyo Metropolitan University

, Hachioji, Tokyo,

Japan

Search for other works by this author on:

,

Department of Genetics, Federal University of Rio de Janeiro

, Rio de Janeiro,

Brazil

Search for other works by this author on:

Institute for Genomics and Evolutionary Medicine, Temple University

, Philadelphia, PA

Department of Biology, Temple University

, Philadelphia, PA

Center for Excellence in Genome Medicine and Research, King Abdulaziz University

, Jeddah, Saudi Arabia

Search for other works by this author on:

Qiqing Tao and Koichiro Tamura contributed equally to this work.

Author Notes

Published:

22 October 2019

Cite

Qiqing Tao, Koichiro Tamura, Beatriz Mello, Sudhir Kumar, Reliable Confidence Intervals for RelTime Estimates of Evolutionary Divergence Times, Molecular Biology and Evolution, Volume 37, Issue 1, January 2020, Pages 280–290, https://doi.org/10.1093/molbev/msz236
Close

Navbar Search Filter Mobile Enter search term Search

Abstract

Confidence intervals (CIs) depict the statistical uncertainty surrounding evolutionary divergence time estimates. They capture variance contributed by the finite number of sequences and sites used in the alignment, deviations of evolutionary rates from a strict molecular clock in a phylogeny, and uncertainty associated with clock calibrations. Reliable tests of biological hypotheses demand reliable CIs. However, current non-Bayesian methods may produce unreliable CIs because they do not incorporate rate variation among lineages and interactions among clock calibrations properly. Here, we present a new analytical method to calculate CIs of divergence times estimated using the RelTime method, along with an approach to utilize multiple calibration uncertainty densities in dating analyses. Empirical data analyses showed that the new methods produce CIs that overlap with Bayesian highest posterior density intervals. In the analysis of computer-simulated data, we found that RelTime CIs show excellent average coverage probabilities, that is, the actual time is contained within the CIs with a 94% probability. These developments will encourage broader use of computationally efficient RelTime approaches in molecular dating analyses and biological hypothesis testing.

Introduction

Reliable inference of the confidence intervals (CIs) around the estimates of divergence times is essential for testing biological hypotheses (Burbrink and Pyron 2008; Kumar and Hedges 2016). Multiple sources contribute to the uncertainty of molecular divergence time estimates (Rannala and Yang 2007; Zhu et al. 2015; Kumar and Hedges 2016). One of them is the error associated with branch length estimation in a phylogeny due to the limited number of sites and substitutions in the sequence alignment (Kumar and Hedges 2016; Warnock et al. 2017). The stochastic nature of the substitution process (e.g., Poisson process) and the uncertainty involved in accounting for the unobserved substitutions (multiple-hit correction) result in errors in branch length estimates, which lead to imprecise time estimates (Kumar and Hedges 2016). However, these errors decrease with an increase in the number of sampled sites (Rannala and Yang 2007; dos Reis and Yang 2013; Zhu et al. 2015) and become negligible for large phylogenomic data sets.

The second source of error is the variation of evolutionary rates among branches and lineages (Zhu et al. 2015; Kumar and Hedges 2016). Because rates and times are confounded, the variation of rates will naturally result in uncertainty of time estimates (Ho 2014; Zhu et al. 2015). This confounding effect cannot be eliminated by sampling more sites or genes in a data set (Zhu et al. 2015; Kumar and Hedges 2016), so it contributes more uncertainty to time estimates than errors in branch length estimation for a large data set. The uncertainty associated with clock calibrations due to the equivocal nature of fossil record presents a third source of error in divergence time estimation (Zhu et al. 2015; dos Reis et al. 2016; Warnock et al. 2017). The exact placement of fossil record in a phylogeny and the correct assignment of calibration constraints, especially the maximum constraint, are often difficult to justify, resulting in high uncertainty in the estimation of divergence time (Bromham et al. 2018).

In Bayesian analyses, the highest posterior density (HPD) intervals usually represent the uncertainty of inferred divergence times (Drummond et al. 2006). Bayesian methods compute HPD intervals directly from the density distribution of posterior times estimated using priors for branch rate heterogeneity, substitution process, and fossil calibrations (dos Reis et al. 2016; Bromham et al. 2018), so sources contributing to the uncertainties of time estimates are automatically incorporated in the HPD intervals. Currently, Bayesian HPD intervals are considered reliable estimates of uncertainties surrounding divergence time estimates, although they are not always the same as the 95% CIs in the frequentist statistics (Jaynes and Kempthorne 1976; MacKenzie et al. 2017). Unfortunately, the enormous computational burden imposed by Bayesian approaches has hindered their applications to analyze many phylogenomic data sets (e.g., Pyron 2014; Li et al. 2019).

In contrast, non-Bayesian methods can analyze large-scale data sets quickly and generate accurate time estimates (Smith and O’Meara 2012; Tamura et al. 2012, 2018). Unfortunately, the broad utility of these methods is still reduced by a lack of reliable calculation of the uncertainty surrounding divergence times, which are represented by CIs. Non-Bayesian approaches require the use of analytical formulations or bootstrap approaches to estimate CIs (Sanderson 2003; Xia and Yang 2011; Tamura et al. 2013). However, site-resampling bootstrap approaches do not capture the error caused by rate heterogeneity, leading to false precisions of time estimates. Recognizing the need for incorporating lineage rate variation into CIs, Tamura et al. (2013) formulated analytical equations for the RelTime method, a non-Bayesian approach that relaxes the molecular clock. However, this approach may overestimate the amount of variance and produce overly wide CIs (see below), resulting in low power for statistical testing (Kumar and Hedges 2016).

Bayesian and non-Bayesian methods also use different strategies to account for the uncertainty of fossil record. Non-Bayesian methods are currently limited to the use of minimum boundaries only, maximum boundaries only, or minimum and maximum boundary pairs as calibration constraints (Sanderson 2003; Tamura et al. 2013), whereas Bayesian methods allow the usage of probability densities as calibrations and automatically accommodate interactions among them (Inoue et al. 2010; Ho and Duchêne 2014). Mello et al. (2017) presented a simple procedure to derive minimum and maximum boundaries from the density distributions, but this strategy does not consider interactions among calibrations and may lead to overestimation of the variance of divergence times (see below).

Here, we present an analytical approach to estimate CIs for divergence times using the RelTime method. The new analytical approach accounts for the variance associated with the branch lengths estimation as well as the variance due to rate heterogeneity in CI calculation. We also present a simple approach to derive minimum and maximum boundaries from multiple calibration densities such that the calibration interactions are accommodated. Both approaches have been implemented in the MEGA X software for use in graphical and command-line interfaces (Kumar et al. 2012, 2018). The 95% CIs produced by RelTime in empirical analyses are compared with the 95% HPD intervals produced by Bayesian methods to examine the performance of the new approaches. The approaches presented here may be used, with modifications, to improve variance calculation of time estimates for other non-Bayesian methods, for example, penalized likelihood methods (Sanderson 2002).

New Methods

An Analytical Method to Estimate CIs

Considering a tree with three ingroup sequences (fig. 1), relative time (t) for each node and relative rate (r) for each lineage are functions of branch lengths (b) in RelTime, for example, _r_1, _r_2, _r_3, _r_4, _t_4, and _t_5 are given by the following equations when the geometric means are used (similar equations can be derived when the arithmetic mean is used) (Tamura et al. 2018):

An evolutionary tree of three tips showing branch lengths (bj’s), node times (ti’s), branch rates (rj’s), and a lineage length (l4).

Fig. 1.

An evolutionary tree of three tips showing branch lengths (_bj_’s), node times (_ti_’s), branch rates (_rj_’s), and a lineage length (_l_4).

The variance of the estimated time (ti) for node i, denoted by vti⁠, can be estimated by the delta method, assuming that there is no covariance among branch lengths (_bj_’s):

vti=∑jN∂ftib1,…,bN∂bj2vbj,

(7)

where N is the total number of branches, fti(b1,…,bN) stands for the analytical function of _bj_’s to compute ti (e.g., eqs. 5 and 6 for _t_4 and _t_5, respectively), and vbj stands for the variance of branch length for branch j. Therefore, vbj is required for computing vti⁠.

As mentioned before, the uncertainty of time is related to the number of sampling sites and the degree of rate heterogeneity. We consider the total variance of branch lengths, vbj⁠, which is required to compute vti⁠, as a summation of the variance due to site sampling, vSbj⁠, and the variance due to rate heterogeneity, vRbj⁠:

The value of vS(bj) can be estimated by using analytical formulations or a site-resampling approach. For example, an approximate estimate of this variance can be obtained by the curvature method when the maximum likelihood method is used (Edwards 1992; Tamura et al. 2013).

However, it is more complex to estimate vR(bj)⁠, so we do it indirectly. We first compute the variance of observed evolutionary rates for all the lineages, Vobs(R)⁠:

where R is a random variable representing all relative rates, rj is the relative rate for each branch j, and r- is the average of _rj_’s. It is important to note that the relative rate for branch j is estimated as the relative rate for lineage j (Tamura et al. 2018). For example, RelTime computes the relative rate for lineage _l_4 as the geometric mean of _r_1 and _r_2, which is assigned to be the rate for _b_4 in figure 1.

The variance of observed rates includes not only the variance introduced by rate heterogeneity, RVR⁠, but also the sampling variance associated with the branch length estimation, SV(R)⁠, because the observed relative rate rj is calculated from branch lengths (_bj_’s) (e.g., eqs. 1–4). So,

The value of SVR is obtained by summing the sampling variance of relative rate rj for each branch j, denoted by sv(rj)⁠:

sv(rj) can be estimated by the delta method, assuming that there is no covariance among _bj_’s:

svrj=∑jN∂frjb1,…,bN∂bj2vSbj,

(12)

where frj(b1,…,bN) stands for the analytical function of _bj_’s to compute rj (e.g., eqs. 1, 2, 3, and 4 for _r_1, _r_2, _r_3, and _r_4, respectively).

Using [equations (9)–(12)](#E1 E2 E3 E4 E5 E6 E7 E8 E9 E10 E11 E12), we compute the variance introduced by rate heterogeneity:

RV(R)=1N∑jNrj - r-2- ∑jN∑jN∂frjb1,…,bN∂bj2vSbj.

(13)

Then, we can compute the rate heterogeneity variance for each branch j as being proportional to its branch length:

Using equations (8), (13), and (14), we can compute the total variance of branch length for branch j, denoted by vbj⁠. Then vbj can be used to compute the variance of time, vti⁠, by applying equation (7). For example, the variance of _t_4 and _t_5 are given by the following equations:

vt4=b2b3b1b2+2b4216b1b1b2+b43vb1+b1b3b1b2+2b4216b2b1b2+b43 vb2+ b1b24b3b1b2+b4vb3+b1b2b34b1b2+b43vb4,

(15)

vt5=b2b316b1b1b2+b4vb1+b1b316b2b1b2+b4vb2+b1b2+b44b3vb3+ b34b1b2+b4vb4.

(16)

For larger numbers of taxa, such analytical formulations become complicated to derive, especially for deeper nodes. Thus, we compute the variance of divergence times for deeper nodes from tips to the root recursively. For example, using equations (15) and (16), we can derive

vt5=b1b2+b42b1b2+2b42vt4+b4b3vb3+b3b4b1b2+b42vb4.

(17)

Therefore, the calculation of vt5 requires only vt4⁠, vb3⁠, and vb4⁠, which are the variances for _t_4 and branches _b_3 and _b_4, respectively. The variance of branches that do not directly connect to node 5, that is, vb1 and vb2 in this case (fig. 1) is not needed, if the value of vt4 is computed beforehand. Thus, for any node in a phylogeny, we can calculate the variance of divergence time recursively from tips to the root by using the variance of times for direct descendant and ancestral nodes and the variance of directly connected branches. This procedure extraordinarily simplifies the computation of the variance of inferred time for each internal node in a tree with a large number of taxa.

It is important to note that times in the equations listed above are relative times, not absolute times, because no calibrations are involved in the above equations. When one or multiple calibrations (minimum boundaries only, maximum boundaries only, or minimum and maximum boundary pairs) are given, RelTime will compute a global time factor (f) by altering relative times such that all calibration constraints are satisfied. When a range of f values can satisfy all calibration constraints, RelTime selects the midpoint of the range to be the best estimate of f. When one or more of the absolute times computed using the f value falls outside the calibration constraints, RelTime adjusts relative times and f such that the deviations of absolute times from the calibration constraints are minimized. This process requires local alteration of relative rates and reoptimization of all other node times in the tree recursively (Tamura et al. 2013). For example, if the minimum age constraint of a node is violated, that is, the age estimated using f is younger than the minimum constraint, RelTime decreases its estimate of the evolutionary rate proportionally in that lineage to adjust the age of this node to be older, such that the divergence time becomes the same as the minimum age constraint. The resulting slowdown is transmitted to all the descendant nodes, and it affects the ancestral rates as well.

Similarly, if the maximum age constraint of a node is violated, that is, the age estimated using f is older than the maximum constraint, RelTime increases the estimated evolutionary rate proportionally in that lineage such that the divergence time matches the maximum age constraint. The effects of this rate change will be transmitted to the descendant and ancestral nodes automatically. Consequently, RelTime will ensure that the absolute times for calibrated nodes are consistent with the user-desired calibration constraints.

In the final step, CIs are computed analytically using the final set of relative rates and the equations given above, such that the uncertainty associated with clock calibrations can be incorporated into the CI calculation in RelTime. If the lower or upper bounds of CIs fall outside the user-specified calibration constraints, then CIs are truncated based on the imposed calibration constraints. Therefore, RelTime uses “hard” minimum and maximum bounds in CI calculation, as in BEAST (Drummond et al. 2012; Bouckaert et al. 2014; Barba-Montoya et al. 2017).

An Approach to Derive Effective Calibration Boundaries from Calibration Densities

As stated above, calibration uncertainty is another critical source of estimation error in the inference of divergence times. Bayesian methods use various probability densities to accommodate the calibration uncertainty. However, the current non-Bayesian methods do not allow direct use of probability densities and do not provide provisions to incorporate interactions among calibration constraints. Therefore, we developed a new procedure for use in the RelTime method to derive calibration boundaries from probability densities that accounts for their interactions.

For each calibrated node with an associated probability density, we randomly sample two dates from the given probability density. We use these two sampled dates as the minimum and maximum (min–max) constraints for that node and derive such a min–max constraint for every node for which a probability density is specified. Then, we use all of these min–max boundaries to conduct RelTime analysis. We retain the RelTime time estimates only for the calibrated nodes, and then repeat the process of random sampling and dating for 10,000 times. A large number of iterations of this process ensure that calibration dates with tiny probabilities (0.01%) can be sampled.

The iterative procedure above produces a distribution of 10,000 inferred dates for each calibrated node. In the final step, we derive the minimum bound at 2.5% and the maximum bound at 97.5% of the distribution of inferred dates for each calibrated node. We refer to bounds derived during this process to be “effective bounds.” These effective bounds can be used together with the analytical approach described above to infer the divergence times and CIs in RelTime. It is important to note that effective bounds are used as calibration constraints, not densities. The actual shapes of the distribution of 10,000 inferred dates may vary slightly if one is to repeat 10,000 resamplings multiple times, but 2.5% and 97.5% boundaries of the distribution are expected to be stable, producing stable estimates of divergence times and CIs.

Our procedure is analogous to that in Bayesian methods, as both types of methods require resampling of calibration constraints from user-specified densities, inference of divergence times using each set of sampled calibrations, and summarization of distributions of time estimates obtained from using all sets of sampled calibrations. Therefore, the use of effective bounds allows RelTime to accommodate the interactions among calibration densities. However, it does not mean that RelTime and Bayesian methods are the same. Bayesian methods conduct calibration resampling and time inference steps simultaneously during the MCMC integration, whereas these steps are implemented sequentially in the RelTime method as proposed here.

We compared the effective bounds to calibration bounds derived using Mello et al. (2017)’s procedure (referenced as “Mello bounds” in the following) (fig. 2), in which the minimum bound was directly placed at 2.5% of the density age, and the maximum bound was placed at 97.5% of the density age. Effective bounds were similar to the Mello bounds when the user-specified calibration density was reliable and informative, which meant that the true age of a node fell in the calibration density with a high probability. For example, effective bounds and Mello bounds almost overlapped for Homo sapiens–Callithrix jacchus split in which an exponential distribution was used as the calibration (fig. 2b) (see the Materials and Methods section). When the user-specified density was uninformative, for example, a diffused uniform distribution, Mello bounds were often diffused and matched the original density (fig. 2c). In contrast, our new procedure generated narrower bounds due to the accommodation of the interactions among different calibration densities and constraints. These interactions reshaped the original, wider distribution and made it tighter (fig. 2c). Consequently, the use of effective bounds is likely to produce narrower CIs.

(a) A primate phylogeny with a user-specified uniform calibration density (gray shade) and an exponential calibration density (green shade). Red dots are the nodes shown in panels b–e. Effective bounds derived using our method (solid blue line) and bounds derived using Mello et al. (2017) procedure (solid orange line) are compared when user-specified calibrations are reliable (b and c) and when user-specified calibration of Homo sapiens–Callithrix jacchus split is unreliable (d and e). The dashed red line represents the “true simulated age.”

Fig. 2.

(a) A primate phylogeny with a user-specified uniform calibration density (gray shade) and an exponential calibration density (green shade). Red dots are the nodes shown in panels b_–_e. Effective bounds derived using our method (solid blue line) and bounds derived using Mello et al. (2017) procedure (solid orange line) are compared when user-specified calibrations are reliable (b and c) and when user-specified calibration of Homo sapiens–Callithrix jacchus split is unreliable (d and e). The dashed red line represents the “true simulated age.”

In our analysis, when the user-specified calibration was unreliable, that is, the true age of the node fell in its calibration density with a low probability, our effective bounds turned out to be better than Mello bounds. For example, when the true time of H. sapiens–C. jacchus split was located in the user-specified exponential density with <2.5% probability (fig. 2d), Mello bounds did not include the true time, resulting in incorrect time estimates. In contrast, our method did not ignore the low probability regions since it sampled 10,000 times from the user-specified density to ensure that dates with very low probabilities were considered. Thus, effective bounds are likely to contain the true time (fig. 2d), and the use of effective bounds in RelTime may improve the accuracy and precision of time estimates.

Results and Discussions

RelTime Produces CIs Comparable to Bayesian HPD Intervals in Empirical Analyses

We applied our methods to five empirical data sets containing nucleotide or protein sequence alignments from primates, spiders, insects, birds, and sun orchids (table 1). We first present results from the primate data set from Barba-Montoya et al. (2017), which contains a relatively small alignment of 9,361 base pairs from 9 primate species and 1 outgroup (fig. 2a and supplementary fig. S1a, Supplementary Material online). In this phylogeny, every internal node has been assigned a calibration density. Barba-Montoya et al. (2017) used two calibration strategies in MCMCTree (Yang 2007) and BEAST (Bouckaert et al. 2014) and compared the time estimates. We examined if the RelTime method produced estimates comparable to those obtained from Bayesian methods when all analyses employed the same alignment, phylogeny, substitution model, and calibration uncertainty densities (e.g., uniform distributions).

Table 1.

Empirical Data Sets Analyzed in This Article.

a

N, nucleotides; A, amino acids.

b

Sequence count excludes the outgroup taxa.

c

A Cauchy density distribution and exponential density distribution are used as the skewed density in MCMCTree and BEAST, respectively.

d

Software used in the original study for estimating divergence times. Both BEAST and BEAST 2 are referred to as BEAST here.

Table 1.

Empirical Data Sets Analyzed in This Article.

a

N, nucleotides; A, amino acids.

b

Sequence count excludes the outgroup taxa.

c

A Cauchy density distribution and exponential density distribution are used as the skewed density in MCMCTree and BEAST, respectively.

d

Software used in the original study for estimating divergence times. Both BEAST and BEAST 2 are referred to as BEAST here.

For analyses of primate data sets where uniform densities were used as calibrations, we observed a high concordance between RelTime and Bayesian time estimates. The linear regression slopes were 0.97 and 1.03 when Bayesian analyses were conducted in MCMCTree and BEAST, respectively (fig. 3a and b). This is a rather small difference. Although the width of RelTime CIs was slightly smaller than the width of Bayesian HPD intervals, RelTime CIs overlapped Bayesian HPD intervals for all the nodes (fig. 4a and b). For primate data sets where a mixture of uniform and skewed densities were used as calibrations, RelTime estimates were again similar to Bayesian estimates, with a linear regression slope of 0.96 with MCMCTree (fig. 3c) and 1.00 with BEAST estimates (fig. 3d). RelTime CIs overlapped with MCMCTree and BEAST HPD intervals for all the nodes (fig. 4c and d).

Comparisons of RelTime and Bayesian estimates of divergence times and the associated uncertainties for sequences from primates with uniform calibration densities (MCMCTree in panel a and BEAST in panel b), primates with uniform and skewed calibration densities (MCMCTree in panel c and BEAST in panel d), spiders (panel e), insects (panel f), birds (panel g) and sun orchids (panel h). Gray bars represent the Bayesian 95% HPD intervals (x-axis) and RelTime 95% CIs (y-axis). The black dashed line represents a 1:1 ratio. Each graph contains the slope and coefficient of determination (R2) values of the linear regression through the origin. Calibrated nodes are shown in green. The data set name inside each panel refers to table 1.

Fig. 3.

Comparisons of RelTime and Bayesian estimates of divergence times and the associated uncertainties for sequences from primates with uniform calibration densities (MCMCTree in panel a and BEAST in panel b), primates with uniform and skewed calibration densities (MCMCTree in panel c and BEAST in panel d), spiders (panel e), insects (panel f), birds (panel g) and sun orchids (panel h). Gray bars represent the Bayesian 95% HPD intervals (_x_-axis) and RelTime 95% CIs (_y_-axis). The black dashed line represents a 1:1 ratio. Each graph contains the slope and coefficient of determination (_R_2) values of the linear regression through the origin. Calibrated nodes are shown in green. The data set name inside each panel refers to table 1.

Comparisons of RelTime 95% CIs (dark red), MCMCTree 95% HPD intervals (gray), and BEAST 95% HPD intervals (blue). Dots are point estimates of divergence times. The data set name inside each panel refers to table 1 and panels a-h are described in figure 3.

Fig. 4.

Comparisons of RelTime 95% CIs (dark red), MCMCTree 95% HPD intervals (gray), and BEAST 95% HPD intervals (blue). Dots are point estimates of divergence times. The data set name inside each panel refers to table 1 and panels a-h are described in figure 3.

We then analyzed spider and insect data sets to examine the performance of our methods for larger data sets (>40 species and >50,000 sites). These data sets consisted of protein sequences and presented more extensive rate variation among branches and lineages as compared with the primate data set (supplementary fig. S1b and c, Supplementary Material online). Fewer calibrations were used in these data sets with 8 calibrated nodes in the spider data set and 38 calibrated nodes in the insect data set. This means that 20–26% nodes of the phylogeny were assigned calibration values. We again observed strong concordance between RelTime and Bayesian time estimates, with a linear slope of 0.98 and 0.98 for the spider and insect data set, respectively (fig. 3e and f). The high similarity between RelTime and Bayesian node times remained even after we excluded nodes on which user-specified calibrations were assigned (slope was 0.97 and 0.98 for the spider and insect data set, respectively). Although CIs produced by RelTime were slightly wider than HPD intervals produced by the Bayesian method, they were comparable with more than 97% of the nodes in spider and insect data sets showing overlapping CIs and HPD intervals (fig. 4e and f). When CIs and HPD intervals did not overlap, they were <5 My apart. Therefore, RelTime CIs can be considered similar to Bayesian HPD intervals for these two data sets.

Although it is becoming more common to apply many internal calibrations to empirical studies, researchers may only have a limited number of calibrations because reliable fossil record for most taxonomic groups is limited. So, we analyzed another two nucleotide sequence alignments in which only a few calibrations have been used. One of them is a bird phylogeny (supplementary fig. S1d, Supplementary Material online) containing 220 ingroup taxa with only 13 calibrations (i.e., 6% nodes are calibrated). Again, RelTime produced time estimates similar to Bayesian estimates, showing a linear slope of 1.03 with a slightly weaker linear relationship than seen for other data sets above (fig. 3g). CIs produced by RelTime overlapped with HPD intervals provided by the Bayesian method for all the nodes except for two (fig. 4g). For these two nodes, CIs and HPD intervals were <2 My apart. In the analysis of the sun orchid data set (supplementary fig. S1e, Supplementary Material online), in which 57 sequences were included, and only a single internal calibration was used, time estimates obtained from RelTime and Bayesian methods showed a good linear relationship as well (fig. 3h). Although RelTime generated slightly older time estimates than those from the Bayesian method, CIs and HPD intervals overlapped for all the nodes (fig. 4h).

These results suggest that the application of our analytical method for computing CI combined with the approach to derive effective bounds is likely to produce estimates of times and CIs compatible with Bayesian estimates for data sets with a small and large number of calibrations when the same alignment, phylogeny, and calibration densities are used. We found that, compared with the previous implementations in RelTime for estimating CIs (Tamura et al. 2013; Mello et al. 2017), our new methods effectively improved the CI inference, because the width of CIs was reduced by 33–64% in the analysis of empirical data. This reduction was seen for both the calibrated and uncalibrated nodes. We attribute this improvement to the fact that the new analytical method accounts for the rate heterogeneity better, and the effective bounds reflect calibration interactions and reshape the original diffused calibration densities to generate narrower CIs. Consequently, the precision of divergence time estimates is improved. However, it is essential to note that the use of incorrect calibration constraints or densities can significantly impact the precision of time estimates (Warnock et al. 2017). Therefore, one needs to examine the reliability of calibrations before conducting dating analyses (Andújar et al. 2014; Battistuzzi et al. 2015; Hedges et al. 2018).

RelTime CIs Show High Coverage Probabilities

We conducted RelTime and Bayesian (MCMCTree) analyses on a broad set of simulation data sets, containing small and large numbers of sequences, to compare the overall coverage probabilities, that is, the proportion of RelTime CIs and Bayesian HPD intervals that included the true divergence times. In these analyses, we used no ingroup calibrations to avoid confounding the effect of calibrations on the coverage probabilities of estimated CIs and HPD intervals (see the Methods and Materials section).

RelTime performed well in the analysis of data sets in which branch rates evolved under an independent branch rate (IBR) model, showing high coverage probabilities (≥95%) in both small and large simulated data sets (fig. 5a). The Bayesian analyses also performed well for IBR data sets, showing high overall coverage probability for data sets containing 50 sequences (97%) but slightly lower overall coverage probability for data sets containing 100 sequences (84%) (fig. 5a). Interestingly, the coverage probability of HPD intervals declined further for IBR data sets with 200 sequences (60%). We found that the true divergence times were located close to the boundaries of HPD intervals. In these cases, the average percent time difference between a true age and the nearest bound of respective HPD interval was 10%. RelTime showed an overall coverage probability of 94% for data sets containing 50 sequences and evolving with autocorrelated branch rates (ABR) in a phylogeny, whereas the Bayesian method showed a slightly lower coverage probability (78%; fig. 5b). Bayesian coverage probabilities declined for data sets containing 100 and 200 sequences (52% and 35%, respectively), and the average percent time difference between the true ages and their nearest HPD interval boundaries was large (20–45%). In contrast, RelTime maintained its high coverage probability for large data sets (≥95%).

The overall coverage probabilities of RelTime CIs and Bayesian HPD intervals produced by analyzing data sets with different numbers of sequences simulated under an (a) independent branch rate, IBR, model and (b) autocorrelated branch rate, ABR, model.

Fig. 5.

The overall coverage probabilities of RelTime CIs and Bayesian HPD intervals produced by analyzing data sets with different numbers of sequences simulated under an (a) independent branch rate, IBR, model and (b) autocorrelated branch rate, ABR, model.

We found that the posterior distributions of many nodes for ABR data sets were not normal-like and skewed, which means that the interpretation of HPD intervals is not the same as the CIs, which has been noted earlier (Jaynes and Kempthorne 1976; MacKenzie et al. 2017). It is also known that the tree prior assumed in Bayesian analyses can have an impact on the estimation of divergence times and HPD intervals (Heled and Drummond 2015; Ritchie et al. 2017; Bromham et al. 2018). This impact is more prominent when there is limited number of calibrations because the tree prior provides node age priors for nodes without calibrations (Barba-Montoya et al. 2017), which may explain the observed low coverage probabilities. We expect that the Bayesian method will perform better if more informative calibrations are applied because the tree prior becomes less critical, and informative calibrations reduce the uncertainty of time estimates. A more extensive analysis of this problem is beyond the scope of this article, but we plan to pursue it in the future.

Conclusions

Our new analytical method to estimate CIs, as well as the approach for deriving effective bounds, will now allow the use of more biological information, such as the rate variation among lineages and the probability density of calibrations, in 95% CIs inference. RelTime is computationally efficient, requiring only a fraction of the time and resources demanded by Bayesian approaches (Tamura et al. 2012, 2018). Results from the analysis of empirical nucleotide and protein sequence alignments containing small and large numbers of sequences and calibrations suggest that RelTime will serve as a reliable approach for dating the tree of life and conducting biological hypothesis testing. We also anticipate that our new analytical method and the new approach for utilizing calibration densities, with modifications, can be applied to generate CIs for other non-Bayesian dating methods, for example, penalized likelihood methods (Sanderson 2002).

Materials and Methods

Comparisons of User-Specified Calibration Density, Mello Bounds, and Effective Bounds

We used the BEAST-generated primate timetree published in Barba-Montoya et al. (2017) as the true tree (fig. 2a) and simulated an alignment of 9,361 sites under HKY + G (Hasegawa et al. 1985) model in SeqGen (Grassly et al. 1997) with parameters derived from the original empirical molecular data. Branch-specific rates were sampled from an uncorrelated lognormal distribution with a mean rate of 0.0069 substitutions per site per million years ago (Ma) and a standard deviation of 0.4 (log-scale). The simulated alignment was used to derive effective bounds.

We tested the performance of using effective bounds and Mello bounds under two calibration scenarios: reliable and unreliable scenarios. The reliable scenario represents the case where true ages of all the nodes are located in their calibration densities with high probabilities. The unreliable scenario refers to situations in which true ages of some nodes are found in calibration densities with low probabilities, whereas the true ages for some other nodes are found in calibration densities with high probabilities. An informative exponential density was used at H. sapiens–C. jacchus split (true age = 44.8 Ma) and an uninformative uniform density was used at H. sapiens–Otolemur gamettii split (true age = 68 Ma) under both scenarios. In the reliable calibration scenario, we assumed that a minimum age of 40 Ma at H. sapiens–C. jacchus split and maximum age of 130 Ma at H. sapiens–O. gamettii split were known. Therefore, we used an exponential density (mean = 4 Ma and offset = 40 Ma) and a uniform density (min = 40 Ma and max = 130 Ma) at H. sapiens–C. jacchus split and at H. sapiens–O. gamettii split, respectively. The true ages of both nodes located in their densities with high probabilities. Under the unreliable calibration scenario, we assumed that a minimum age of 30 Ma at H. sapiens–C. jacchus split and maximum age of 130 Ma at H. sapiens–O. gamettii split were known. Therefore, we used an exponential density (mean = 3 Ma and offset = 30 Ma) and a uniform density (min = 30 Ma and max = 130 Ma) at H. sapiens–C. jacchus split and at H. sapiens–O. gamettii split, respectively. This resulted in the true age of H. sapiens–C. jacchus split located in its density with a low probability (<2.5%), whereas the true age of H. sapiens–O. gamettii split located in its density with a high probability.

Empirical Analyses

We obtained empirical data sets that employed different calibration strategies from five published studies (table 1) (Bond et al. 2014; Tong et al. 2015; Barba-Montoya et al. 2017; Nauheimer et al. 2018; Oliveros et al. 2019). Molecular data were obtained from supplementary files of original studies. Calibration densities and Bayesian timetrees (including HPD intervals) were provided by authors or derived from the original studies, except for Bond et al. (2014)’s data, which was obtained from Mello et al. (2017). In RelTime analyses, we used the same alignments, substitution models, tree topologies, and calibration densities for ingroup nodes as the original studies to ensure comparability with Bayesian results. RelTime analyses were conducted in MEGA X (Kumar et al. 2018). For Oliveros et al. (2019)’s data, the published Bayesian timetree was summarized from ten timetrees inferred using ten different random subsets of the full data set, because BEAST (Drummond et al. 2012) was computationally infeasible to analyze the full data set. Since the original study has shown that ten subsets provided similar results, we only conducted RelTime analysis using one subset. We compared RelTime time estimates and CIs with Bayesian time estimates and HPD intervals. We did not test whether the slope between RelTime and Bayesian time estimates was one because the P value will always reject the hypothesis of the slope of one when the data sample size is large, which makes its use less meaningful (Halsey 2019; Wasserstein et al. 2019). To compare the performance of our methods and the previous CI calculation methods for RelTime, we reanalyzed all empirical data sets using Mello bounds and the Tamura et al. (2013)’s method, which was implemented in MEGA 7 (Kumar et al. 2012, 2016). All empirical data sets are available at https://github.com/cathyqqtao/RelTime-confidence-interval, last accessed October 25, 2019.

Simulation Analyses

We used the data sets simulated by Tamura et al. (2012), in which sequence alignments were generated using IBR and ABR models. In IBR cases, branch rates were sampled from a uniform distribution in the interval [0.5_r_, 1.5_r_], where r was the evolutionary rate derived from empirical genes. In ABR cases, branch rates were simulated using Kishino et al. (2001) model (lognormal distribution) with the initial rate of r and autocorrelation parameter v =0.1 (time unit = 100 My). GC contents, transition/transversion ratios, and sequence lengths were all derived from empirical genes and varied among data sets. Both ABR and IBR scenarios contained 100 simulated data sets, and each data set contained 400 ingroup sequences. Because the Bayesian method required a long runtime for analyzing a data set with 400 sequences, we subsampled 50, 100, and 200 sequences from the original full data sets and conducted RelTime and Bayesian analyses for these subsets.

To examine the performance of RelTime on simulated data sets, we used the minimum number of calibrations, in order to avoid the possibility that the use of many informative calibrations mediated the similarity of performance of RelTime and Bayesian methods. In the Bayesian analysis, we used MCMCTree and a single calibration at the root with a diffused uniform density (true age ± 50 My). The use of diffused density could reduce the impact of calibration on constraining the width of HPD intervals. We used 100 My as the time unit and “rgene_gamma = 2 10” and “sigma2_gamma = 2 20” as priors, so the prior values of mean rate, independent rate variation, and autocorrelation parameter were similar to the true values. The use of lognormal distribution as the rate model in Bayesian analyses was appropriate because the lognormal distribution fit the distribution of evolutionary rates for IBR and ABR data sets, although IBR data sets were simulated using a uniform distribution. We used “BDparas = 2 2 0.1” as the tree prior because it generated a uniform node age prior, and it was commonly used in practice (Yang 2006). Two independent runs of 100,000 generations each were conducted, and results were checked for good ESS values (>200) and convergence. In the RelTime analysis, we did not use any calibrations, so there was no calibration effect on constraining the width of CIs. However, because no calibration was used, RelTime provided relative times instead of absolution times. To make the fair comparison between RelTime and Bayesian results, we normalized the RelTime times (and CIs), Bayesian times (and HPD intervals), and true times to their ingroup root age, which was analogous to the case where the age of the ingroup crown node was fixed. We computed the coverage probability using these normalized times. The coverage probability of each node was the proportion of 100 data sets where the CI (or HPD interval) of this node contained the true time. The overall coverage probability was the mean value of the coverage probabilities of all the ingroup nodes.

Acknowledgments

We thank Dr Jose Barba-Montoya and Dr Simon Ho for sharing Bayesian results, and Mary Kathleen Durnan and Joy Wenslas for the technical support. We also thank Dr Sayaka Miura and Dr Jose Barba-Montoya for critical comments. This research was supported by grants from the National Institutes of Health (NIH GM0126567-02), National Science Foundation (NSF 1661218), and National Aeronautics and Space Administration (NASA NNX16AJ30G) to S.K., Brazilian Research Council (CNPq 233920/2014-5 and 409152/2018-8) to B.M., and Tokyo Metropolitan University (DB105) to K.T.

References

Andújar

C

,

Soria-Carrasco

V

,

Serrano

J

,

Gómez-Zurita

J.

2014

.

Congruence test of molecular clock calibration hypotheses based on Bayes factor comparisons

.

Methods Ecol Evol

.

5

(

3

):

226

242

.

Barba-Montoya

J

,

dos Reis

M

,

Yang

Z.

2017

.

Comparison of different strategies for using fossil calibrations to generate the time prior in Bayesian molecular clock dating

.

Mol Phylogenet Evol

.

114

:

386

400

.

Battistuzzi

FU

,

Billing-Ross

P

,

Murillo

O

,

Filipski

A

,

Kumar

S.

2015

.

A protocol for diagnosing the effect of calibration priors on posterior time estimates: a case study for the Cambrian explosion of animal phyla

.

Mol Biol Evol

.

32

(

7

):

1907

1912

.

Bond

JE

,

Garrison

NL

,

Hamilton

CA

,

Godwin

RL

,

Hedin

M

,

Agnarsson

I.

2014

.

Phylogenomics resolves a spider backbone phylogeny and rejects a prevailing paradigm for orb web evolution

.

Curr Biol

.

24

(

15

):

1765

1771

.

Bouckaert

R

,

Heled

J

,

Kühnert

D

,

Vaughan

T

,

Wu

C-H

,

Xie

D

,

Suchard

MA

,

Rambaut

A

,

Drummond

AJ.

2014

.

BEAST 2: a software platform for Bayesian evolutionary analysis

.

PLoS Comput Biol

.

10

(

4

):

e1003537.

Bromham

L

,

Duchêne

S

,

Hua

X

,

Ritchie

AM

,

Duchêne

DA

,

Ho

S.

2018

.

Bayesian molecular dating: opening up the black box

.

Biol Rev

.

93

(

2

):

1165

1191

.

Burbrink

FT

,

Pyron

RA.

2008

.

The taming of the skew: estimating proper confidence intervals for divergence dates

.

Syst Biol

.

57

(

2

):

317

328

.

dos Reis

M

,

Donoghue

PC

,

Yang

Z.

2016

.

Bayesian molecular clock dating of species divergences in the genomics era

.

Nat Rev Genet

.

17

(

2

):

71

80

.

dos Reis

M

,

Yang

Z.

2013

.

The unbearable uncertainty of Bayesian divergence time estimation

.

J Syst Evol

.

51

(

1

):

30

43

.

Drummond

AJ

,

Ho

SYW

,

Phillips

MJ

,

Rambaut

A.

2006

.

Relaxed phylogenetics and dating with confidence

.

PLoS Biol

.

4

(

5

):

e88

e99

.

Drummond

AJ

,

Suchard

MA

,

Xie

D

,

Rambaut

A.

2012

.

Bayesian phylogenetics with BEAUti and the BEAST 1.7

.

Mol Biol Evol

.

29

(

8

):

1969

1973

.

Edwards

AW.

1992

.

Likelihood

.

Baltimore (MD

):

Johns Hopkins University Press

.

Grassly

NC

,

Adachi

J

,

Rambaut

A.

1997

.

Seq-Gen: an application for the Monte Carlo simulation of protein sequence evolution along phylogenetic trees

.

Comput Appl Biosci

.

13

:

235

238

.

Halsey

LG.

2019

.

The reign of the p-value is over: what alternative analyses could we employ to fill the power vacuum?

Biol Lett

.

15

(

5

):

Hasegawa

M

,

Kishino

H

,

Yano

T.

1985

.

Dating of the human-ape splitting by a molecular clock of mitochondrial DNA

.

J Mol Evol

.

22

(

2

):

160

174

.

Hedges

SB

,

Tao

Q

,

Walker

M

,

Kumar

S.

2018

.

Accurate timetrees require accurate calibrations

.

Proc Natl Acad Sci U S A

.

115

(

41

):

E9510

E9511

.

Heled

J

,

Drummond

AJ.

2015

.

Calibrated birth-death phylogenetic time-tree priors for Bayesian inference

.

Syst Biol

.

64

(

3

):

369

383

.

Ho

SY.

2014

.

The changing face of the molecular evolutionary clock

.

Trends Ecol Evol

.

29

(

9

):

496

503

.

Ho

SY

,

Duchêne

S.

2014

.

Molecular-clock methods for estimating evolutionary rates and timescales

.

Mol Ecol

.

23

(

24

):

5947

5965

.

Inoue

J

,

Donoghue

PCJ

,

Yang

Z.

2010

.

The impact of the representation of fossil calibrations on Bayesian estimation of species divergence times

.

Syst Biol

.

59

(

1

):

74

89

.

Jaynes

ET

,

Kempthorne

O.

1976

. Confidence intervals vs Bayesian intervals. In:

Harper

WL

,

Hooker

CA

, editors.

Foundations of probability theory, statistical inference, and statistical theories of science

.

Dordrecht (the Netherlands

):

Springer

. p.

175

257

.

Kishino

H

,

Thorne

JL

,

Bruno

WJ.

2001

.

Performance of a divergence time estimation method under a probabilistic model of rate evolution

.

Mol Biol Evol

.

18

(

3

):

352

361

.

Kumar

S

,

Hedges

SB.

2016

.

Advances in time estimation methods for molecular data

.

Mol Biol Evol

.

33

(

4

):

863

869

.

Kumar

S

,

Stecher

G

,

Li

M

,

Knyaz

C

,

Tamura

K.

2018

.

MEGA X: molecular evolutionary genetics analysis across computing platforms

.

Mol Biol Evol

.

35

(

6

):

1547

1549

.

Kumar

S

,

Stecher

G

,

Peterson

D

,

Tamura

K.

2012

.

MEGA-CC: computing core of molecular evolutionary genetics analysis program for automated and iterative data analysis

.

Bioinformatics

28

(

20

):

2685

2686

.

Kumar

S

,

Stecher

G

,

Tamura

K.

2016

.

MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets

.

Mol Biol Evol

.

33

(

7

):

1870

1874

.

Li

H-T

,

Yi

T-S

,

Gao

L-M

,

Ma

P-F

,

Zhang

T

,

Yang

J-B

,

Gitzendanner

MA

,

Fritsch

PW

,

Cai

J

,

Luo

Y

, et al. .

2019

.

Origin of angiosperms and the puzzle of the Jurassic gap

.

Nat Plants

5

(

5

):

461

470

.

MacKenzie

DI

,

Nichols

JD

,

Royle

JA

,

Pollock

KH

,

Bailey

L

,

Hines

JE.

2017

.

Occupancy estimation and modeling: inferring patterns and dynamics of species occurrence

. 2nd ed.

Burlington (VT

):

Academic Press

.

Mello

B

,

Tao

Q

,

Tamura

K

,

Kumar

S.

2017

.

Fast and accurate estimates of divergence times from big data

.

Mol Biol Evol

.

34

(

1

):

45

50

.

Nauheimer

L

,

Schley

RJ

,

Clements

MA

,

Micheneau

C

,

Nargar

K.

2018

.

Australasian orchid biogeography at continental scale: molecular phylogenetic insights from the Sun Orchids (Thelymitra, Orchidaceae)

.

Mol Phylogenet Evol

.

127

:

304

319

.

Oliveros

CH

,

Field

DJ

,

Ksepka

DT

,

Barker

FK

,

Aleixo

A

,

Andersen

MJ

,

Alström

P

,

Benz

BW

,

Braun

EL

,

Braun

MJ

, et al. .

2019

.

Earth history and the passerine superradiation

.

Proc Natl Acad Sci U S A

.

116

(

16

):

7916

7925

.

Pyron

RA.

2014

.

Biogeographic analysis reveals ancient continental vicariance and recent oceanic dispersal in amphibians

.

Syst Biol

.

63

(

5

):

779

797

.

Rannala

B

,

Yang

Z.

2007

.

Inferring speciation times under an episodic molecular clock

.

Syst Biol

.

56

(

3

):

453

466

.

Ritchie

AM

,

Lo

N

,

Ho

S.

2017

.

The impact of the tree prior on molecular dating of data sets containing a mixture of inter- and intraspecies sampling

.

Syst Biol

.

66

(

3

):

413

425

.

Sanderson

MJ.

2002

.

Estimating absolute rates of molecular evolution and divergence times: a penalized likelihood approach

.

Mol Biol Evol

.

19

(

1

):

101

109

.

Sanderson

MJ.

2003

.

r8s: inferring absolute rates of molecular evolution and divergence times in the absence of a molecular clock

.

Bioinformatics

19

(

2

):

301

302

.

Smith

SA

,

O’Meara

BC.

2012

.

treePL: divergence time estimation using penalized likelihood for large phylogenies

.

Bioinformatics

28

(

20

):

2689

2690

.

Tamura

K

,

Battistuzzi

FU

,

Billing-Ross

P

,

Murillo

O

,

Filipski

A

,

Kumar

S.

2012

.

Estimating divergence times in large molecular phylogenies

.

Proc Natl Acad Sci U S A

.

109

(

47

):

19333

19338

.

Tamura

K

,

Stecher

G

,

Peterson

D

,

Filipski

A

,

Kumar

S.

2013

.

MEGA6: molecular evolutionary genetics analysis version 6.0

.

Mol Biol Evol

.

30

(

12

):

2725

2729

.

Tamura

K

,

Tao

Q

,

Kumar

S.

2018

.

Theoretical foundation of the RelTime method for estimating divergence times from variable evolutionary rates

.

Mol Biol Evol

.

35

:

1170

1182

.

Tong

KJ

,

Duchêne

S

,

Ho

SY

,

Lo

N.

2015

.

Comment on “Phylogenomics resolves the timing and pattern of insect evolution.”

Science

349

(

6247

):

487

487

.

Warnock

RCM

,

Yang

Z

,

Donoghue

P.

2017

.

Testing the molecular clock using mechanistic models of fossil preservation and molecular evolution

.

Proc R Soc B

284

(

1857

):

Wasserstein

RL

,

Schirm

AL

,

Lazar

NA.

2019

.

Moving to a world beyond “p< 0.05.”

Am Stat

.

73

:

1

19

.

Xia

X

,

Yang

Q.

2011

.

A distance-based least-square method for dating speciation events

.

Mol Phylogenet Evol

.

59

(

2

):

342

353

.

Yang

Z.

2006

.

Computational molecular evolution

.

Oxford

:

Oxford University Press

.

Yang

Z.

2007

.

PAML 4: phylogenetic analysis by maximum likelihood

.

Mol Biol Evol

.

24

(

8

):

1586

1591

.

Zhu

T

,

dos Reis

M

,

Yang

Z.

2015

.

Characterization of the uncertainty of divergence time estimation under relaxed molecular clock models using multiple loci

.

Syst Biol

.

64

(

2

):

267

280

.

Author notes

Qiqing Tao and Koichiro Tamura contributed equally to this work.

© The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com

Associate Editor: Bing Su

Search for other works by this author on:

Supplementary data

Citations

Views

Altmetric

Metrics

Total Views 3,166

2,268 Pageviews

898 PDF Downloads

Since 10/1/2019

Month: Total Views:
October 2019 82
November 2019 93
December 2019 89
January 2020 179
February 2020 130
March 2020 71
April 2020 55
May 2020 38
June 2020 51
July 2020 39
August 2020 55
September 2020 44
October 2020 30
November 2020 36
December 2020 53
January 2021 63
February 2021 71
March 2021 36
April 2021 99
May 2021 42
June 2021 60
July 2021 54
August 2021 60
September 2021 34
October 2021 44
November 2021 41
December 2021 29
January 2022 45
February 2022 58
March 2022 46
April 2022 58
May 2022 33
June 2022 25
July 2022 42
August 2022 56
September 2022 41
October 2022 69
November 2022 36
December 2022 42
January 2023 27
February 2023 30
March 2023 42
April 2023 74
May 2023 44
June 2023 55
July 2023 52
August 2023 30
September 2023 68
October 2023 30
November 2023 42
December 2023 44
January 2024 57
February 2024 38
March 2024 37
April 2024 55
May 2024 56
June 2024 41
July 2024 40
August 2024 30
September 2024 23
October 2024 22

Citations

38 Web of Science

×

Email alerts

Email alerts

Citing articles via

More from Oxford Academic