The Effect of Inappropriate Calibration: Three Case Studies in Molecular Ecology (original) (raw)

Open Access

Peer-reviewed

Research Article

The Effect of Inappropriate Calibration: Three Case Studies in Molecular Ecology

PLOS

x

Figures

Abstract

Time-scales estimated from sequence data play an important role in molecular ecology. They can be used to draw correlations between evolutionary and palaeoclimatic events, to measure the tempo of speciation, and to study the demographic history of an endangered species. In all of these studies, it is paramount to have accurate estimates of time-scales and substitution rates. Molecular ecological studies typically focus on intraspecific data that have evolved on genealogical scales, but often these studies inappropriately employ deep fossil calibrations or canonical substitution rates (e.g., 1% per million years for birds and mammals) for calibrating estimates of divergence times. These approaches can yield misleading estimates of molecular time-scales, with significant impacts on subsequent evolutionary and ecological inferences. We illustrate this calibration problem using three case studies: avian speciation in the late Pleistocene, the demographic history of bowhead whales, and the Pleistocene biogeography of brown bears. For each data set, we compare the date estimates that are obtained using internal and external calibration points. In all three cases, the conclusions are significantly altered by the application of revised, internally-calibrated substitution rates. Collectively, the results emphasise the importance of judicious selection of calibrations for analyses of recent evolutionary events.

Citation: Ho SYW, Saarma U, Barnett R, Haile J, Shapiro B (2008) The Effect of Inappropriate Calibration: Three Case Studies in Molecular Ecology. PLoS ONE 3(2): e1615. https://doi.org/10.1371/journal.pone.0001615

Academic Editor: Peter Bennett, University of Kent, United Kingdom

Received: October 21, 2007; Accepted: January 22, 2008; Published: February 20, 2008

Copyright: © 2008 Ho et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Funding: S.Y.W.H. was supported by the Leverhulme Trust. U.S. was supported by the Estonian Science Foundation (grant 7040). J.H. was supported by the Arts and Humanities Research Board. B.S. was supported by the Royal Society. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Evolutionary time-scales estimated from molecular data form the foundation for a diverse range of molecular ecological studies, including those of biogeography, speciation, conservation genetics, and population biology. Molecular chronologies allow us to examine correlations between evolutionary events and palaeoclimatic phenomena, such as glaciations and sea level changes [1][3], or biotic factors, including faunal and floral changes and human migration [4][6]. For conservation purposes, we can measure the phylogenetic distinctiveness of a species or the antiquity of a specific population [7], [8], while past and present effective population sizes can be inferred in studies of population biology [9], [10]. All of these studies depend on accurate estimates of molecular time-scales and substitution rates which, primarily due to methodological limitations, have been made with varying degrees of rigour in the past.

Estimating time-scales from genetic data is laden with methodological obstacles. One of the chief difficulties is the selection of an appropriate calibration [11][14], which is necessary for converting measures of genetic divergence into units of absolute or geological time. Calibrating information can be incorporated into an analysis in one of several ways [15], of which the most widely used are: (i) fixing the age of a phylogenetic divergence event on the basis of independent palaeontological, archaeological, or biogeographic data; and (ii) importing a substitution rate obtained from independent data. A third calibration method is the inclusion of heterochronous sequences of known age, such as ancient DNA sequences extracted from radiocarbon-dated samples [16], [17].

The fossil record has played a key role in calibrating molecular estimates of evolutionary rates and divergence times in the tree of life [18]. This role has expanded in recent years due to advances in phylogenetic methods [15], [19]. Ecological studies, however, often investigate evolutionary processes within species, such as the timing of dispersal, migration, or extinction events [4], [20]. These occur on genealogical time-scales rather than the longer phylogenetic time-scales typically associated with fossil calibrations. Using deep fossil calibrations in ecological studies can present a methodological problem because different stages of the substitution process are being observed over genealogical and phylogenetic time-scales [21]. When sequences are taken from individuals within a population or species, the differences among them represent segregating sites or polymorphisms, many of which are transient and will be removed by genetic drift or purifying selection [21][24]. In contrast, differences between sequences taken from different species represent past fixations (substitutions).

This is, of course, a simplistic portrayal of the situation, as the species boundary is often indistinct and the two scales are directly linked because mutation and substitution are aspects of the same population genetic processes. For example, the observed genetic variation within a species can be inflated by the presence of ancestral polymorphisms that have been inherited from the parent species [25].

Nevertheless, it is apparent that considerable disparities can result between long-term substitution rates and instantaneous mutation rates, with direct and indirect estimates of the latter frequently yielding high values [26][28]. This disparity can be magnified by saturation at mutational hot spots [29], which can obscure the presence of past polymorphisms and cause long-term rates to be underestimated if inadequate corrections are applied.

In a phylogenetic analysis, transient polymorphisms manifest themselves as excess nucleotide changes on the branches near the tips of the tree (Figure 1) [30], [31]. They disappear from the population through time, so that the deeper a branch is in the tree, the fewer transient polymorphisms it will carry. When multiple species are compared within a single tree, the majority of nucleotide changes observed along the deepest branches, including the branch between the study species and outgroup, are likely to represent substitutions. Consequently, deeper calibration points will lead to slower observed rates [21], [23]. This can lead to considerable overestimates of times to divergence, particularly when slow substitution rates are obtained from interspecific comparisons in a phylogenetic context and then extrapolated to population-level data [21], [32].

thumbnail

Figure 1. Phylogenetic tree illustrating the impact of using extraspecific and intraspecific calibration points.

The tree shows the locations of nucleotide changes (small vertical bars). The nucleotide changes within the study species represent segregating sites, some of which will be fixed in the long term, but most of which will be removed by drift or selection. The changes between the study species and outgroup species represent substitutions. If an estimate of the evolutionary rate is calibrated using an external calibration point, such as the split between the study and outgroup species, then the intraspecific rate will be underestimated. This will lead to an overestimation of times to divergence, including the age of the most recent common ancestor of the study species.

https://doi.org/10.1371/journal.pone.0001615.g001

We believe that the inappropriate use of extraspecific or external calibration points may have misled a substantial number of molecular ecological studies. Here we present three detailed case studies in which the conclusions are significantly altered by using intraspecific, internal calibration points.

Results and Discussion

Avian Speciation in the Late Pleistocene

In Eurasia and North America, the Pleistocene was a time of significant fluctuations in climate and the distribution of habitats. Cyclical glacial advance and retreat produced dramatic changes in local environmental conditions over the past 250 kyr [33], leading to the view that these changes were conducive to an increased rate of avian speciation, a concept embodied in the “Late Pleistocene Origins” (LPO) hypothesis [for a recent review, see 1]. This hypothesis readily lends itself to testing because it makes specific predictions about divergence times, which can be estimated from molecular data. A study of mitochondrial DNA from avian sister species found that divergences occurred prior to the Pleistocene [34]. On the basis of this evidence the authors suggested that the speciation events were associated with early glacial expansion in the Northern Hemisphere around 2.4 Myr before present (BP). In contrast, an analysis of diverging conspecific populations or ‘phylogroups’, which were interpreted as reflecting incipient speciation, supported a more recent time-scale for avian diversification [35]. The debate has continued in numerous subsequent studies [36][42].

In nearly all of these studies, the authors collected a series of genetic divergence estimates, then divided the distances by a known substitution rate in order to produce an estimate of the time since divergence. The reliability of such estimates depends absolutely on the accuracy of the imported rate. Without exception, the substitution rate used in tests of the LPO hypothesis has been the ‘traditional’ mitochondrial rate of 0.01 substitutions per site per million years (subs/site/Myr), which has long been adopted as a standard in avian molecular studies [43], [44]. There are several reasons why the use of this rate may be questionable. First, the rate is not universally applicable among different avian taxa, with considerable rate heterogeneity detected among lineages [45], [46]. Second, the rate was calculated in a phylogenetic context, whereas the study of incipient species (and perhaps recently diverged species) is a genealogical issue [32]. Studies of intraspecific data from birds have generally yielded faster substitution rate estimates [23], [47], peaking at a rate of 0.95 subs/site/Myr for the mitochondrial hypervariable region 1 in Adélie penguins [48]. To investigate the impact of employing these higher rates in investigations of the LPO hypothesis, we re-analysed sets of genetic distances made in published studies.

Using a rate of 0.01 subs/site/Myr, only three of the 22 species had phylogroup divergences occurring within the past 250 kyr, whereas three species had Pliocene (>1.8 Myr) ages (Table 1). By applying a higher, revised rate of 0.075 subs/site/Myr, obtained from an internally-calibrated analysis of amakihi subspecies [22] (see Materials and Methods), we found that the evidence shifted significantly towards support for late Pleistocene divergences. Only three species yielded phylogroup divergences exceeding 250 kyr. Our date estimates do not take into account the uncertainty associated with the imported, revised rate, but if the highest rate implied by the upper 95% credibility limit on the original rate estimate [0.111 subs/site/Myr; 22] is applied to the data, then all of the resulting divergence time estimates fall within the past 250 kyr.

Our results should be interpreted cautiously because of the dubious applicability of a single substitution rate to different avian species. Collectively, however, our analyses demonstrate that the published molecular evidence used to challenge the LPO hypothesis is weakened by the application of a revised rate. This does not necessarily signify that glacial cycles provided conditions that were exceptionally suitable for allopatric speciation among birds, and it remains unclear whether the tempo of avian speciation in the Pleistocene was actually elevated in comparison to preceding geological periods [36], [39], [40]. Nevertheless, the sensitivity of tests of the LPO to the assumed substitution rate emphasises the importance of selecting an appropriate calibration. This concern also applies to similar studies of late Pleistocene divergences among other organisms [49].

Demographic History of the Bowhead Whale

The bowhead whale (Balaena mysticetus) was subjected to intensive commercial exploitation from the 17th to early 20th centuries. During this period, over 90,000 individuals were taken from the Spitsbergen stock alone [50]. This stock remains critically endangered, while two of the other four designated stocks are endangered. Bowhead whales are exceptionally long-lived, with a maximum longevity well in excess of 100 years, and appear to reach sexual maturity at around 25 years of age [51]. These observations suggest a long generation time, which presents obvious difficulties for population management and stock recovery. For these reasons, the demographic history of this species is of significant ecological, commercial, and conservational interest.

An analysis of modern DNA samples from the mitochondrial control region of 98 individuals found that the most recent common ancestor (MRCA) of modern bowhead whales lived around 267 kyr ago [9]. The estimate was calibrated using a value of 3.4 Myr for the age of the split between bowhead whales (Balaena) and right whales (Eubalaena). This calibration point was informed by the fossil record, which is relatively sparse for balaenids; the oldest fossil that can be confidently assigned to either the Balaena or Eubalaena lineage dates from the Pliocene (5.2 to 2.6 Myr BP). Several lines of evidence, however, suggest a more protracted evolutionary history for the two lineages. First, the Pliocene Balaena fossils appear to be highly derived [52]. Second, there is a conspicuous hiatus between the Pliocene fossils and the most ancient fossil representative of Balaenidae, Morenocetus parvus, which dates from the earliest Miocene around 20–23 Myr BP [52]. Independent support for a deeper split between the two genera was provided by analysis of whole mitochondrial genomes, which produced an age estimate of 17 Myr [53].

The palaeontological uncertainty over the age of the bowhead-right whale split, the possible antiquity of the divergence event, and the external nature of this calibration all argue against its use in studying the recent demographic history of bowhead whales. To investigate the effect of calibration choice, we use ancient DNA sequences to infer an internally-calibrated evolutionary time-scale.

We reconstructed the demographic history of bowhead whales using Bayesian skyline plot (BSP) analysis, which generates a plot of the estimated effective population size through time [5] (see Materials and Methods). We found evidence of a population expansion in the late Pleistocene (Figure 2). This pattern emerges for three data configurations (modern only, ancient only, and combined modern and ancient), suggesting that it is not an exclusive artefact of the chronologically heterogeneous sampling of ancient DNA sequences. The age of the MRCA of all individuals was estimated at 153 kyr BP, with a 95% highest posterior density (HPD) of 49.6–294 kyr BP. The estimated timings of the MRCA and population expansion of bowhead whales are more recent than those inferred by Rooney et al. [9].

thumbnail

Figure 2. Bayesian skyline plots showing the recent demographic history of bowhead whales, estimated using phylogenetic analysis of three alignments of the mitochondrial control region: (a) combined alignment of 68 modern haplotypes and 99 radiocarbon-dated, ancient DNA sequences; (b) modern sequences only; and (c) ancient sequences only.

All three plots are drawn to the vertical and horizontal scales.

https://doi.org/10.1371/journal.pone.0001615.g002

The estimated substitution rate was 0.159 subs/site/Myr (95% HPD: 0.051–0.272 subs/site/Myr). Although this is lower than other rate estimates obtained in some ancient DNA studies [for lists], [see 22,54], it is significantly higher than the substitution rates obtained using fossil-based point calibrations of either 3.4 or 17 Myr for the bowhead-right whale split, which yield estimates of 0.017 subs/site/Myr (95% HPD: 0.009–0.0029 subs/site/Myr) and 0.0034 subs/site/Myr (95% HPD: 0.0019–0.0057 subs/site/Myr), respectively. By using these two rate estimates to specify a prior on the substitution rate in analyses of the modern bowhead whale sequences, the age of the MRCA of all individuals is estimated to be 1.1 Myr BP (95% HPD: 0.497–1.88 Myr BP) and 4.12 Myr BP (95% HPD: 2.07–6.39 Myr BP), respectively. These are substantially older than the estimate of 150 kyr obtained in the BSP analyses calibrated by radiocarbon dates.

The results suggest that previous estimates of the evolutionary time-scale of bowhead whales have been misled by an inappropriate fossil calibration, the effect of which has been to produce overestimates of times to coalescence. Our re-analysis suggests that population expansion in bowhead whales occurred relatively recently and that short time periods may suffice for the generation of appreciable levels of genetic diversity. This has important consequences for studies of conservation genetics, many of which have relied on substitution rates obtained by adopting relatively deep calibration points.

Pleistocene Biogeography of the Brown Bear

The last few years have seen a proliferation of large data sets that sample individuals from a population over thousands to tens of thousands of generations. These data enable the direct testing of hypotheses concerning the relationship between past and present distributions of species and populations [20], and the impact of environmental events on the phylogeography of a species [4], [55]. In these studies, it is evident that without accurate estimates of substitution rates, there is a risk of misidentifying causal environmental factors for inferred demographic changes.

Brown bears (Ursus arctos) present an interesting illustration of this point. Modern brown bear populations are distributed throughout Europe, Asia, and North America, and exhibit significant maternally-linked phylogeographic structure across this range [20], [56][58] (see tree topology in Figure 3). The modern distribution of brown bears is believed to be the consequence of post-glacial expansion from local refuges, and to have remained relatively stable since this expansion [3], [57], [59][61]; however, a more recent study of ancient DNA found a more complex phylogeographic history for Western European brown bears [62]. Although this work did not entirely exclude the classic refuge scenario, it insinuated that past refugium theories might be oversimplifications [62]. Nevertheless, ancient DNA studies have revealed that the modern European distribution of brown bear lineages differs from that prior to the last glacial maximum [55], whereas in North America four distinct periods of constant population structure could be identified over the last 50 kyr, interposed by brief periods of rapid demographic change [20]. These findings suggest that brown bears in both Europe and North America experienced frequent local extinctions and range expansions during the climatically volatile late Pleistocene.

thumbnail

Figure 3. Maximum clade credibility tree from Bayesian analysis of mitochondrial control region sequences from 56 modern and 51 ancient brown bear samples, with a time scale calibrated using the radiocarbon dates of the ancient sequences.

Major clades and their geographic localities are given. Posterior probabilities are given for the nodes A-N, which are referred to in the text and in Table 2.

https://doi.org/10.1371/journal.pone.0001615.g003

Saarma et al. [3] investigated the possibility that the present distribution of bears in northern Europe is the result of post-glacial expansion from a refuge in the West Carpathian mountains. Using a substitution rate estimated from radiocarbon-dated North American brown bears [20], they estimated that the MRCA of the Eastern lineage (node I, Figure 3) existed 24 kyr BP (95% HPD: 6–50 kyr BP), with dates of 67 kyr BP (95% HPD: 20–131 kyr BP) for the Western lineage (node A) and 174 kyr BP (95% HPD: 61–314 kyr BP) for all European bears (node N). The estimates came with wide 95% HPD intervals, but were consistent with recent expansions of the Eastern and Western lineages and a mid-late Pleistocene MRCA for all brown bears.

These relatively recent estimates for the origin of the European brown bear lineages stand in contrast to the more ancient estimates of Hofreiter et al. [63], which were calibrated using the split between brown bears and their sister species (cave bears; Ursus speleaus) at 1.2–1.7 Myr [64], [65]. This calibration put the ages of the MRCAs for the two European lineages of brown bears during the mid-early Pleistocene: 640 kyr BP (95% CI: 290–1,390 kyr BP) for the Eastern lineage and 350 kyr BP (95% CI: 150–790 kyr BP) for the Western lineage.

The disparity between the two sets of estimates illustrates the difference between internal and external calibration, but the estimates are difficult to compare directly because they were made using different approaches. To investigate this further, we performed a Bayesian phylogenetic analysis of an alignment of 107 brown bear sequences using two separate modes of calibration: (i) internal calibration using 31 radiocarbon-dated ancient DNA samples; and (ii) importation of a rate estimated using the external cave-brown bear split.

The inferred tree topology was identical for both analyses (Figure 3). Internal calibration produced a rate estimate of 0.390 subs/site/Myr (95% HPD: 0.264–0.526 subs/site/Myr), whereas external calibration yielded a slower rate of 0.061 subs/site/Myr (95% HPD: 0.045–0.075 subs/site/Myr), with correspondingly older MRCA age estimates (Table 2).

Both of our age estimates for the MRCA of the Western lineage (clade I) were similar to those made by previous studies [3], [20]. The internally-calibrated date estimate suggests an MRCA during the previous (Weichselian) glacial period, but the 95% HPD of the externally-calibrated estimate encompasses several glacial and interglacial periods, prohibiting a detailed correlation with environmental change. The evolutionary time-frame of the Eastern European clade, which makes up part of clade IIIa, is less clear. It is interesting that there is no distinct subdivision between European and Alaskan brown bears in clade IIIa, which could be interpreted as evidence for recent expansion into Alaska and Europe from a single source population, although further sampling of late Pleistocene brown bears in Europe and Asia will be required to test this hypothesis.

Brown bears are believed to have established the first populations in North America around 50–75 kyr BP, during the early and middle parts of the last (Wisconsinan; equivalent to the Weichselian in Europe) glaciation [66], [67]. Age estimates for clades II, III, and IV (Table 2) therefore support previous claims that the modern lineages were established prior to the initial colonization of North America [57], [61], most probably during the previous interglacial/glacial transition. Conversely, the MRCA of bears from the Alaskan ABC islands (clade IIa, node C) existed around 16 kyr BP (95% HPD: 10–26 kyr BP), consistent with the timing of the most recent glacial/interglacial transition.

The age of the MRCA of polar bears (node D) is estimated at 43 kyr BP (95% HPD 19–73 kyr BP), and the divergence between polar bears and ABC islands bears (node F) is estimated to have occurred about 58 kyr BP (95% HPD: 48–72 kyr BP). Despite the internal calibration with a multitude of dated tips, the 95% HPD intervals for these estimates are wide, indicating only that polar bears diverged from other brown bears some time after the warmest part of the Wisconsinan glacial period, but lacking the power to identify any specific environmental event or geographic location.

This example illustrates the impact of inappropriate calibration on inferences made from molecular data. It also demonstrates that it can be difficult to correlate demographic data inferred from phylogenies with large-scale environmental fluctuations, due to the considerable statistical uncertainty that is often associated with intraspecific estimates of divergence times. It is possible to reduce this uncertainty by increasing sample size and alignment length, but the most powerful approach might involve the use of molecular data in conjunction with alternative methods, such as radiocarbon dating to find terminal occurrences.

Concluding Remarks

Our three case studies demonstrate that the findings of molecular ecological studies can be altered significantly by the choice of calibration. Our results show that divergence time estimates can change by an order of magnitude when relatively recent, internal calibration points are used (Table 3). Deep calibration points, particularly those based on the fossil record, lead to inferred evolutionary scenarios that are drastically different from those implied by internal, intraspecific calibrations. The large disparity between estimates obtained by internal and external calibration also highlights the importance of judicious selection of calibrations for recent evolutionary events, even if one does not subscribe to the hypothesis of time-dependent rate estimates [68], [69].

Intraspecific calibrations can be obtained in one of several ways. As illustrated in the bowhead whale and brown bear examples above, the radiocarbon ages of ancient samples can be used as calibrations on the tips of the tree. Alternatively, internal calibrations can be obtained from biogeography [70], [71], although such calibrations can be difficult to interpret and specify correctly [12], [68]. The fossil record will rarely be able to offer intraspecific calibrations unless substantial diagnostic variation exists within a species, for example among subspecies or conspecific populations. Frequently, however, intraspecific calibration data are unavailable for many data sets, leaving the less desirable option of importing a substitution rate estimated from another (preferably related) species.

A considerable disadvantage of intraspecific calibrations is that they will often produce date estimates with a larger degree of uncertainty than those made using external calibrations. There are two reasons for this: (i) external calibrations are often placed at the root of the tree; and (ii) the removal of outgroup species reduces the information in the data set. Moreover, molecular date estimates can come with substantial uncertainty, so that it can be difficult to draw strong conclusions from molecular date estimates if the associated estimation error is taken into account. Therefore, a reduction in precision appears to be the consequence of using intraspecific calibrations, but the improvement in accuracy should come as a worthwhile recompense.

Materials and Methods

Avian Speciation in the Late Pleistocene

In order to test the hypothesis of late Pleistocene origins for birds, we obtained a set of pairwise genetic distances estimated from avian mitochondrial cytochrome b by Weir and Schluter [42]. Of these genetic distances, we restrict our re-analysis to the 22 measured between conspecific phylogroups from species with a geographic range extending above 40°N, the approximate boundary of Pleistocene glaciation in North America [72] (Table 1). To convert the distance estimates to time durations, two different substitution rates were applied. The first rate was 0.01 subs/site/Myr, which was used in the original study and corresponds to the canonical 1% rate in birds. The second rate was 0.075 subs/site/Myr, based on an analysis of cytochrome b sequences from two subspecies of amakihi from Hawaii and Maui [22]; this is one of the few internally-calibrated rate estimates available for avian cytochrome b.

Demographic History of the Bowhead Whale

To investigate the demographic history of modern bowhead whale populations, we assembled a data set comprising both ancient and modern DNA sequences. Ancient DNA (aDNA) sequences (ages from ranging from 30 years to 51 kyr) from the mitochondrial control region of 99 bowhead whales were obtained from the study by Borge et al. [73]. Modern DNA sequences for 68 haplotypes were obtained from the study by Rooney et al. [9]. Demographic history was inferred from these data using Bayesian skyline plot [BSP; 5] analyses, performed using BEAST 1.4 [74] with the substitution model selected by comparison of Akaike Information Criterion values. Three BSP analyses were performed: (i) aDNA sequences only; (ii) modern DNA sequences only; and (iii) ancient and modern DNA sequences combined. For all three analyses, 10 group sizes were used for the BSP. Posterior distributions of parameters were investigated using Markov chain Monte Carlo (MCMC) analysis, with samples from the posterior drawn every 10,000 steps over a total of 50,000,000 steps, with the first 10% discarded as burn-in.

In the two analyses using aDNA sequences, time-scales were calibrated using the radiocarbon dates of the 99 ancient samples. The analysis of modern DNA sequences was calibrated by specifying a normally-distributed prior on the substitution rate (mean 0.149 subs/site/Myr, standard deviation 0.052 subs/site/Myr), based on the value estimated in the aDNA analysis. The modern and ancient DNA data sets were obtained from different stocks (Bering-Chukchi-Beaufort Seas and Spitsbergen, respectively), but there appears to be little molecular or morphological evidence for the discreteness of any of the five designated stocks of bowhead whales [73], [75], [76].

To estimate the average substitution rate across balaenids, control region sequences were obtained from GenBank for the three extant species of right whale (Eubalaena japonica, E. australis, and E. glacialis). These were aligned with the corresponding sequence from Balaena mysticetus haplotype A. The alignment was analysed with BEAST using fossil calibrations. The oldest fossil representative of either the Eubalaena or Balaena lineage dates from the Pliocene (1.8–5.3 Myr BP) [77]. Rooney et al. [9] summarised this information by taking a mean value of 3.4 Myr, which we have adopted for comparative purposes. Following the results of Sasaki et al. [53] and in view of the fossil hiatus for balaenids during the Miocene, we repeated the analysis using a calibration age of 17 Myr for the _Balaena_-Eubalaena split. In the MCMC analysis, samples were drawn from the posterior every 10,000 steps over a total of 10,000,000 steps, with the first 10% discarded as burn-in. All input files are available upon request from the corresponding author.

Pleistocene Biogeography of the Brown Bear

To investigate the timing of brown bear movements in the Pleistocene, we collected ancient and modern DNA for phylogenetic analysis. Mitochondrial control region sequences from 51 ancient brown bears were obtained from published sources [20], [55], [62], [78]. In addition, we obtained all 56 modern haplotypes that were available on GenBank, including only the sequences that spanned the same 195 bp stretch as the aDNA sequences [see 20].

To estimate a substitution rate based on the cave-brown bear split, five cave bear sequences covering the same mitochondrial fragment were obtained from GenBank and aligned with five randomly-selected brown bear sequences. Bayesian phylogenetic analyses were performed using BEAST, using the HKY substitution model and a constant-size coalescent prior on the tree. To estimate an externally-calibrated substitution rate, the age of the cave-brown bear split was fixed to a value of 1.45 Myr in accordance with fossil evidence [64], [65]. Samples from the posterior were drawn every 20,000 MCMC steps over a total of 20,000,000 steps, with the first 10% discarded as burn-in.

Two molecular clock analyses were then performed on the alignment of 107 brown bears, using the model settings described above. In the first analysis, the externally-calibrated rate was used to place a normally-distributed prior (mean and standard deviation of 4.84×10−3 and 8.05×10−4 subs/site/Myr, respectively) on the substitution rate. In the second analysis, rate estimates were calibrated using the radiocarbon ages of the 51 ancient samples. Other details of the MCMC analysis were as described above. All input files are available upon request from the corresponding author.

Acknowledgments

We wish to thank Love Dalén and two reviewers for helpful comments, and Lutz. Bachmann for providing a paper in press.

Author Contributions

Conceived and designed the experiments: SH. Analyzed the data: BS SH US. Wrote the paper: BS RB SH JH US.

References

  1. 1.Lovette IJ (2005) Glacial cycles and the tempo of avian speciation. Trends Ecol Evol 20: 57–59.
  2. 2.Hofreiter M, Rabeder G, Jaenicke-Despres V, Withalm G, Nagel D, et al. (2004) Evidence for reproductive isolation between cave bear populations. Curr Biol 14: 40–43.
  3. 3.Saarma U, Ho SYW, Pybus OG, Kaljuste M, Tumanov IM, et al. (2007) Mitogenetic structure of brown bears (Ursus arctos L.) in north-eastern Europe and a new time-frame for the formation of brown bear lineages. Mol Ecol 16: 401–413.
  4. 4.Shapiro B, Drummond AJ, Rambaut A, Wilson MC, Matheus PE, et al. (2004) Rise and fall of the Beringian steppe bison. Science 306: 1561–1565.
  5. 5.Drummond AJ, Rambaut A, Shapiro B, Pybus OG (2005) Bayesian coalescent inference of past population dynamics from molecular sequences. Mol Biol Evol 22: 1185–1192.
  6. 6.Culver M, Johnson WE, Pecon-Slattery J, O'Brien SJ (2000) Genomic ancestry of the American puma (Puma concolor). J Hered 91: 186–197.
  7. 7.Buckley-Beason VA, Johnson WE, Nash WG, Stanyon R, Menninger JC, et al. (2006) Molecular evidence for species-level distinctions in clouded leopards. Curr Biol 16: 2371–2376.
  8. 8.Baker CS, Perry A, Bannister JL, Weinrich MT, Abernethy RB, et al. (1993) Abundant mitochondrial DNA variation and world-wide population structure in humpback whales. Proc Natl Acad Sci USA 90: 8239–8243.
  9. 9.Rooney AP, Honeycutt RL, Derr JN (2001) Historical population size change of bowhead whales inferred from DNA sequence polymorphism data. Evolution 55: 1678–1685.
  10. 10.Alter SE, Rynes E, Palumbi SR (2007) DNA evidence for historic population size and past ecosystem impacts of gray whales. Proc Natl Acad Sci USA 104: 15162–15167.
  1. 11.Graur D, Martin W (2004) Reading the entrails of chickens: Molecular timescales of evolution and the illusion of precision. Trends Genet 20: 80–86.
  1. 12.Heads M (2005) Dating nodes on molecular phylogenies: a critique of molecular biogeography. Cladistics 21: 62–78.
  1. 13.Ho SYW (2007) Calibrating molecular estimates of substitution rates and divergence times in birds. J Avian Biol 38: 409–414.
  1. 14.Ritchie PA, Millar CD, Gibb GC, Baroni C, Lambert DM (2004) Ancient DNA enables timing of the Pleistocene origin and Holocene expansion of two Adélie penguin lineages in Antarctica. Mol Biol Evol 21: 240–248.
  1. 15.Drummond AJ, Ho SYW, Phillips MJ, Rambaut A (2006) Relaxed phylogenetics and dating with confidence. PLoS Biol 4: e88.
  1. 16.Rambaut A (2000) Estimating the rate of molecular evolution: incorporating non-contemporaneous sequences into maximum likelihood phylogenies. Bioinformatics 16: 395–399.
  1. 17.Ho SYW, Kolokotronis S-O, Allaby RG (2007) Elevated substitution rates estimated from ancient DNA. Biol Lett 3: 702–705.
  1. 18.Benton MJ, Donoghue PC (2007) Paleontological evidence to date the tree of life. Mol Biol Evol 24: 26–53.
  1. 19.Yang Z, Rannala B (2006) Bayesian estimation of species divergence times under a molecular clock using multiple fossil calibrations with soft bounds. Mol Biol Evol 23: 212–226.
  1. 20.Barnes I, Matheus P, Shapiro B, Jensen D, Cooper A (2002) Dynamics of Pleistocene population extinctions in Beringian brown bears. Science 295: 2267–2270.
  1. 21.Penny D (2005) Relativity for molecular clocks. Nature 426: 183–184.
  1. 22.Ho SYW, Shapiro B, Phillips M, Cooper A, Drummond AJ (2007) Evidence for time dependency of molecular rate estimates. Syst Biol 56: 515–522.
  1. 23.Ho SYW, Phillips MJ, Cooper A, Drummond AJ (2005) Time dependency of molecular rate estimates and systematic overestimation of recent divergence times. Mol Biol Evol 22: 1561–1568.
  1. 24.Woodhams M (2006) Can deleterious mutations explain the time dependency of molecular rate estimates? Mol Biol Evol 23: 2271–2273.
  1. 25.Charlesworth B, Bartolomé C, Noël V (2005) The detection of shared and ancestral polymorphisms. Genet Res 86: 149–157.
  1. 26.Howell N, Smejkal CB, Mackey DA, Chinnery PF, Turnbull DM, et al. (2003) The pedigree rate of sequence divergence in the human mitochondrial genome: There is a difference between phylogenetic and pedigree rates. Am J Hum Genet 72: 659–670.
  1. 27.Denver DR, Morris K, Lynch M, Vassilieva LL, Thomas WK (2000) High direct estimate of the mutation rate in the mitochondrial genome of Caenorhabditis elegans. Science 289: 2342–2344.
  1. 28.Mao L, Zabel C, Wacker MA, Nebrich G, Sagi D, et al. (2006) Estimation of the mtDNA mutation rate in aging mice by proteome analysis and mathematical modeling. Exp Gerontol 41: 11–24.
  1. 29.Galtier N, Enard D, Radondy Y, Bazin E, Belkhir K (2006) Mutation hot spots in mammalian mitochondrial DNA. Genome Res 16: 215–222.
  1. 30.Williamson S, Orive ME (2002) The genealogy of a sequence subject to purifying selection at multiple sites. Mol Biol Evol 19: 1376–1384.
  1. 31.Nielsen R, Weinreich DM (1999) The age of nonsynonymous and synonymous mutations in animal mtDNA and implications for the mildly deleterious theory. Genetics 153: 497–506.
  1. 32.Ho SYW, Larson G (2006) Molecular clocks: when times are a-changin'. Trends Genet 22: 79–83.
  1. 33.Webb T III, Bartlein PJ (1992) Global changes during the last 3 million years: climatic controls and biotic responses. Ann Rev Ecol Syst 23: 141–173.
  1. 34.Klicka J, Zink RM (1997) The importance of recent ice ages in speciation: a failed paradigm. Science 277: 1666–1669.
  1. 35.Avise JC, Walker D (1998) Pleistocene phylogeographic effects on avian populations and the speciation process. Proc R Soc Lond B 265: 457–463.
  1. 36.Cicero C, Johnson NK (2006) The tempo of avian diversification: Reply. Evolution 60: 413–414.
  1. 37.Johnson NK, Cicero C (2004) New mitochondrial DNA data affirm the importance of Pleistocene speciation in North American birds. Evolution 58: 1112–1130.
  1. 38.Klicka J, Zink RM (1999) Pleistocene effects on North American songbird evolution. Proc R Soc Lond B 266: 695–700.
  1. 39.Zink RM, Klicka J, Barber BR (2004) The tempo of avian diversification during the Quaternary. Phil Trans R Soc Lond B 359: 215–220.
  1. 40.Zink RM, Klicka J (2006) The tempo of avian diversification: a comment on Johnson and Cicero. Evolution 60: 411–414.
  1. 41.Weir JT, Schluter D (2004) Ice sheets promote speciation in boreal birds. Proc R Soc Lond B 271: 1881–1887.
  1. 42.Weir JT, Schluter D (2007) The latitudinal gradient in recent speciation and extinction rates of birds and mammals. Science 315: 1574–1576.
  1. 43.Shields GF, Wilson AC (1987) Calibration of mitochondrial DNA evolution in geese. J Mol Evol 34: 212–217.
  1. 44.Randi E (1996) A mitochondrial cytochrome B phylogeny of the Alectoris partridges. Mol Phylogenet Evol 6: 214–227.
  1. 45.Pereira SL, Baker AJ (2006) A mitogenomic timescale for birds detects variable phylogenetic rates of molecular evolution and refutes the standard molecular clock. Mol Biol Evol 23: 1731–1740.
  1. 46.Lovette IJ (2004) Mitochondrial dating and mixed support for the “2% rule” in birds. Auk 121: 1–6.
  1. 47.Warren BH, Bermingham E, Bowie RC, Prys-Jones RP, Thebaud C (2003) Molecular phylogeography reveals island colonization history and diversification of western Indian Ocean sunbirds (Nectarinia: Nectariniidae). Mol Phylogenet Evol 29: 67–85.
  1. 48.Lambert DM, Ritchie PA, Millar CD, Holland B, Drummond AJ, et al. (2002) Rates of evolution in ancient DNA from Adélie penguins. Science 295: 2270–2273.
  1. 49.Avise JC, Walker D, Johns GC (1998) Speciation durations and Pleistocene effects on vertebrate phylogeography. Proc R Soc Lond B 265: 1707–1712.
  1. 50.Ross WG (1993) Commercial whaling in the North Atlantic sector. In: Burns JJ, Montague JJ, Cowles CJ, editors. The Bowhead Whale. Lawrence, Kansas: Allen Press. pp. 511–561.
  2. 51.George JC, Bada J, Zeh J, Scott L, Brown SE, et al. (1999) Age and growth estimates of bowhead whales (Balaena mysticetus) via aspartic acid racemization. Can J Zool 77: 571–580.
  1. 52.Barnes LG, McLeod SA (1984) The fossil record and phyletic relationships of gray whales. In: Jones ML, Swartz S, Leatherwood S, editors. The Gray Whale. New York: Academic Press. pp. 3–32.
  2. 53.Sasaki T, Nikaido M, Hamilton H, Goto M, Kato H, et al. (2005) Mitochondrial phylogenetics and evolution of mysticete whales. Syst Biol 54: 77–90.
  1. 54.Ho SYW, Heupink TH, Rambaut A, Shapiro B (2007) Bayesian estimation of sequence damage in ancient DNA. Mol Biol Evol 24: 1416–1422.
  1. 55.Hofreiter M, Serre D, Rohland N, Rabeder G, Nagel D, et al. (2004) Lack of phylogeography in European mammals before the last glaciation. Proc Natl Acad Sci USA 101: 12963–12968.
  1. 56.Taberlet P, Bouvet J (1994) Mitochondrial DNA polymorphism, phylogeography, and conservation genetics of the brown bear (Ursus arctos) in Europe. Proc R Soc Lond B 255: 195–200.
  1. 57.Waits L, Talbot S, Ward RH, Shields G (1998) Phylogeography of the North American brown bear and implications for conservation. Conserv Biol 12: 408–417.
  1. 58.Matsuhashi T, Masuda R, Mano T, Yoshida MC (1999) Microevolution of the mitochondrial DNA control region in the Japanese brown bear (Ursus arctos) population. Mol Biol Evol 16: 676–684.
  1. 59.Taberlet P, Fumagalli L, Wust-Saucy A-G, Cosson J-F (1998) Comparative phylogeography and postglacial conolozation routes in Europe. Mol Ecol 7: 453–464.
  1. 60.Hewitt G (2000) The genetic legacy of the Quaternary ice ages. Nature 405: 907–913.
  1. 61.Talbot SL, Shields GF (1996) Phylogeography of brown bears (Ursus arctos) of Alaska and paraphyly within the Ursidae. Mol Phylogenet Evol 5: 477–494.
  1. 62.Valdiosera CE, García N, Anderung C, Dalén L, Crégut-Bonnoure E, et al. (2007) Staying out in the cold: glacial refugia and mitochondrial DNA phylogeography in ancient European brown bears. Mol Ecol in press.
  1. 63.Hofreiter M, Capelli C, Krings M, Waits L, Conard N, et al. (2002) Ancient DNA analyses reveal high mitochondrial DNA sequence diversity and parallel morphological evolution of late Pleistocene cave bears. Mol Biol Evol 19: 1244–1250.
  1. 64.Loreille O, Orlando L, Patou-Mathis M, Philippe M, Taberlet P, et al. (2001) Ancient DNA analysis reveals divergence of the cave bear, Ursus spelaeus, and brown bear, Ursus arctos, lineages. Curr Biol 11: 200–203.
  1. 65.Kurtén B (1976) The Cave Bear Story. New York: Columbia University Press.
  2. 66.Kurtén B, Anderson E (1980) Pleistocene Mammals of North America. New York: Columbia University Press.
  3. 67.Matheus PE (1995) Diet and co-ecology of Pleistocene short-faced and brown bears in Eastern Beringia. Quaternary Res 44: 447–453.
  1. 68.Emerson BC (2007) Alarm bells for the molecular clock? No support for Ho et al.'s model of time-dependent molecular rate estimates. Syst Biol 56: 337–345.
  1. 69.Bandelt H-J, Kong QP, Richards M, Macaulay V (2006) Estimation of mutation rates and coalescence times: some caveats. In: Bandelt H-J, Macaulay V, Richards M, editors. Human mitochondrial DNA and the evolution of Homo sapiens. Berlin: Springer. pp. 47–90.
  2. 70.Fleischer RC, McIntosh CE, Tarr CL (1998) Evolution on a volcanic conveyor belt: using phylogeographic reconstructions and K-Ar-based ages of the Hawaiian Islands to estimate molecular evolutionary rates. Mol Ecol 7: 533–545.
  1. 71.Stoneking M, Sherry ST, Redd AJ, Vigilant L (1992) New approaches to dating suggest a recent age for human mtDNA ancestor. Phil Trans R Soc Lond B 337: 167–175.
  1. 72.Dyke AS, Andrews JT, Clark PU, England JH, Miller GH, et al. (2002) The Laurentide and Innuitian ice sheets during the Last Glacial Maximum. Quaternary Sci Rev 21: 9–31.
  1. 73.Borge T, Bachmann L, Björnstad G, Wiig Ø (2007) Genetic variation in Holocene bowhead whales from Svalbard. Mol Ecol 16: 2223–2235.
  1. 74.Drummond AJ, Rambaut A (2007) BEAST. 1.4 ed. Edinburgh: University of Edinburgh.
  2. 75.Heide-Jørgensen MP, Laidre KL, Jensen MV, Dueck L, Postma LD (2006) Dissolving stock discreteness with satellite tracking: Bowhead whales in Baffin Bay. Mar Mamm Sci 22: 34–45.
  1. 76.Moore SE, Reeves RR (1993) Distribution and movement. In: Burns JJ, Montague JJ, Cowles CJ, editors. The Bowhead Whale. Lawrence, Kansas: Allen Press. pp. 313–386.
  2. 77.McLeod SA, Whitmore FC Jr, Barnes LG (1993) Evolutionary relationships and classification. In: Burns JJ, Montague JJ, Cowles CJ, editors. The Bowhead Whale. Lawrence, Kansas: Allen Press. pp. 45–70.
  3. 78.Matheus P, Burns J, Weinstock J, Hofreiter M (2004) Pleistocene brown bears in the mid-continent of North America. Science 306: 1150.