The Role of Morphological Data in Phylogeny Reconstruction

We live in the age of comparative genomics, and it may seem that there is not much point in reconstructing phylogenies using morphological data anymore. As more and more genes and genomes are being sequenced, the possibility that thousands or even millions of informative, independently evolving molecular characters can be brought to bear on a given phylogenetic problem is quickly becoming a reality (e.g., Rokas et al., 2003). Given the rate that new sequence data are being added, and the rate at which new innovations continue to accelerate this process, it seems possible that in the not-too-distant future we will be able to have a perfectly accurate and well-supported phylogeny of most living species on earth using molecular data alone. So why bother with morphology?

A recent paper by Scotland et al. ( 2003; SEA hereafter) offered a reappraisal of the role of morphology in phylogeny reconstruction. This is certainly an important and timely topic to discuss, and their main thesis is bold and controversial. They state that “We view any attempt to include more morphological data in phylogeny reconstruction as inherently problematic” (p. 545). Unfortunately, most of their arguments are based on unsupported speculation, and they fail to mention numerous studies that clearly contradict their conclusions. Given that many of their comments are written as responses to book chapters written by my collaborators and myself (e.g., Hillis and Wiens, 2000; Poe and Wiens, 2000; Wiens, 2000a), I feel obligated to elucidate some of these problems. Many of the issues raised are central to how systematics is done and will be conducted in the future. I will argue that, despite many undeniable advantages of molecular data, it is still absolutely necessary that we continue to collect additional morphological data for phylogenetic analysis, and continue to improve our methods for morphology-based phylogenetics. Note that Jenner (2004) has provided an independent rebuttal of the SEA paper, and he describes a large number of substantive criticisms which show only limited overlap with my own.

Why We Still Need to Collect More Morphological Data

The Future

There are many reasons to continue to do morphological phylogenetics. But given the incredible rate of advances in molecular systematics, it may be useful to divide these reasons into those pertaining to the present and future. Those pertaining to the future may actually be the most relevant, because many present-day limitations of molecular phylogenetics seem likely to be overcome very soon. I will focus on the putative future first, and then deal with present-day issues.

The most compelling reason to continue to collect morphological data long into the future is to resolve the phylogenetic relationships of fossil taxa and their relationships to living taxa (e.g., Maddison, 1996; Hillis and Wiens, 2000; Jenner, 2004). The reconstructed Tree of Life must include fossil taxa. Considering all the species that have ever evolved, most are now extinct (>99% according to some estimates; Novacek and Wheeler, 1992), and many extinct groups were diverse, ecologically important, and very distinct from their closest living relatives. For now and the immediate future, the relationships of most fossil taxa can only be determined through phylogenetic analysis of morphological data (despite impressive molecular studies of very recent fossil taxa). Contrary to what SEA imply (p. 543), fossils are not merely important for their potential to help resolve relationships of living taxa, and Hillis and Wiens (2000) did not advocate thorough taxon sampling solely because of its potential benefits for phylogeny estimation.

Our understanding of the rate and timing of macroevolutionary processes in both living and fossil taxa also requires phylogenetic information from fossils. Although methods are available that allow divergence dates for living taxa to be estimated with molecular data (e.g., Sanderson, 1997, 2002; Yoder and Yang, 2000; Thorne and Kishino, 2002), they still require external calibration. That calibration usually comes from fossil evidence, which requires correctly assigning fossils to groups of living taxa. Because older fossils are rarely identical or conspecific with living species, we cannot simply assign fossils to living groups based on overall similarity. We need to estimate a phylogeny for the fossil and living taxa, and this can only be done using morphological data.

Perhaps most importantly, given that the earth's biota has changed dramatically and repeatedly through time, understanding how the modern world became the way that it is requires studying fossils. When we look at a fossil the first thing we need to know is “what is it?,” which really means “to what clade does it belong?” Again, this is a question that is best answered through phylogenetic analysis. Understanding evolutionary processes (e.g., character evolution) in fossil taxa also requires knowledge of their phylogenetic relationships, just as in living taxa (e.g., Gatesy and Dial, 1996).

SEA do mention fossils (p. 543), but mostly in the context of how their inclusion affects estimated relationships among living species. They question whether addition of fossil taxa can improve accuracy for phylogenetic studies focusing on living taxa, based on three points: (1) their belief that studies showing improved accuracy from increased taxon sampling used only large (>1000) numbers of characters, (2) a single example from seed plant phylogeny, and (3) the problem of missing data.

SEA question the benefits of increased taxon sampling in morphological studies, stating “less clear is the role of dense taxon sampling when there are fewer characters, as in morphological studies” (p. 542). Yet, contrary to what SEA imply, simulation studies have confirmed the benefits of increased taxon sampling for phylogenetic accuracy even when the number of characters is limited (e.g., 100 characters, Huelsenbeck, 1991 [cited but not mentioned by SEA]; Wiens, 1998b).

SEA also question the benefits of including fossil taxa for phylogenetic accuracy, based on a single example involving conflicts between morphological and molecular data in angiosperms (p. 543). The obvious counterexample (cited but not mentioned by SEA) is the morphological study of amniote relationships by Gauthier et al. (1988), in which lepidosaurs (tuataras, lizards, and snakes) are placed as basal within amniotes if certain fossil taxa are excluded. Numerous molecular studies confirm the traditional morphological hypothesis, showing that lepidosaurs are not basal ( Meyer and Zardoya, 2003). Thus, the addition of fossil taxa seems to increase congruence between the molecular and morphological results in this case, suggesting that addition of fossil taxa is important for phylogenetic accuracy.

Finally, SEA suggest that the accuracy of phylogenetic analyses that include fossil taxa is limited by their “large amounts of missing data” (p. 543). Recent simulation studies suggest that the amount or proportion of missing data in incomplete taxa may be irrelevant to their accurate placement (i.e., taxa that are 90% incomplete with nearly 2,000 missing data cells each can be placed with perfect accuracy on a phylogeny; Wiens, 2003a), although a high proportion of missing data may limit their ability to change relationships among more complete taxa ( Wiens, 2003b).

The Present

I have assumed a future in which the relationships of every living species are well established using molecular data. Although such a future seems likely (especially if rates of genome sequencing and species extinction both continue to accelerate), we are clearly not there yet. Given this, there are a number of compelling reasons to continue to reconstruct phylogenies using morphological data.

First, there are many taxa that are extant but may still be very difficult to include in molecular studies. For example, many reptile and amphibian species are known from a limited number of specimens, are preserved so as to make obtaining molecular data very difficult (i.e., formalin fixation), and may never be collected again (i.e., because of limited distributions, habitat destruction, and other factors). Given present technology, the only way that we may know anything about the relationships of these species is through phylogenetic analysis of morphology. A similar situation may exist in other taxa as well (e.g., insects and plants; Wilson, 1992; Donoghue and Alverson, 2000), and many species remain known from a single specimen that was collected decades ago. Nevertheless, such problems may be largely absent in other groups of organisms, and the technological barriers that presently limit obtaining significant DNA sequence data from some types of preserved specimens may soon be broken.

Second, until we reach the stage where all molecular phylogenies are reconstructed without error, it is still important to have rigorous, morphology-based phylogenies as a “reality check” for molecular results (e.g., Doyle, 1992; Hillis and Wiens, 2000; Jenner, 2004). There are many factors that may cause molecular analyses to reconstruct clades that are both incorrect and statistically well-supported, a possibility not considered by SEA. These factors include long-branch attraction (e.g., Felsenstein, 1978; Huelsenbeck, 1997), deviations between gene and species trees (e.g., Doyle, 1992; Maddison, 1997), and more mundane problems such as contamination and misidentification of specimens. Comparing molecular results to rigorous morphology-based phylogenies can help prevent us from being misled in these cases.

A real-world example illustrates this idea. Sites et al. (1996) presented a phylogeny for iguanid lizards based on mtDNA sequences from the ND4 gene, which conflicted with a tree based on morphology ( de Queiroz, 1987). An important source of incongruence was the position of Cyclura. ND4 data placed Cyclura near the base of the iguanid tree, whereas de Queiroz's data tentatively placed Cyclura with Iguana. A subsequent morphological study, using many additional characters, showed strong support for the Cyclura + Iguana clade ( Hollingsworth, 1998). Analyses based on two additional mtDNA data sets gave conflicting results, with cytochrome b supporting the ND4 tree ( Petren and Case, 1997), and ribosomal sequences (12S, 16S) supporting the Cyclura + Iguana clade (with Sauromalus potentially included as well; Rassmann, 1997). Further analysis and parametric simulations ( Wiens and Hollingsworth, 2000) showed that the basal placement of Cyclura was likely an artifact of long-branch attraction in these two fast-evolving protein-coding genes, associated with an accelerated rate of change in these genes in Cyclura (a seemingly widespread problem in cytochrome b at higher levels of divergence; Meyer, 1994). There are several interesting lessons from this study: (1) the misleading results were concordant between two molecular data sets, and the combined analyses of all three molecular data sets was misled by long-branch attraction; (2) long-branch attraction can be problematic even in well-sampled groups of closely-related species, and use of maximum likelihood and multiple genes does not necessarily ensure estimating the correct topology under these conditions, and (3) analysis of the molecular data alone would seemingly have led to the wrong phylogenetic placement of Cyclura, and the problem was detected primarily because of discordance with a thorough phylogenetic analysis of the morphology.

Of course, problems in individual molecular data sets can also be detected by comparison to other, independently evolving molecular data sets. But there may be cases where all molecular data sets may give the wrong answer (e.g., sequencing a misidentified specimen for many different genes). Furthermore, a typical set of morphological characters should draw on information from many different unlinked genes ( Doyle, 1992; Hillis and Wiens, 2000), whereas the characters in a given molecular data set are often linked and inherited as a single unit (i.e., nucleotide positions in a single gene).

Finally, it is important to note that we are very far from describing all the living species on earth, much less sequencing them. This issue is somewhat distinct from that of using morphology to build trees (the sole focus of the SEA paper, and most other morphology vs. molecules reviews), but is closely related and critically important ( Maddison, 1996). Conservative estimates are that ∼1.5 million species have been described and 5 to 10 million more await description (e.g., Wilson, 1992; May, 2000). With some exceptions, new species are generally discovered, delimited, and described using morphological data. Although species delimitation using morphological data is typically based on diagnostic characters rather than phylogenetic analysis, it can be, and perhaps should be, tree based ( Baum and Donoghue, 1995; Wiens and Penkrot, 2002). In fact, the best methodology for delimiting species using morphological data remains entirely unresolved, and these different methods can give very different species limits for the same morphological data ( Wiens and Penkrot, 2002). In the context of morphology-based taxonomy, phylogenetic analysis of morphological data is also critical for placing new species within a higher taxon, given the obvious shortcomings of using overall morphological similarity to classify species (e.g., Wiley, 1981).

Despite the many advantages of molecular data, it is absolutely critical that systematists continue to be trained in morphological systematics as well, particularly for poorly known groups ( Hillis and Wiens, 2000). If students are trained exclusively in molecular techniques, the next generation of systematists may be incapable of identifying the species in their study groups, and phylogenetic progress in these groups may quickly “grind” to a halt. Quick and accurate identification of species in the field and laboratory based on morphological characters also is critical to many other areas of biology besides systematics (e.g., ecology, behavior, physiology; Maddison, 1996).

Why the Sea Approach is Problematic

The main thesis of the SEA paper is that morphological data are so intrinsically problematic that they should not be used to reconstruct phylogenies. Instead, SEA argue that “unambiguous” morphological characters should be merely mapped onto phylogenies established by molecular data to determine if they add further support to specific nodes. This approach will not succeed. The most unassailable advantage of morphological data is that it allows us to address the phylogeny of fossil taxa and their relationships to living taxa. Yet, their approach will not allow us to address the position of fossil taxa for which molecular data cannot be obtained (i.e., almost all of them). Similarly, the phylogenetic placement of any extant taxa known only from morphological data will remain unresolved. Further, morphological data will not provide rigorous independent corroboration for trees based on molecular data, because no new morphology-based trees will be reconstructed. In fact, it is unclear what meaningful purpose morphological data would serve under their approach.

The SEA approach assumes that, for a given analysis, having more morphological characters is not necessarily better. Their reasoning is basically as follows. (1) Our premolecular knowledge of phylogeny comes mostly from morphological classifications, not morphological phylogenetics. (2) The researchers who built these classifications found the best characters. (3) Any additional characters are likely to be problematic in terms of homology assessment and character coding. (4) Even though adding characters generally increases accuracy in simulation studies, adding characters in morphological phylogenetics will not, because the problematic nature of these characters (e.g., missing data, polymorphism, continuous variation, uncertain homology assessment) will outweigh their potential benefits. SEA are wrong about each of these points.

Morphology-Based Classification = Morphology-Based Phylogeny

Fundamental to the SEA approach is the assumption that prephylogenetic morphology-based classifications are largely equivalent to modern morphology-based phylogeny estimates, and make phylogenetic studies of morphology unnecessary. They (p. 543) took exception to my statement ( Wiens, 2000a) that most of our knowledge of the Tree of Life is based on phylogenetic studies of morphological data. They claimed that most of our knowledge of phylogeny is based on classifications instead, but without any supporting evidence or citations. My statement was based largely on the observation that, in the 1980s and early 1990s, researchers undertook morphology-based phylogenetic analyses of many of the major groups of multi-cellular organisms, including plants (e.g., Mishler and Churchill, 1985; Doyle and Donoghue, 1986), animals ( Brusca and Brusca, 1990), arthropods ( Kristensen, 1981, 1991; Shultz, 1990), fish ( Lauder and Liem, 1983), amphibians ( Duellman and Trueb, 1986; Trueb and Cloutier, 1991), amniotes ( Gauthier et al., 1988), mammals ( Novacek, 1986), turtles ( Gaffney and Meylan, 1988), birds ( Cracraft, 1988), and squamates ( Estes et al., 1988). These morphological studies, and many others at similar or lower taxonomic levels, revealed that many traditionally recognized higher-taxa are not monophyletic (e.g., Class Pisces, Class Reptilia). These studies also supported the monophyly of many traditionally recognized taxa. The important point is that the classifications of these major groups were tested with phylogenetic analyses of morphology, long before the current explosion of DNA sequence studies. Contrary to what SEA assert, not all classifications were handed down untested from pre-Hennigian days.

Adding Morphological Characters: All the Good Ones Are Taken

SEA claim (p. 541) that “… there are few characters that seem to be uncontroversial in relation to homology assessment. These characters typically are identified in traditional classifications and are the first characters to be included in a phylogenetic data set. Increasing the number of characters increases the level of ambiguous or problematic characters.” There is no support for this claim. SEA do cite their own graphs showing the relationship between problematic homology assessment, problematic character coding, and number of characters (their Fig. 1). But these graphs lack any supporting data, and instead merely represent the unsupported opinion of the authors (see also Jenner, 2004). There is no a priori reason to assume that previous workers exhausted all the potentially useful characters. In fact, it may be more logical to assume that traditional workers primarily reported characters that supported their favored hypothesis, and not ones that they considered to be unimportant or misleading. My own experience in morphological phylogenetics (e.g., Wiens, 1993a, 1993b; Reeder and Wiens, 1996; Wiens and Reeder, 1997; Wiens and Penkrot, 2002; Stephens and Wiens, 2003; Wiens and Etheridge, 2003; Wiens et al., 2003) suggests that traditional workers did not report all of the potentially informative morphological characters, and they tended not to synthesize many different types of morphological characters in a single study. Thus, character lists in my own studies often credit many previous authors (but rarely a single author or set of authors) and contain characters not used by any of them. Furthermore, many types of morphological data may not have been available in older studies (e.g., characters from scanning electron microscopy).

Relationships between levels of character sampling (number of characters/taxa) and (a) homoplasy (increasing values of the retention index indicate lower levels of homoplasy), (b) bootstrap support (proportion of clades with bootstrap values >50%), and (c) resolution (proportion of clades resolved in a strict consensus of the shortest trees), based on analyses of 49 morphological data sets for green plants (from data summarized in Sanderson and Donoghue, 1996).

Adding “Problematic” Types of Morphological Characters Will Not Increase Phylogenetic Accuracy

SEA assume that traditional researchers found the best morphological characters, and that characters added subsequently will not increase accuracy because they are somehow problematic or ambiguous. Nonetheless, SEA report the claim made by Poe and Wiens (2000) that there is no justification for excluding characters because of variation within terminal taxa, continuous and quantitative variation, unknown polarity, and assumed levels of homoplasy. SEA disagree with this claim.

What is fundamentally different between the views of SEA and Poe and Wiens (2000)? Clearly, SEA think that adding “ambiguous” characters will not increase phylogenetic accuracy, whereas Poe and Wiens (2000) think that they will. So who is right? In fact, SEA have no support for their statement; their single empirical example (see below) is entirely lacking data on the number of “ambiguous” characters. In contrast, the assertions of Poe and Wiens (2000) were based explicitly on numerous statistical, simulation, and congruence studies that showed that these types of so-called ambiguous characters do contain useful phylogenetic information (i.e., continuous and quantitative variation [ Thiele, 1993, see also Wiens, 2001], polymorphic characters [ Wiens, 1995, 1998a, 1998b, 1999, 2000b; Wiens and Servedio, 1997, 1998], characters with missing data [ Wiens, 1998c]). Furthermore, many of these studies specifically addressed the question of whether accuracy was increased more by excluding these problematic characters or including them ( Wiens and Servedio, 1997; Wiens, 1998a, 1998b, 1998c). The results showed that accuracy was increased more by including these problematic characters than excluding them. In other words, the benefits of increasing the number of characters outweighed the potential costs of including so-called problematic characters. Remarkably, SEA failed to mention any of these results, although they are discussed extensively in the paper by Poe and Wiens (2000) which they quote.

A Single Example Shows That More Characters will Not Increase Accuracy

SEA (p. 542) discuss an example which they believe shows that increasing the number of morphological characters does not increase phylogenetic accuracy. They compare four studies of seed plant phylogeny ( Crane, 1985; Doyle and Donoghue, 1986; Nixon et al., 1994; Doyle, 1996). They claim that despite the generally increasing number of characters in these four studies over time (38, 62 103, 91, respectively) “all analyses lacked bootstrap support >50%” (p. 542). In fact, this claim is false. In the first three studies the authors performed no bootstrap analyses at all. Equating “no bootstrap analysis” with “lacking bootstrap support >50%” is rather misleading, to say the least. In the fourth study ( Doyle, 1996), the author did perform bootstrap analysis, but contrary to the claim of SEA, 15 nodes have bootstrap support >50%, with 5 nodes >90% (Doyle's Fig. 5). Even if their claims were true (i.e., if they performed new bootstrap analyses of three of these data sets or if they intended something different by “all analyses lacked bootstrap support >50%”), SEA made no attempt to make these studies comparable in terms of number of taxa, incompleteness of fossil taxa, or other parameters. Finally, SEA provide no evidence to support their interpretation that the increasing number of characters failed to yield “better” results specifically because more “problematic” characters were being added.

Even if SEA were right about every aspect of their example, a single example does not make a compelling case. Stacked against it are the many statistical, simulation, and congruence studies cited by Poe and Wiens (2000), which show the benefits of including more characters (even so-called problematic ones) in morphological phylogenetic studies. Furthermore, the previously mentioned studies of iguanid lizard phylogeny provide an interesting counterexample to that of SEA. de Queiroz (1987) generated a phylogeny of iguanid genera using ∼95 morphological characters. His matrix was reanalyzed by Sites et al. (1996). When they reduced de Queiroz's matrix to the same taxa as their ND4 data (leaving ∼58 parsimony-informative morphological characters), they found no support for placing Iguana with Cyclura. Hollingsworth (1998) increased the overall number of morphological characters to 142. Many of the added characters were polymorphic, a type of character considered “problematic” by SEA. Hollingsworth found the same generic-level phylogeny as de Queiroz, but resolved two polytomies in de Queiroz's tree. Reducing Hollingsworth's data set to (essentially) the same set of taxa analyzed by Sites et al. (1996) yields 84 parsimony-informative characters and very strong bootstrap support (95%) for the Iguana + Cyclura clade ( Wiens and Hollingsworth, 2000). This example suggests that when the number of taxa is made comparable, increasing the number of morphological characters can increase resolution and bootstrap support, even if many of the added morphological characters are of types considered to be problematic by SEA.

Testing the Assumptions of the Sea Approach

A key assumption of the SEA approach is that “increasing the number of characters increases the level of ambiguous or problematic characters” (p. 541). SEA provided no empirical or theoretical evidence to support this claim. Yet, this claim makes implicit predictions that can be readily tested. If increased numbers of morphological characters only lead to greater homoplasy and decreased accuracy, then phylogenetic analyses based on larger numbers of characters (relative to the number of taxa) should presumably show higher levels of homoplasy, lower levels of resolution, and lower levels of support (e.g., bootstrapping) than studies of morphologically similar organisms based on fewer characters.

Even a crude analysis shows that none of these predictions are supported. Sanderson and Donoghue (1996) summarized levels of homoplasy, resolution, and bootstrap support for 50 morphological data sets for green plants (i.e., organisms with comparable morphology), along with the number of taxa and characters for each data set. I performed simple regression analyses of the relationship between the character/taxon ratio and levels of homoplasy (retention index), resolution (proportion of resolved nodes in a strict consensus of the shortest trees from a parsimony analysis), and branch support (proportion of clades with bootstrap support >50%) for each data set (see Sanderson and Donoghue [1996] for details of how these measures were obtained). Given that “increasing the number of characters” presumably means that the number of taxa is held constant, I used the character/taxon ratio instead of the number of characters alone (and following SEA p. 540). One data set was a clear outlier in terms of the character/taxon ratio and was removed. The results ( Fig. 1) do not support the claims of SEA. There is no significant relationship between the number of characters and levels of homoplasy, but there is a significant positive relationship between the relative number of characters and levels of resolution and bootstrap support. In many ways, these results are hardly surprising, and there are many problematic aspects to the analyses (e.g., studies of different groups are not directly comparable, autapomorphies were not removed, resolution and bootstrap support are related to accuracy but not strictly equivalent). However, the assumptions of SEA clearly are not supported, and they have provided no similar test or supporting data themselves.

A more fundamental assumption that is implicit in the SEA paper is the idea that morphological data are much more homoplasious than molecular data. If morphological data were found to have lesser or similar levels of homoplasy to DNA sequence data, then their claims for greater problems of homology assessment in morphological data would be refuted, or at least rendered irrelevant. This issue has been addressed in empirical surveys of independent phylogenetic analyses of molecular and morphological data (e.g., Sanderson and Donoghue, 1989, 1996; Givnish and Sytsma, 1997). The most recent studies show that morphological data may be (on average) slightly less homoplasious than DNA sequence data ( Sanderson and Donoghue, 1996; based on the retention index in their Table 2) or slightly more ( Givnish and Sytsma, 1997; based on the consistency index, their Fig. 2). Obviously, these studies do not support the assumption that morphological data show much greater levels of homoplasy than DNA sequence data (although this can depend on which genome, gene, or nucleotide position is being considered). Yet, these studies provide a somewhat indirect comparison because levels of homoplasy in different characters in different groups of organisms are typically being compared.

The strongest evidence that one type of data is more homoplasious than the other would come from comparing levels of homoplasy in morphological and molecular data sets obtained for the same taxa. Baker et al. (1998) found that the consistency index of morphological data was higher than that of molecular data in 26 of 31 studies with matched molecular/morphological taxon sampling, indicating lower levels of homoplasy in the morphological data. A paired sign test shows that this difference is highly significant (P = 0.0002). Thus, the empirical results that are most directly relevant to this issue strongly reject the fundamental assumption of the SEA approach (that morphological data are more homoplasious than molecular data), and show that the opposite pattern is actually more common.

Integrating Molecular and Morphological Data

SEA suggest that the best approach for integrating molecular and morphological data is to map a limited number of “unambiguous” morphological characters onto the molecular phylogeny. Yet it is unclear what this approach really tells us about the veracity of the molecular or morphological results. The typical outcome of this exercise seems easy to predict; some morphological characters will be concordant with the molecular tree and some will not (e.g., Fig. 2 of SEA). This is what we might also expect from mapping morphological characters onto a morphological tree, or molecular characters onto a molecular tree (i.e., some characters will be homoplastic, some will not). Without a relevant null hypothesis and an associated statistical test, their character mapping approach seems pointless.

A better approach to integrating molecular and morphological data may be to (1) perform separate analyses to identify areas of strongly supported incongruence between data sets (i.e., areas where combined analysis might be expected to fail; de Queiroz et al., 1995; Huelsenbeck et al., 1996); (2) perform a combined analysis; and (3) consider regions of the comined-data tree that are strongly contested by different data sets to be ambiguously resolved until the source of error is identified, or (if the source is unknown) a majority of independent data sets clearly supports one hypothesis over another. This approach may be advantageous relative to unilateral separate or combined analysis, or even relative to those approaches that focus on testing for overall congruence between data sets to decide whether they are generally “combinable” or not (e.g., Bull et al., 1993; Farris et al., 1995). The reason is simple. For any phylogenetic problem with more than four taxa, trees from different data sets may contain a mixture of concordant, weakly discordant, and strongly discordant clades. Thus, combined analysis and separate analysis may each be favored in different parts of the same tree, almost guaranteeing that both approaches will be suboptimal in some part of the tree if applied globally to a complex phylogeny ( Wiens, 1998d). The approach outlined above (see Wiens and Reeder, 1997; Wiens, 1998d) utilizes the results from combined analysis in those parts of the tree where combined analysis should succeed (i.e., no or weakly supported incongruence) and should treat the combined-data results as ambiguous in those sections of the tree where combined analysis might be expected to fail (i.e., strongly supported incongruence).

SEA state that they consider combined analysis as “another possible solution” (p. 545). However, none of the approaches that they advocate allows for trees to be reconstructed from the morphological data alone. Thus, there is no opportunity for morphological data to help identify potential problems in the molecular results or to provide real corroboration for molecular phylogenies in the form of a rigorous independent phylogenetic analysis.

Improving Morphological Phylogenetics

The main premise underlying the SEA paper is that morphological phylogenetics is so problematic that trees should no longer be reconstructed using morphological data. Although I would readily agree that there are serious problems in the current practice of morphological phylogenetics, we are not as helpless in the face of these problems as suggested by SEA. For example, many of the ambiguities in character analysis that they lament (p. 541) may be solved by simply treating morphological characters as continuous quantitative traits ( Felsenstein, 1988; Wiens, 2001). Furthermore, several approaches can be used to address the accuracy of different methods for selecting, coding, and analyzing morphological characters, including simulations and congruence studies (i.e., comparing how well different methods of analyzing morphological data recover clades that are strongly supported by independent, nonmorphological data sets; Wiens 1998a). My own work on methods for coding polymorphic morphological characters suggests that these different approaches for testing phylogenetic accuracy can give highly concordant results ( Wiens, 1999, 2000b). I think that the 21st century should be an exciting time for morphological phylogenetics as well as molecular phylogenetics, as new methods for collecting and analyzing morphological data are developed (e.g., high-resolution computed tomography scanning; Rowe et al., 1997; new likelihood/Bayesian methods for morphological data; Lewis, 2001) and the performance of morphology-based methods can be more readily tested using the wealth of new molecular data. In contrast, SEA basically advocate giving up on using morphological data to reconstruct phylogenies.

Conclusions

The wealth of new molecular data raises the question: what should be the role of morphological data in phylogeny reconstruction? SEA have suggested that morphological data should not be used to build trees and new morphological data should not be included in phylogeny reconstruction. I have argued that their approach is built on a series of mistaken premises, and is bound to fail. Most importantly, their approach ignores fossil taxa, and they do not consider the importance of morphology in comprehensive sampling of living taxa, in alpha taxonomy, and in identifying problematic molecular results. Although there are many areas where morphological phylogenetics can be improved, the best solution to these problems is not to simply give up on using morphological data to build trees. Instead, many of these problems may be solved through more explicit methodology, development and application of new methods, and rigorous testing of these methods using simulations and congruence studies.

Acknowledgments

For many helpful comments on the manuscript, I thank Tim Collins, Alan de Queiroz, Michael Donoghue, Tag Engstrom, R. Geeta, Lucinda McDade, Steve Poe, Chris Simon, Patrick Stephens, Dan Stoebel, and Chris Wolfe. My research was supported by NSF grant DEB-0129142.

Associate Editor: Tim Collins