首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
We describe two new methods to partition phylogenetic data sets of discrete characters based on pairwise compatibility. The partitioning methods make no assumptions regarding the phylogeny, model of evolution, or characteristics of the data. The methods first build a compatibility graph, in which each node represents a character in the data set. Edges in the compatibility graph may represent strict compatibility of characters or they may be weighted based on a fractional compatibility scoring procedure that measures how close the characters are to being compatible. Given the desired number of partitions, the partitioning methods then seek to cluster the characters with the highest average pairwise compatibility, so that characters in each cluster are more compatible with each other than they are with characters in the other cluster(s). Partitioning according to these criteria is computationally intractable (NP-hard); however, spectral methods can quickly provide high-quality solutions. We demonstrate that the spectral partitioning effectively identifies characters with different evolutionary histories in simulated data sets, and it is better at highlighting phylogenetic conflict within empirical data sets than previously used partitioning methods.  相似文献   

2.
From basepairs to birdsongs: phylogenetic data in the age of genomics   总被引:4,自引:0,他引:4  
Given the quantity of molecular data now available, including complete genomes for some organisms, one can ask whether there is a need for any data beyond complete genomic sequences for phylogenetic analysis. One reason to look beyond the genome is that not all character information is encoded in organismal genomes. We propose a hierarchy of characters that ranges from biologically transmitted but nongenomically encoded characters, such as bird songs, to characters that are genomically encoded. All of these characters can retain historical information and are potentially useful for phylogenetic analysis. In addition, a number of phenotypic levels that are expressions of the genome can be identified. The question whether it is worth including any of these levels if all of the underlying sequence data have been collected arises, since issues of redundancy occur. Utilization of phenotypic levels that are ultimately based on sequences may facilitate reconstructing homologies that are not evident from sequence data alone. We propose the use of simultaneous analysis of sequence data and as many levels of phenotypic characters as possible to take advantage of homology information that may be more easily recovered from the latter. A method that eliminates redundancy to the degree that it can be detected is proposed.  相似文献   

3.
Character analysis in morphological phylogenetics: problems and solutions   总被引:1,自引:0,他引:1  
Many aspects of morphological phylogenetics are controversial in the theoretical systematics literature and yet are often poorly explained and justified in empirical studies. In this paper, I argue that most morphological characters describe variation that is fundamentally quantitative, regardless of whether they are coded qualitatively or quantitatively by systematists. Given this view, three fundamental problems in morphological character analysis (definition, delimitation, and ordering of character states) may have a common solution: coding morphological characters as continuous quantitative traits. A new parsimony method (step-matrix gap-weighting, a modification of Thiele's approach) is proposed that allows quantitative traits to be analyzed as continuous variables. The problem of scaling or weighting quantitative characters relative to qualitative characters (and to each other) is reviewed, and three possible solutions are described. The new coding method is applied to data from hoplocercid lizards, and the results show the sensitivity of phylogenetic conclusions to different scaling methods. Although some authors reject the use of continuous, overlapping, quantitative characters in phylogenetic analysis, quantitative data from hoplocercid lizards that are coded using the new approach contain significant phylogenetic structure and exhibit levels of homoplasy similar to those seen in data that are coded qualitatively.  相似文献   

4.
基于93个形态形状,采用13个被子植物基部类群做为外类群,对49个单子叶植物科级分类阶元进行了分支系统学分析。经过简约性分析,得到了1684棵同等最大简约分支树。严格一致树的分支结构图表明:1)古草本类植物和单子叶植物是姐妹群关系;2)具有网状脉的类群,薯蓣科,菝葜科,百部科是单子叶植物的最基部类群。由于性状状态间存在着较多的平行和逆转进化,这在一定程度上影响了系统发育重建的准确性;所选择的性状状态之间的演化很可能是平行的、多次的或者是特化的状态,因此这样复杂的演化关系的探索关键在于找到一些能确切反映其系统演化关系的形态性状。目前很难通过简约化的形态分支分析来解开整个单子叶植物的起源和演化之谜。为了避开对系统学分析造成干扰的误导性状,形态数据结合DNA序列分析很可能是必需的。  相似文献   

5.
We examined how alignment of internal transcribed spacers of rDNA in fungi and plants changes with increasing genetic distance by successive removal of sequences from each data set followed by realignment and phylogenetic analysis. Increasing genetic distance can negatively affect phylogenetic reconstruction in two ways. First, it may cause errors in the alignment and therefore the homology hypotheses of the sequence characters. Second, it may cause errors in the homology assessments of character states because of multiple hits on individual branches. These two causes of error in phylogenetic inference were distinguished from one another in our analysis. The errors in alignment caused by increasing genetic distance were primarily due to inserting too few gaps and inserting gaps at the wrong positions. Errors in tree resolution, topology, and/or branch-support values were more often caused by multiple hits than by misaligned positions. This suggests that increasing genetic distance negatively affects our primary homology assessments of character states more severely than our primary homology assessments of characters. We suggest that increasing taxon sampling with the aim of subdividing long branches is a strategy for obtaining reliable alignments.  相似文献   

6.
It has been shown that increased character sampling betters the accuracy of phylogenetic reconstructions in the case of molecular data. A recently published analysis of avian higher-level phylogenetics based on 2954 morphological characters now provides an empirical example to test whether this is also true in the case of morphological characters. Several clades are discussed which are supported by multiple analyses of mutually independent molecular data (sequences of nuclear genes on different chromosomes and mitochondrial genes) as well as morphological apomorphies, but did not result from parsimony analysis of the large morphological data set. Incorrect character scorings in that analysis notwithstanding, it is concluded that in the case of morphological data, increased character sampling does not necessarily better the accuracy of a phylogenetic reconstruction. Because morphological characters usually have a strongly varying complexity, many simple and homoplastic characters may overrule fewer ones of greater phylogenetic significance in large data sets, thus producing a low ratio of phylogenetic signal to 'noise' in the data.  相似文献   

7.
The notion that two characters evolve independently is of interest for two reasons. First, theories of biological integration often predict that change in one character requires complementary change in another. Second, character independence is a basic assumption of most phylogenetic inference methods, and dependent characters might confound attempts at phylogenetic inference. Previously proposed tests of correlated character evolution require a model phylogeny and therefore assume that nonphylogenetic correlation has a negligible effect on initial tree construction. This paper develops "tree-free" methods for testing the independence of cladistic characters. These methods can test the character independence model as a hypothesis before phylogeny reconstruction, or can be used simply to test for correlated evolution. We first develop an approach for visualizing suites of correlated characters by using character compatibility. Two characters are compatible if they can be used to construct a tree without homoplasy. The approach is based on the examination of mutual compatibilities between characters. The number of times two characters i and j share compatibility with a third character is calculated, and a pairwise shared compatibility matrix is constructed. From this matrix, an association matrix analogous to a dissimilarity matrix is derived. Eigenvector analyses of this association matrix reveal suites of characters with similar compatibility patterns. A priori character subsets can be tested for significant correlation on these axes. Monte Carlo tests are performed to determine the expected distribution of mutual compatibilities, given various criteria from the original data set. These simulated distributions are then used to test whether the observed amounts of nonphylogenetic correlation in character suites can be attributed to chance alone. We have applied these methods to published morphological data for caecilian amphibians. The analyses corroborate instances of dependent evolution hypothesized by previous workers and also identify novel partitions. Phylogenetic analysis is performed after reducing correlated suites to single characters. The resulting cladogram has greater topological resolution and implies appreciably less change among the remaining characters than does a tree derived from the raw data matrix.  相似文献   

8.
The importance of fossils to phylogenetic reconstruction is well established. However, analyses of fossil data sets are confounded by problems related to the less complete nature of the specimens. Taxa that are incompletely known are problematic because of the uncertainty of their placement within a tree, leading to a proliferation of most-parsimonious solutions and "wild card" behavior. Problematic taxa are commonly deleted based on a priori criteria of completeness. Paradoxically, a taxon's problematic behavior is tree dependent, and levels of completeness are not directly associated with problematic behavior. Exclusion of taxa on the basis of completeness eliminates real character conflict and, by not allowing incomplete taxa to determine tree topology, diminishes the phylogenetic hypothesis. Here, the phylogenetic trunk approach is proposed to allow optimization of taxonomic inclusion and tree stability. The use of this method in an analysis of the Paleozoic Lepospondyli finds a single most-parsimonious tree, or trunk, after the removal of one taxon identified as being problematic. Moreover, the 38 trees found at one additional step from this primary trunk were reduced to 2 by removal of one additional taxon. These trunks are compared with the trees that were found by excluding taxa with various degrees of completeness, and the effects of incomplete taxa are explored with regard to use of the trunk. Correlated characters associated with limblessness are discussed regarding the assumption of character independence; however, inclusion of intermediate taxa is found to be the single best method for breaking down long branches.  相似文献   

9.
There is much debate on the definitions of homoplasy and homology, and on how to spot them among character states used in a phylogenetic analysis. Many advocate what I call a "processual approach," in which information on genetics, development, function, or other criteria help a priori in identifying two character states as homologous or homoplastic. I argue that the processes represented by these criteria are insufficiently known for most organisms and most characters to be reliably used to identify homoplasies and homologies. Instead, while not foolproof, phylogeny should be the ultimate test for homology. Character states are assumed to be homologous a priori because this is falsifiable and because their initial inclusion in the character-state analysis is based on the assumption that they may be phylogenetically informative. If they fall out as symplesiomorphies or synapomorphies in a phylogenetic analysis, their status as homologies remains unfalsified. If they fall out as homoplasies, having evolved independently in more than one clade, their status as homologous is falsified, and a homoplasy is identified. The character-state transformation series, functional morphology, finer levels of morphological comparison, and the distribution and correlation of characters all help to explain the presence of homoplasies in a given phylogeny. Explaining these homoplasies, and not ignoring them as "noise," should be as much a goal of phylogenetic analysis as the production of a phylogeny. Examples from the fossil record of Miocene hominoids are given to illustrate the advantages of a process-informs-pattern-recognition-after-the-fact approach to understanding the evolution of character states.  相似文献   

10.
Indels in DNA sequences frequently affect more than a single nucleotide, creating problems for alignment, character coding and phylogenetic analysis. However, the size and frequency of multiple‐residue indels is not usually tested, and with popular alignment packages their reconstruction is indirectly acheived by reducing the affine (gap extension) cost. We explored the length distribution of indels in intron sequences of the gene Mp20 by modifying the gap opening and gap extension costs. Given a “known” tree for the study group, global homology levels were greatest under low gap cost, with gap extension costs of roughly 0.4‐fold the opening cost. Different approaches to gap coding and weighting suggested that taxonomic congruence was correlated with high frequencies of multiple‐position indels, with a maximum indel length of 2–5 bp and few indels above 15 bp, but also including a proportion of indels > 100 bp. Only a small minority of indels could be reconstructed as single‐position indels. Consequently, tree topologies improved when homologous multinucleotide indels were recoded as binary characters which are otherwise highly homoplastic and weighted characters in single‐position coding. In tree‐generating alignment procedures as implemented in POY, where gap penalty determines the character weight during tree search, the problem of assigning inappropriately high weight to multiple‐residue indels could partly be overcome by setting the extension costs to about 0.4‐fold lower than gap opening costs. We conclude that multiple consecutive gap positions are not independent characters and hence methods for parsimony reconstruction of long indels are required. Finally, we also observed a general lack of correlation between taxonomic and character congruence, demonstrating the difficulties of applying congruence criteria to decide among competing alignments. This highlights the value of recent model‐based alignment procedures which can implement the statistical distributions of indel size classes, and do not rely on potentially circular strategies for optimizing overall congruence. © The Willi Hennig Society 2006.  相似文献   

11.
A phylogenetic analysis can be no better than the characters on which it is based. Just as it is inappropriate to code character states of individual characters as separate presence/absence characters, it is inappropriate to combine independent characters because not all information in the data is being utilized. Composite characters link otherwise discernible states from different characters together to form new character states. There are two related problems with this coding. First, there is a loss of hierarchic information between the reductive and composite characters when unordered states are used. Second, the linking of separate characters that occurs during the construction of composite character states can create putative synapomorphies that were not present in the separate characters. For amino acid characters, the problem may occur whenever more than one position of a codon is variable among the terminals sampled. Groups that are resolved as paraphyletic with reductive coding may be resolved as monophyletic with composite coding. The artificial character states indicated by the amino acid characters are unlikely to be congruent with the true gene tree.  相似文献   

12.
Joyce, W.G. and Sterli J. 2010. Congruence, non‐homology, and the phylogeny of basal turtles.–Acta Zoologica (Stockholm) Modern cladistic analysis is characterized by the assembly of increasingly larger data sets coupled with the use of congruence as the final test of homology. Some critics of this development have recently called for a return to more detailed primary homology analysis while questioning the utility of congruence. This discussion appears to be central to the debate regarding the phylogenetic relationships of basal turtles, as the large data sets developed by us have been criticized recently for utilizing poorly constructed characters and including too many homoplasy‐prone characters. Our analysis of this critique reveals that (1) new information regarding poorly understood taxa has a greater impact on the outcome of turtle phylogenies than the characters under dispute; (2) most current turtle phylogenies differ in taxon sampling, not character sampling, and so it appears illogical to condemn a particular analysis for its character sampling; (3) even evolutionary taxonomists should agree that key characters utilized to resolve basal turtle relationships cannot be thought to be ‘infallible’; (4) whereas various criteria provide positive evidence for homology, only congruence provides positive evidence for non‐homology; and (5) a stalemate between conflicting camps within a congruence frame work is preferable to the ad hoc dismissal of data sets, because authoritative statements are untestable.  相似文献   

13.
In two areas of phylogenetics, contrary predictions have been developed and maintained for character analysis and weighting. With regard to adaptation, many have argued that adaptive characters are poorly suited to phylogenetic analysis because of a propensity for homoplasy, while others have argued that complex adaptive characters should be given high weight because homoplasy in complex characters is unlikely. Similarly, with regard to correlated sets of characters, one point of view is that such sets should be collapsed into a single character-a single piece of phylogenetic evidence. Another point of view is that a suite of correlated characters should be emphasized in phylogenetics, again because recurrence of detailed similarity in the same suite of features is unlikely. In this paper, I discuss the theoretical background of adaptation and functional integration with respect to phylogenetic systematics of primates. Several character examples are reviewed with regard to their functional morphology and phylogenetic signal: postorbital structures, tympanic morphology, fusion of the mandibular symphysis, the tooth comb, strepsirrhine talar morphology, and the prehensile tail. It is clear when considering characters such as these that some characters are synapomorphic of major clades and at the same time functionally important. This appears particularly to be the case when characters are integrated into a complex and maintained as stable configurations. Rather than being simply a problem in character analysis, processes of integration may help to explain the utility of phylogenetically informative characters. On the other hand, the character examples also highlight the difficulty in forming a priori predictions about a character's phylogenetic signal. Explanations of patterns of character evolution are often clade-specific, which does not allow for a simple framework of character selection and/or weighting.  相似文献   

14.
Parsimony methods infer phylogenetic trees by minimizing number of character changes required to explain observed character states. From the perspective of applicability of parsimony methods, it is important to assess whether the characters used to infer phylogeny are likely to provide a correct tree. We introduce a graph theoretical characterization that helps to assess whether given set of characters is appropriate to use with parsimony methods. Given a set of characters and a set of taxa, we construct a network called character overlap graph. We show that the character overlap graph for characters that are appropriate to use in parsimony methods is characterized by significant under-representation of subnetworks known as holes, and provide a validation for this observation. This characterization explains success in constructing evolutionary trees using parsimony method for some characters (e.g., protein domains) and lack of such success for other characters (e.g., introns). In the latter case, the understanding of obstacles to applying parsimony methods in a direct way has lead us to a new approach for detecting inconsistent and/or noisy data. Namely, we introduce the concept of stable characters which is similar but less restrictive than the well known concept of pairwise compatible characters. Application of this approach to introns produces the evolutionary tree consistent with the Coelomata hypothesis.  相似文献   

15.
Transformation Series as an Ideographic Character Concept   总被引:7,自引:0,他引:7  
An ideographic concept of character is indispensable to phylogenetic inference. Hennig proposed that characters be conceptualized as “transformation series”, a proposal that is firmly grounded in evolutionary theory and consistent with the method of inferring transformation events as evidence of phylogenetic propinquity. Nevertheless, that concept is usually overlooked or rejected in favor of others based on similarity. Here we explicate Hennig's definition of character as an ideographic concept in the science of phylogenetic systematics. As transformation series, characters are historical individuals akin to species and clades. As such, the related concept of homology refers to a historical identity relation and is not equivalent to or synonymous with synapomorphy. The distinction between primary and secondary homology is dismissed on the grounds that it conflates the concept of homology with the discovery operations used to detect instances of that concept. Although concern for character dependence is generally valid, it is often misplaced, focusing on functional or developmental correlation (both of which are irrelevant in phylogenetic systematics but may be valid in other fields) instead of the historical/transformational independence relevant to phylogenetic inference. As an ideographic science concerned with concrete objects and events (i.e. individuals), intensionally and extensionally defined properties are inconsistent with the individuation of characters for phylogenetic analysis, the utility of properties being limited to communicating results and facilitating future rounds of testing.  相似文献   

16.
For the establishment of polarity of character transformation prior to phylogenetic analysis, various logical and biological criteria are discussed; some are rejected on grounds of liability to systematic error, circularity or unwarranted assumptions aboutParallel tion is used as a non-polar term, with forward and reverse to indicate polarity. A computer program for the detection of parallel tion is described which takes taxa in groups of four. The characters, with two derived: two primitive states or three derived: one primitive state, are listed according to the distribution of states over the four taxa. To each of the 15 phylogenies there corresponds a compatible pair of character patterns. Discordant 2: 2 patterns are unconditionally incompatible (Le Quesne test failure), discordant 3: 1 patterns are incompatible conditional upon correct scoring of polarity. For any putative phylogeny the concordant and discordant characters are identified. In cases of competing alternatives these character sets have to be weighed against one another. Character weighting is discussed; it is argued that it is the individual character transformation which should be weighted, in each direction separately.  相似文献   

17.
Competing phylogenetic hypotheses have become the rule in modern systematics. While the problem of incongruence between character sets has become extremely acute due to the generation of molecular data, it is by no means specific to molecular and morphological comparisons. The role of the modern systematist is to interpret incongruence between character sets and to come to some conclusion regarding a phylogenetic hypothesis of the organisms in question. Two aspects of congruence analysis are examined using the Drosophilidae as an example. The first includes the quantification of congruence and the types of phylogenetic inference that can be made from such analyses. The second aspect concerns an examination of character evolution in order to identify characters and taxa that might be contributing to incongruence in phylogenetic analysis. © 1994 Wiley-Liss, Inc.  相似文献   

18.
The impact of including insertion/deletion events as phylogenetic characters was explored within North American Psoraleeae (Leguminosae). This comprehensive analysis of the impact of gap character incorporation spanned four different indel coding schemes, gaps coded as missing characters, simple binary characters, multi-state characters, and as a 5th state, across two optimality criteria: maximum parsimony and Bayesian Inference. Two nuclear (ITS and Waxy) and six chloroplast (trnS/G, trnL/F, trnK, matK, trnD/T, and rpoB-trnC) DNA regions were sequenced from 43 species of North American Psoraleeae as the foundation of the study. Our results suggest that gaps can provide a substantial percentage of informative characters and can increase phylogenetic resolution and nodal branch support. Phylogenetic signal within indels was higher in chloroplast regions relative to nuclear regions, demonstrating their inclusion as especially important in chloroplast-based phylogenetic studies. Phylogenetic analysis of generic relationships within Psoraleeae is largely congruent with that proposed by Grimes (1990) with a few exceptions. New World species are supported as a monophyletic group. Our analyses suggest that Otholobium may need to be split into two genera and that Psoralidium is polyphyletic and will require movement of Psoralidium tenuiflorum to Pediomelum.  相似文献   

19.
Abstract— There has been little formal discussion concerning character analysis in cladistics, even though characters and their character state trees are central to phylogenetic analyses. We refer to this field as Evolutionary Character Analysis. This paper defines the components of evolutionary character analysis: character state trees, transmodal characters, cladogram characters, attribute and character phylogenies; and the use of these components in phylogenetic inference and evolutionary studies. Character state trees and their effect on cladogram construction are discussed. A new method for numerically coding complex character state trees is described that further reduces the number of variables required to describe them. This method, ordinal coding, reduces the size of data matrices, and facilitates retrieval of state codes. This paper advocates the use of both biological evidence and evidence internal to the cladogram itself to construct character state trees (CSTs). We discuss general models of character evolution (morphocline analysis, Fitch minimum mutation model, etc.) and their role in forming CSTs. Character state trees formed with theories of character evolution are referred to as transmodal characters. These transmodal characters are contrasted with cladogram characters (Mickevich, 1982), and the place of each in a phylogenetic analysis is discussed. The method for determining cladogram characters is detailed with more complicated examples than found in previous publications. We advocate testing transmodal characters by comparing them with the resultant cladogram characters. This comparison involves transformation series analysis (TSA; Mickevich, 1982) which is viewed as an extension of reciprocal illumination. The TSA procedure and its place in hypothesis testing are reviewed. Tracing the evolution of characters interests both systematists and non-systematists alike. When character state trees (transmodal characters) are optimized on pre-existing phylogenies, character phylogenies and attribute phylogenies result. Attributes are defined as a feature that may or may not be homologous (i.e., ecological categories, plant hosts, etc.). We provide two illustrations of this approach, one involving the evolution of the anuran ear and another involving the coevolution of the butterfly Heliconius and its hostplants. Finally, the components of phylogenetic character analysis can be used to test more general evolutionary theories such as the biogenetic law and vicariance biogeography.  相似文献   

20.
Character construction, the methods by which characters and character states are produced from observations of variation, is a crucial but poorly understood step in phylogenetic analysis. Alternative approaches are used in practice, but there has been relatively little investigation of their theoretical bases and analytical consequences. We reviewed three published numerical analyses of the phylogenetic relationships within the Triassic Aetosauria. Combined data from these studies were used to explore the impact of alternative approaches to character construction. Some previous aetosaurian characters represent parallel variations in the morphology of osteoderms from different body regions, and their independence is questionable, leading us to propose more composite alternative constructions. Phylogenetic analyses revealed that inferred relationships within the Aetosauria are in general poorly resolved and weakly supported by the available data and are sensitive to alternative approaches to character construction. Thus, the results from this and previous studies should not, for the most part, be accepted as robust hypotheses of aetosaurian interrelationships. The treatment of systems of intraorganismal (e.g. serial, antimeric) homologues, such as osteoderms, in character construction is discussed. Applied to parallel variations in systems of intraorganismal homologues, previous advice on choosing among alternative character constructions and Hennig's auxiliary principle agree in favoring a more composite approach, in accordance with common practice.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号