首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Yue F  Zhang M  Tang J 《BMC genomics》2008,9(Z2):S15

Background

Because of the advent of high-throughput sequencing and the consequent reduction in the cost of sequencing, many organisms have been completely sequenced and most of their genes identified. It thus has become possible to represent whole genomes as ordered lists of gene identifiers and to study the rearrangement of these entities through computational means. As a result, genome rearrangement data has attracted increasing attentions from both biologists and computer scientists as a new type of data for phylogenetic analysis. The main events of genome rearrangements include inversions, transpositions and transversions. To date, GRAPPA and MGR are the most accurate methods for rearrangement phylogeny, both assuming inversion as the only event. However, due to the complexity of computing transposition distance, it is very difficult to analyze datasets when transpositions are dominant.

Results

We extend GRAPPA to handle transpositions. The new method is named GRAPPA-TP, with two major extensions: a heuristic method to estimate transposition distance, and a new transposition median solver for three genomes. Although GRAPPA-TP uses a greedy approach to compute the transposition distance, it is very accurate when genomes are relatively close. The new GRAPPA-TP is available from http://phylo.cse.sc.edu/.

Conclusion

Our extensive testing using simulated datasets shows that GRAPPA-TP is very accurate in terms of ancestor genome inference and phylogenetic reconstruction. Simulation results also suggest that model match is critical in genome rearrangement analysis: it is not accurate to simulate transpositions with other events including inversions.
  相似文献   

2.

Background  

As genomes evolve after speciation, gene content, coding sequence, gene expression, and splicing all diverge with time from ancestors with close relatives. A minimum evolution general method for continuous character analysis in a phylogenetic perspective is presented that allows for reconstruction of ancestral character states and for measuring along branch evolution.  相似文献   

3.
Phylogenetic position of Nemertea derived from phylogenomic data   总被引:1,自引:0,他引:1  
Nemertea and Platyhelminthes have traditionally been grouped together because they possess a so-called acoelomate organization, but lateral vessels and rhynchocoel of nemerteans have been regarded as coelomic cavities. Additionally, both taxa show spiral cleavage patterns prompting the placement of Nemertea as sister to coelomate Protostomia, that is, either to Neotrochozoa (Mollusca and Annelida) or to Teloblastica (Neotrochozoa plus Arthropoda). Some workers maintain a sister group relationship of Nemertea and Platyhelminthes as Parenchymia because of an assumed homology of G?tte's and Müller's larvae of polyclad Platyhelminthes and the pilidium larvae of heteronemerteans. So far, molecular data were only able to significantly reject a sister group relationship to Teloblastica. Although phylogenomic data are available for Platyhelminthes, Annelida, Mollusca, and Arthropoda, they are lacking for Nemertea. Herein, we present the first analysis specifically addressing nemertean phylogenetic position using phylogenomic data. More specifically, we collected expressed sequence tag data from Lineus viridis (O.F. Müller, 1774) and combined it with available data to produce a data set of 9,377 amino acid positions from 60 ribosomal proteins. Maximum likelihood analyses and Bayesian inferences place Nemertea in a clade together with Annelida and Mollusca. Furthermore, hypothesis testing significantly rejected a sister group relationship to either Platyhelminthes or Teloblastica. The Coelomata hypothesis, which groups coelomate taxa together to the exclusion of acoelomate and pseudocoelomate taxa, is not congruent with our results. Thus, the supposed acoelomate organization evolved independently in Nemertea and Platyhelminthes. In Nemertea, evolution of acoely is most likely due to a secondary reduction of the coelom as it is found in certain species of Mollusca and Annelida. Though looking very similar, the G?tte's and Müller's larvae of polyclad Platyhelminthes are not homologous to the pilidium larvae of heteronemerteans. Finally, the convergent evolution of segmentation in Annelida and Arthropoda is further substantiated.  相似文献   

4.
Cladistic analysis should not be equated with phylogenetic reconstruc Instead it is a means of describing character-state distributions among organisms and in this it resembles phenetic analysis. However, the claim that cladistic methods meet phenetic ('Gilmour-natural') criteria for classification as well as or better than traditional phenetic ones is shown to be based on an inadequate interpretation of these criteria. Instead, a new measure of naturalness is proposed in which the most natural classification is that which describes the distribution of all character states by the smallest number of statements. The possibility of extending this measure to provide a criterion for an optimally simple classification is noted. It is concluded that phylogenetic reconstruction must not only reflect the branching patterns suggested by cladistic analysis but also take account of the tionary history that is reflected in an optimal phenogram.  相似文献   

5.
Phylogenetic reconstruction of carnivore social organizations   总被引:2,自引:0,他引:2  
It is generally assumed that carnivore social organizations evolved directionally from a solitary ancestor into progressively more advanced forms of group living. Although alternative explanations exist, this evolutionary hypothesis has never been tested. Here, I used literature data and maximum likelihood reconstruction on a complete carnivore phylogeny to test this hypothesis against two others: one assuming directional evolution from a non-solitary ancestor, and one assuming parallel evolutions from a socially flexible ancestor, that is, an ancestor with abilities to live in a variety of social organizations. The phylogenetic reconstructions did not support any of the three hypotheses of social evolution at the root of Carnivora. At the family level, however, there was support for a non-solitary and socially flexible ancestor to Canidae, a socially flexible or solitary ancestor to Mustelidae, a solitary or socially flexible ancestor to Mephitidae, a solitary or group living ancestor to Phocidae, a group living ancestor to Otariidae and a solitary ancestor to Ursidae, Felidae, Herpestidae and Viverridae. There was equivocal support for the ancestral state of Procyonidae and Hyaenidae. It is unclear whether the common occurrence of a solitary ancestry at the family level was caused by a solitary ancestor at the root of Carnivora or by multiple transitions into a solitary state. The failure to support a solitary ancestor to Carnivora calls for caution when using this hypothesis in an evolutionary framework, and I suggest continued investigations of the pathways of the evolution of carnivore social organizations.  相似文献   

6.
Lateral gene transfer (LGT) is often seen as a form of noise, obscuring the phylogenetic signal with which we might hope to reconstruct the evolution of a group of organisms, or indeed the history of all life (the Tree of Life). Such reconstruction might still be possible if the subset of genes conserved among all genomes in a group (or common to all genomes) comprise a core that is relatively refractory to LGT. Several papers designed to test this notion have recently appeared, and here we re-analyze one, which claims that the core of single-copy orthologs shared by all sequenced genomes of the gammaproteobacteria is essentially free of LGT. This conclusion is unfortunately premature, and it is very hard to determine what fraction of this core has been affected by LGT. We discuss other difficulties with the core concept and suggest that, although the core idea must remain part of our understanding of phylogenetic relationships, it should not be the sole basis for defining such relationships, because these are not exclusively tree-like. We suggest instead a more complex but more natural framework for classification, which we call the Synthesis of Life.  相似文献   

7.
The qualitative dimension of gene expression data and its heterogeneous nature in cancerous specimens can be accounted for by phylogenetic modeling that incorporates the directionality of altered gene expressions, complex patterns of expressions among a group of specimens, and data-based rather than specimen-based gene linkage. Our phylogenetic modeling approach is a double algorithmic technique that includes polarity assessment that brings out the qualitative value of the data, followed by maximum parsimony analysis that is most suitable for the data heterogeneity of cancer gene expression. We demonstrate that polarity assessment of expression values into derived and ancestral states, via outgroup comparison, reduces experimental noise; reveals dichotomously expressed asynchronous genes; and allows data pooling as well as comparability of intra- and interplatforms. Parsimony phylogenetic analysis of the polarized values produces a multidimensional classification of specimens into clades that reveal shared derived gene expressions (the synapomorphies); provides better assessment of ontogenic pathways and phyletic relatedness of specimens; efficiently utilizes dichotomously expressed genes; produces highly predictive class recognition; illustrates gene linkage and multiple developmental pathways; provides higher concordance between gene lists; and projects the direction of change among specimens. Further implication of this phylogenetic approach is that it may transform microarray into diagnostic, prognostic, and predictive tool.  相似文献   

8.
9.
The phylogenetic position of Dictyostelium inferred from 18S rRNA data contradicts that from protein data. Protein trees always show the close affinity of Dictyostelium with animals, fungi, and plants, whereas in 18S rRNA trees the branching of Dictyostelium is placed at a position before the massive radiation of protist groups including the divergence of the three kingdoms. To settle this controversial issue and to determine the correct position of Dictyostelium, we inferred the phylogenetic relationship among Dictyostelium and the three kingdoms Animalia, Fungi, and Plantae by a maximum-likelihood method using 19 different protein data sets. It was shown at the significance level of 1 SE that the branching of Dictyostelium antedates the divergence of Animalia and Fungi, and Plantae is an outgroup of the Animalia-Fungi-Dictyostelium clade.Correspondence to: T. Miyata  相似文献   

10.
11.
Phylogenetic reconstruction of vertebrate Hox cluster duplications   总被引:6,自引:2,他引:6  
In vertebrates and the cephalochordate, amphioxus, the closest vertebrate relative, Hox genes are linked in a single cluster. Accompanying the emergence of higher vertebrates, the Hox gene cluster duplicated in either a single step or multiple steps, resulting in the four-cluster state present in teleosts and tetrapods. Mammalian Hox clusters (designated A, B, C, and D) extend over 100 kb and are located on four different chromosomes. Reconstructing the history of the duplications and its relation to vertebrate evolution has been problematic due to the lack of alignable sequence information. In this study, the problem was approached by conducting a statistical analysis of sequences from the fibrillar-type collagens (I, II, III, and IV), genes closely linked to each Hox cluster which likely share the same duplication history as the Hox genes. We find statistical support for the hypothesis that the cluster duplication occurred as multiple distinct events and that the four-cluster situation arose by a three- step sequential process.   相似文献   

12.
MOTIVATION: Phylogenetic reconstruction from gene-order data has attracted increasing attention from both biologists and computer scientists over the last few years. Methods used in reconstruction include distance-based methods (such as neighbor-joining), parsimony methods using sequence-based encodings, Bayesian approaches, and direct optimization. The latter, pioneered by Sankoff and extended by us with the software suite GRAPPA, is the most accurate approach, but cannot handle more than about 15 genomes of limited size (e.g. organelles). RESULTS: We report here on our successful efforts to scale up direct optimization through a two-step approach: the first step decomposes the dataset into smaller pieces and runs the direct optimization (GRAPPA) on the smaller pieces, while the second step builds a tree from the results obtained on the smaller pieces. We used the sophisticated disk-covering method (DCM) pioneered by Warnow and her group, suitably modified to take into account the computational limitations of GRAPPA. We find that DCM-GRAPPA scales gracefully to at least 1000 genomes of a few hundred genes each and retains surprisingly high accuracy throughout the range: in our experiments, the topological error rate rarely exceeded a few percent. Thus, reconstruction based on gene-order data can now be accomplished with high accuracy on datasets of significant size.  相似文献   

13.
Sibship reconstruction from genetic data with typing errors   总被引:13,自引:0,他引:13  
Wang J 《Genetics》2004,166(4):1963-1979
Likelihood methods have been developed to partition individuals in a sample into full-sib and half-sib families using genetic marker data without parental information. They invariably make the critical assumption that marker data are free of genotyping errors and mutations and are thus completely reliable in inferring sibships. Unfortunately, however, this assumption is rarely tenable for virtually all kinds of genetic markers in practical use and, if violated, can severely bias sibship estimates as shown by simulations in this article. I propose a new likelihood method with simple and robust models of typing error incorporated into it. Simulations show that the new method can be used to infer full- and half-sibships accurately from marker data with a high error rate and to identify typing errors at each locus in each reconstructed sib family. The new method also improves previous ones by adopting a fresh iterative procedure for updating allele frequencies with reconstructed sibships taken into account, by allowing for the use of parental information, and by using efficient algorithms for calculating the likelihood function and searching for the maximum-likelihood configuration. It is tested extensively on simulated data with a varying number of marker loci, different rates of typing errors, and various sample sizes and family structures and applied to two empirical data sets to demonstrate its usefulness.  相似文献   

14.
Haplotype reconstruction from genotype data using Imperfect Phylogeny   总被引:13,自引:0,他引:13  
Critical to the understanding of the genetic basis for complex diseases is the modeling of human variation. Most of this variation can be characterized by single nucleotide polymorphisms (SNPs) which are mutations at a single nucleotide position. To characterize the genetic variation between different people, we must determine an individual's haplotype or which nucleotide base occurs at each position of these common SNPs for each chromosome. In this paper, we present results for a highly accurate method for haplotype resolution from genotype data. Our method leverages a new insight into the underlying structure of haplotypes that shows that SNPs are organized in highly correlated 'blocks'. In a few recent studies, considerable parts of the human genome were partitioned into blocks, such that the majority of the sequenced genotypes have one of about four common haplotypes in each block. Our method partitions the SNPs into blocks, and for each block, we predict the common haplotypes and each individual's haplotype. We evaluate our method over biological data. Our method predicts the common haplotypes perfectly and has a very low error rate (<2% over the data) when taking into account the predictions for the uncommon haplotypes. Our method is extremely efficient compared with previous methods such as PHASE and HAPLOTYPER. Its efficiency allows us to find the block partition of the haplotypes, to cope with missing data and to work with large datasets. AVAILABILITY: The algorithm is available via a Web server at http://www.calit2.net/compbio/hap/  相似文献   

15.

Background  

Maximum parsimony phylogenetic tree reconstruction from genetic variation data is a fundamental problem in computational genetics with many practical applications in population genetics, whole genome analysis, and the search for genetic predictors of disease. Efficient methods are available for reconstruction of maximum parsimony trees from haplotype data, but such data are difficult to determine directly for autosomal DNA. Data more commonly is available in the form of genotypes, which consist of conflated combinations of pairs of haplotypes from homologous chromosomes. Currently, there are no general algorithms for the direct reconstruction of maximum parsimony phylogenies from genotype data. Hence phylogenetic applications for autosomal data must therefore rely on other methods for first computationally inferring haplotypes from genotypes.  相似文献   

16.
Like many lichen-forming fungi, species of the genus Rhizoplaca have wide geographical distributions, but studies of their genetic variability are limited. The information about the ITS rDNA sequences of three species of Rhizoplaca from Anatolia was generated and aligned with other species from other countries and also with the data belonging to Lecanora species. The examined species were collected from the volcanic rocks of Mount Erciyes which is located in the middle of Anatolia (Turkey). The sequence data aligned with eight other samples of Rhizoplaca and six different species of Lecanora were obtained from GenBank. The results support the concept maintained by Arup and Grube (2000) that Rhizoplaca may not be a genus separate from Lecanora. According to the phylogenetic tree, Rhizoplaca melanopthalma from Turkey with two different samples of R. melanopthalma from Arizona (AF159929, AF159934) and a sample from Austria formed a group under the same branch. R. peltata and R. chrysoleuca samples from Anatolia located in two other branches of the tree formed sister groups with the samples of the same species from different countries. Although R. peltata remained on the same branch with other samples of the same species from other countries it was placed in a different branch within the group. When the three species from Anatolia were considered alone, it was noticed that Rhizoplaca melanopthalma and Rhizoplaca peltata are phylogenetically closer to each other than Rhizoplaca chrysoleuca; the morphological characteristics also support this result.  相似文献   

17.
Phylogenetic signal in AFLP data sets   总被引:9,自引:0,他引:9  
AFLP markers provide a potential source of phylogenetic information for molecular systematic studies. However, there are properties of restriction fragment data that limit phylogenetic interpretation of AFLPs. These are (a) possible nonindependence of fragments, (b) problems of homology assignment of fragments, (c) asymmetry in the probability of losing and gaining fragments, and (d) problems in distinguishing heterozygote from homozygote bands. In the present study, AFLP data sets of Lactuca s.l. were examined for the presence of phylogenetic signal. An indication of this signal was provided by carrying out tree length distribution skewness (g1) tests, permutation tail probability (PTP) tests, and relative apparent synapomorphy analysis (RASA). A measure of the support for internal branches in the optimal parsimony tree (MPT) was made using bootstrap, jackknife, and decay analysis. Finally, the extent of congruence in MPTs for AFLP and internal transcribed spacer (ITS)-1 data sets for the same taxa was made using the partition homogeneity test (PHT) and the Templeton test. These analytical studies suggested the presence of phylogenetic signal in the AFLP data sets, although some incongruence was found between AFLP and ITS MPTs. An extensive literature survey undertaken indicated that authors report a general congruence of AFLP and ITS tree topologies across a wide range of taxonomic groups, suggesting that the present results and conclusions have a general bearing. In these earlier studies and those for Lactuca s.l., AFLP markers have been found to be informative at somewhat lower taxonomic levels than ITS sequences. Tentative estimates are suggested for the levels of ITS sequence divergence over which AFLP profiles are likely to be phylogenetically informative.  相似文献   

18.
The sequence of the second internal transcribed spacer of the ribosomal DNA was determined for the following strongyloid nematodes: Cylicocyclus insignis, Chabertia ovina, Oesophagostomum venulosum, Cloacina communis, Cloacina hydriformis, Labiostrongylus labiostrongylus, Parazoniolaimus collaris, Macropostrongylus macropostrongylus, Macropostrongylus yorkei, Rugopharynx australis, Rugopharynx rosemariae, Macropostrongyloides baylisi, Oesophagostomoides longispicularis and Paramacropostrongylus toraliformis, and compared with published sequences for species of Strongylus and for Hypodontus macropi. The resultant phylogenetic trees supported current hypotheses based on morphological evidence for the separation of the families Strongylidae and Chabertiidae, but did not support the separation of the endemic Australian genera as a distinctive clade within the Chabertiidae. The implications of this finding for the phylogenetic origins of the Australian strongyloids are discussed.  相似文献   

19.
Among the commelinid monocots, phylogenetic relationships involving Juncaceae and Cyperaceae have been difficult to resolve because of parallel and convergent evolution of morphological features. Using comparative sequencing of the chloroplast gene rbcL, hypotheses of relationships between these two families were tested. Sequences from 13 taxa were obtained for this study and analyzed using parsimony with 15 previously published sequences. Results of this analysis suggest that two genera of Juncaceae, Oxychloë and Prionium, are not closely related to the other genera of this family. Further, Cyperaceae appear to be more closely related to Juncaceae than to Poaceae, with which Cyperaceae are sometimes classified. In fact, Cyperaceae appear to be derived from within Juncaceae. The progenitor-derivative relationship of Juncaceae and Cyperaceae suggested by this study reveals an additional example of paraphyletic families which presents a series of taxonomic dilemmas.  相似文献   

20.
A well-supported molecular phylogeny for North American Gryllus species based on a combined data set of mitochondrial (mt) DNA is presented. A total of 26 individuals representing 13 populations of 11 species of the genus Gryllus and 4 individuals of two outgroup species, Teleogryllus oceanicus and Acheta domestica, were sampled in this study. The complete cytochrome b gene (1036 bp) and a 500-bp fragment of the 16S rRNA gene were sequenced for each individual. Since results from separate analyses of the cytochrome b and 16S data sets, as well as a previously published mtDNA restriction-site data set, were not conflicting, all data were combined for phylogenetic analyses. The clade of European Gryllus was clearly separated from the North American clade. The amount of sequence divergence between these clades was significantly greater than within the clades, suggesting a basal drift-vicariant event in the genus. This is the first phylogenetic analysis of North American Gryllus that includes western species. Four well-supported groups were identified but their relationships showed no clear east-west structure. Our phylogeny supports the recent reassignment of G. integer Scudder 1901 from Texas to G. texensis Cade and Otte 2000. The evolution of cricket song and life cycle is discussed using the new phylogenetic framework.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号