Similar articles
1.
Phylogenies involving nonmodel species are based on a few genes, mostly chosen following historical or practical criteria. Because gene trees are sometimes incongruent with species trees, the resulting phylogenies may not accurately reflect the evolutionary relationships among species. The increase in availability of genome sequences now provides large numbers of genes that could be used for building phylogenies. However, for practical reasons only a few genes can be sequenced for a wide range of species. Here we asked whether we can identify a few genes, among the single-copy genes common to most fungal genomes, that are sufficient for recovering accurate and well-supported phylogenies. Fungi represent a model group for phylogenomics because many complete fungal genomes are available. An automated procedure was developed to extract single-copy orthologous genes from complete fungal genomes using a Markov Clustering Algorithm (Tribe-MCL). Using 21 complete, publicly available fungal genomes with reliable protein predictions, 246 single-copy orthologous gene clusters were identified. We inferred the maximum likelihood trees using the individual orthologous sequences and constructed a reference tree from concatenated protein alignments. The topologies of the individual gene trees were compared to that of the reference tree using three different methods. The performance of individual genes in recovering the reference tree was highly variable. Gene size and the number of variable sites were highly correlated and significantly affected the performance of the genes, but the average substitution rate did not. Two genes recovered exactly the same topology as the reference tree, and when concatenated provided high bootstrap values. The genes typically used for fungal phylogenies did not perform well, which suggests that current fungal phylogenies based on these genes may not accurately reflect the evolutionary relationships among species. 
Analyses on subsets of species showed that the phylogenetic performance did not seem to depend strongly on the sample. We expect that the best-performing genes identified here will be very useful for phylogenetic studies of fungi, at least at a large taxonomic scale. Furthermore, we compare the method developed here for finding genes for building robust phylogenies with previous ones and we advocate that our method could be applied to other groups of organisms when more complete genomes are available.
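The abstract above compares individual gene tree topologies with a reference tree but does not name the three comparison methods in this excerpt. As one illustration only, the sketch below computes the Robinson-Foulds (symmetric difference) distance, a common topology measure, between two unrooted trees. The nested-tuple tree encoding and the names `leaf_set`, `splits`, and `rf_distance` are assumptions made for this example, not taken from the paper.

```python
def leaf_set(tree):
    """Return the frozenset of leaf names under a nested-tuple tree."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaf_set(child) for child in tree))


def splits(tree):
    """Collect the non-trivial bipartitions (splits) induced by internal edges."""
    all_leaves = leaf_set(tree)
    anchor = min(all_leaves)  # fixed taxon used to pick one side of each split
    found = set()

    def walk(node):
        if isinstance(node, str):
            return
        clade = leaf_set(node)
        # Keep the side that does not contain the anchor taxon, so the same
        # split is represented identically however the tree is drawn.
        side = all_leaves - clade if anchor in clade else clade
        if 1 < len(side) < len(all_leaves) - 1:
            found.add(side)
        for child in node:
            walk(child)

    walk(tree)
    return found


def rf_distance(tree_a, tree_b):
    """Number of splits present in exactly one of the two trees."""
    if leaf_set(tree_a) != leaf_set(tree_b):
        raise ValueError("trees must share the same taxa")
    return len(splits(tree_a) ^ splits(tree_b))


# Hypothetical five-taxon trees, for illustration only.
reference = ((("A", "B"), "C"), ("D", "E"))
gene_same = (("D", "E"), ("C", ("B", "A")))   # same topology, drawn differently
gene_diff = ((("A", "C"), "B"), ("D", "E"))   # B and C swapped

print(rf_distance(reference, gene_same))  # 0
print(rf_distance(reference, gene_diff))  # 2
```

A gene whose tree has distance 0 to the reference recovers the reference topology exactly, which is the sense in which the abstract reports that two genes matched the reference tree.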

2.
A detailed phylogenetic analysis of tetraspanins from 10 fully sequenced metazoan genomes and several fungal and protist genomes gives insight into their evolutionary origins and organization. Our analysis suggests that the superfamily can be divided into four large families. These four families (the CD family, CD63 family, uroplakin family, and RDS family) are further classified as consisting of several ortholog groups. The clustering of several ortholog groups together, such as the CD9/Tsp2/CD81 cluster, suggests functional relatedness of those ortholog groups. The fact that our studies are based on whole genome analysis enabled us to estimate not only the phylogenetic relationships among the tetraspanins, but also the first appearance in the tree of life of certain tetraspanin ortholog groups. Taken together, our data suggest that the tetraspanins are derived from a single (or a few) ancestral gene(s) through sequence divergence, rather than convergence, and that the majority of tetraspanins found in the human genome are vertebrate (21 instances), tetrapod (4 instances), or mammalian (6 instances) inventions.

3.
Phenotypic behavior of a group of organisms can be studied using a range of molecular evolutionary tools that help to determine evolutionary relationships. Traditionally, a gene or a set of gene sequences was used for generating phylogenetic trees. Incomplete evolutionary information in a few selected genes causes problems in phylogenetic tree construction. Whole genomes are used as a remedy. The task now is to identify suitable parameters for extracting the hidden information in whole genome sequences that truly represents evolutionary information. In this study we explored a random anchor (a stretch of 100 nucleotides) based approach (ABWGP) for finding the distance between any two genomes, and used the distance estimates to compute evolutionary trees. A number of strains and species of Mycobacteria were used for this study. Anchor-derived parameters, such as cumulative normalized score, anchor order, and indels, were computed in a pairwise manner, and the scores were used to compute distance/phylogenetic trees. The strength of branching was determined by bootstrap analysis. The terminal branches are clearly discernible using the distance estimates described here. In general, different measures gave similar trees, except for the trees based on indels. Overall, the tree topology reflected the known biology of the organisms. This was also true for different strains of Escherichia coli. A new whole genome-based approach has been described here for studying evolutionary relationships among bacterial strains and species.

5.
Use of whole genome sequence data to infer baculovirus phylogeny
Several phylogenetic methods based on whole genome sequence data were evaluated using data from nine complete baculovirus genomes. The utility of three independent character sets was assessed. The first data set comprised the sequences of the 63 genes common to these viruses. The second set of characters was based on gene order, and phylogenies were inferred using both breakpoint distance analysis and a novel method developed here, termed neighbor pair analysis. The third set recorded gene content by scoring gene presence or absence in each genome. All three data sets yielded phylogenies supporting the separation of the Nucleopolyhedrovirus (NPV) and Granulovirus (GV) genera, the division of the NPVs into groups I and II, and species relationships within group I NPVs. Generation of phylogenies based on the combined sequences of all 63 shared genes proved to be the most effective approach to resolving the relationships among the group II NPVs and the GVs. The history of gene acquisitions and losses that have accompanied baculovirus diversification was visualized by mapping the gene content data onto the phylogenetic tree. This analysis highlighted the fluid nature of baculovirus genomes, with evidence of frequent genome rearrangements and multiple gene content changes during their evolution. Of more than 416 genes identified in the genomes analyzed, only 63 are present in all nine genomes, and 200 genes are found only in a single genome. Despite this fluidity, the whole genome-based methods we describe are sufficiently powerful to recover the underlying phylogeny of the viruses.
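Two of the character sets described above, gene order and gene content, reduce to simple pairwise distances. The sketch below is a minimal illustration under stated assumptions: gene orders are treated as circular and unsigned (gene orientation is ignored), and gene content is compared with a Jaccard distance, which is a stand-in chosen for this example and not necessarily the scoring used in the paper. The paper's neighbor pair analysis is not reproduced here. All function and gene names are hypothetical.

```python
def adjacencies(order):
    """Unordered neighbouring gene pairs of a circular gene order."""
    n = len(order)
    return {frozenset((order[i], order[(i + 1) % n])) for i in range(n)}


def breakpoint_distance(order_a, order_b):
    """Count adjacencies of genome A that are not adjacent in genome B."""
    if set(order_a) != set(order_b):
        raise ValueError("both genomes must contain the same shared genes")
    return len(adjacencies(order_a) - adjacencies(order_b))


def gene_content_distance(genes_a, genes_b):
    """Jaccard distance between two gene sets (presence/absence data)."""
    a, b = set(genes_a), set(genes_b)
    if not a | b:
        return 0.0
    return 1 - len(a & b) / len(a | b)


# Hypothetical gene orders restricted to the shared genes.
genome_1 = ["g1", "g2", "g3", "g4", "g5"]
genome_2 = ["g1", "g2", "g4", "g3", "g5"]   # g3 and g4 swapped

print(breakpoint_distance(genome_1, genome_2))                      # 2
print(gene_content_distance({"a", "b", "c"}, {"b", "c", "d"}))      # 0.5
```

Breakpoint distance is only defined on the genes two genomes share, which is why the check on gene sets comes first; with only 63 of more than 416 genes present in all nine genomes, that restriction discards most of each genome.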

6.
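The complete chloroplast genomes of Ardisia crenata and Ardisia crenata var. bicolor were sequenced on the Illumina platform, and bioinformatic methods were used to compare their structural features and degree of variation. The aims were to characterize the chloroplast genomes of the two taxa and the differences between them, to compare them with the complete chloroplast genomes of other species in the same family, and to determine their phylogenetic position within the genus Ardisia. The results showed that both chloroplast genomes have a circular quadripartite structure consisting of a large single-copy region (LSC), a small single-copy region (SSC), and a pair of inverted repeats (IRa/IRb), with 132 annotated genes. The types and distribution patterns of repeat sequences were similar, but their numbers differed. The coding regions of psbA, matK, rpoC2, ropB, ndhK, accD, ndhF, ndhD, ndhH, ycf1, and other genes differed between the two, providing new loci for the molecular identification of A. crenata. The chloroplast genomes of the two taxa were highly conserved, with no rearrangements or inversions between them; sequence variation was lowest in the IR regions and highest in the SSC region. Phylogenetic analysis showed that Myrsinaceae and Primulaceae formed two separate branches, that A. crenata and A. crenata var. bicolor were placed in Myrsinaceae, and that the two were most closely related to each other, providing a molecular-level explanation for treating var. bicolor as a variety of A. crenata. This study resolved the chloroplast genome structure of A. crenata and its variety bicolor, examined intergeneric phylogenetic relationships within Myrsinaceae, and lays a foundation for research on the classification and identification, phylogeny, and resource development and utilization of medicinal plants in Myrsinaceae.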

7.
With the development of genome sequencing, more whole genomes of microorganisms have been completed, and many methods have been introduced to reconstruct the phylogenetic tree of those microorganisms with information extracted from the whole genomes, through various ways of transforming or mapping the whole genome sequences into other forms that can describe evolutionary distance in a new way. We think it possible that information is buried in the whole genome and transferred along lineages, which remains stable and is more essential than sequence conservation of individual genes or the arrangement of some genes of a selected set. We need to find one measurement that can involve as many phylogenetic features as possible that are beyond the genome sequence itself. We converted each genome sequence of the microorganisms into another linear sequence to represent the functional structure of the sequence, used a new information function to calculate the discrepancy between sequences and obtain a distance matrix of the genomes, and built a phylogenetic tree with a neighbor-joining method. The resulting tree shows that the major lineages are consistent with the result based on their 16S rRNA sequences. Our method discovered one phylogenetic feature, derived from the genome sequences and the encoded genes, that can rebuild the phylogenetic tree correctly. The mapping of a genome sequence to its new form, representing the relative positions of the functional genes, provides a new way to measure phylogenetic relationships, and with more specific classification of gene functions the result could be more sensitive.

8.
The kingdom of fungi provides model organisms for biotechnology, cell biology, genetics, and life sciences in general. Only when their phylogenetic relationships are stably resolved can individual results from fungal research be integrated into a holistic picture of biology. However, and despite recent progress, many deep relationships within the fungi remain unclear. Here, we present the first phylogenomic study of an entire eukaryotic kingdom that uses a consistency criterion to strengthen phylogenetic conclusions. We reason that branches (splits) recovered with independent data and different tree reconstruction methods are likely to reflect true evolutionary relationships. Two complementary phylogenomic data sets based on 99 fungal genomes and 109 fungal expressed sequence tag (EST) sets analyzed with four different tree reconstruction methods shed light from different angles on the fungal tree of life. Eleven additional data sets address specifically the phylogenetic position of Blastocladiomycota, Ustilaginomycotina, and Dothideomycetes, respectively. The combined evidence from the resulting trees supports the deep-level stability of the fungal groups toward a comprehensive natural system of the fungi. In addition, our analysis reveals methodologically interesting aspects. Enrichment for EST encoded data, a common practice in phylogenomic analyses, introduces a strong bias toward slowly evolving and functionally correlated genes. Consequently, the generalization of phylogenomic data sets as collections of randomly selected genes cannot be taken for granted. A thorough characterization of the data to assess possible influences on the tree reconstruction should therefore become a standard in phylogenomic analyses.

9.
A genome space is a moduli space of genomes. In this space, each point corresponds to a genome. The natural distance between two points in the genome space reflects the biological distance between these two genomes. Currently, there is no method to represent genomes by a point in a space without losing biological information. Here, we propose a new graphical representation for DNA sequences. The breakthrough of the subject is that we can construct the moment vectors from DNA sequences using this new graphical method and prove that the correspondence between moment vectors and DNA sequences is one-to-one. Using these moment vectors, we have constructed a novel genome space as a subspace in R^N. It allows us to show that the SARS-CoV is most closely related to a coronavirus from the palm civet, not from a bird as initially suspected, and that the newly discovered human coronavirus HCoV-HKU1 is more closely related to SARS than to any other known member of group 2 coronavirus. Furthermore, we reconstructed the phylogenetic tree for 34 lentiviruses (including human immunodeficiency virus) based on their whole genome sequences. Our genome space will provide a new powerful tool for analyzing the classification of genomes and their phylogenetic relationships.

10.
Completely sequenced chloroplast genomes have provided much information on the origin and evolution of this organelle. In this paper we attempt to use these sequences to test a novel approach for phylogenetic analysis of complete genomes based on correlation analysis of compositional vectors. All protein sequences from 21 complete chloroplast genomes are analyzed in comparison with selected archaea, eubacteria, and eukaryotes. The distance-based analysis shows that the chloroplast genomes are most closely related to cyanobacteria, consistent with the endosymbiotic origin of chloroplasts. The chloroplast genomes are separated into two major clades corresponding to chlorophytes (green plants) s.l. and rhodophytes (red algae) s.l. The interrelationships among the chloroplasts are largely in agreement with the current understanding of chloroplast evolution. For instance, the analysis places the chloroplasts of two chromophytes (Guillardia and Odontella) within the rhodophyte lineage, supporting secondary endosymbiosis as the source of these chloroplasts. The relationships among the green algae and land plants in our tree also agree with results from traditional phylogenetic analyses. Thus, this study establishes the value of our simple correlation analysis in elucidating the evolutionary relationships among genomes. It is hoped that this approach will provide insights into comparative genome analysis.
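The alignment-free idea in the abstract above can be sketched as follows: each genome's proteins are summarized as a vector of short-peptide frequencies, and two genomes are compared through the correlation (cosine) of their vectors. This is a simplified illustration, not the paper's exact method. In particular, the peptide length `k=3`, the omission of any background-frequency correction, and the distance formula `(1 - C) / 2` are assumptions made for this example.

```python
from collections import Counter
from math import sqrt


def composition_vector(proteins, k=3):
    """Relative frequencies of overlapping k-peptides across a set of proteins."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no peptide of length k found in the input")
    return {kmer: c / total for kmer, c in counts.items()}


def correlation(u, v):
    """Cosine of the angle between two sparse composition vectors."""
    dot = sum(u[kmer] * v[kmer] for kmer in u.keys() & v.keys())
    norm_u = sqrt(sum(x * x for x in u.values()))
    norm_v = sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v)


def cv_distance(u, v):
    """Map the correlation onto a distance: 0 for identical, 0.5 for orthogonal."""
    return (1 - correlation(u, v)) / 2


# Toy protein sets standing in for whole proteomes.
genome_a = composition_vector(["MKVLAAGIV", "MKVLA"])
genome_b = composition_vector(["WWWWWW"])

print(cv_distance(genome_a, genome_b))  # 0.5, no shared 3-peptides
```

A matrix of such pairwise distances can then be passed to any distance-based tree builder, which is what makes the approach usable on complete genomes without a multiple sequence alignment.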

11.
Traditional phylogenetic analysis is based on multiple sequence alignment. With the development of worldwide genome sequencing projects, more and more completely sequenced genomes have become available. However, traditional sequence alignment tools cannot handle large-scale genome sequences. The development of new algorithms that infer phylogenetic relationships from whole genome information without alignment therefore represents a new direction for phylogenetic study in the post-genome era. In the present study, a novel algorithm based on BBC (base-base correlation) is proposed to analyze the phylogenetic relationships of HEV (hepatitis E virus). When 48 HEV genome sequences were analyzed, the phylogenetic tree constructed with the BBC algorithm was consistent with that of a previous study. Compared with sequence alignment methods, the BBC algorithm calculates evolutionary distances between whole genome sequences more rapidly and requires no human intervention, such as gene identification or parameter selection. The BBC algorithm can serve as an alternative for rapidly constructing phylogenetic trees and inferring evolutionary relationships.

12.

Background

Genome-level analyses have enhanced our view of phylogenetics in many areas of the tree of life. With the production of whole genome DNA sequences of hundreds of organisms and large-scale EST databases, a large number of candidate genes for inclusion in phylogenetic analysis have become available. In this work, we exploit the burgeoning genomic data being generated for plant genomes to address one of the more important plant phylogenetic questions, concerning the hierarchical relationships of the several major seed plant lineages (angiosperms, Cycadales, Ginkgoales, Gnetales, and Coniferales), which continues to be a work in progress despite numerous studies using single, few or several genes and morphology datasets. Although most recent studies support the notion that gymnosperms and angiosperms are monophyletic and sister groups, they differ on the topological arrangements within each major group.

Methodology

We exploited the EST database to construct a supermatrix of DNA sequences (over 1,200 concatenated orthologous gene partitions for 17 taxa) to examine non-flowering seed plant relationships. This analysis employed programs that offer rapid and robust orthology determination of novel, short sequences from plant ESTs based on reference seed plant genomes. Our phylogenetic analysis retrieved an unbiased (with respect to gene choice), well-resolved and highly supported phylogenetic hypothesis that was robust to various outgroup combinations.

Conclusions

We evaluated character support and the relative contribution of numerous variables (e.g. gene number, missing data, partitioning schemes, taxon sampling and outgroup choice) on tree topology, stability and support metrics. Our results indicate that while missing characters and order of addition of genes to an analysis do not influence branch support, inadequate taxon sampling and limited choice of outgroup(s) can lead to spurious inference of phylogeny when dealing with phylogenomic scale data sets. As expected, support and resolution increase significantly as more informative characters are added, until reaching a threshold beyond which support metrics stabilize and the effect of adding conflicting characters is minimized.

13.
Complete nucleotide sequences have been determined for atp6 from Suillus luteus and cox3 from Suillus sinuspaulianus (Boletales, Hymenomycetes, Basidiomycota), which code for ATPase subunit 6 and cytochrome oxidase subunit 3, respectively. These sequences were used to design PCR primers for the amplification of partial atp6 and cox3 sequences from other members of the Boletales and outgroup taxa. In atp6 and cox3 from Russula rosacea, one of the outgroup taxa, we observed a number of in-frame TGA(trp) codons, which imply a Neurospora crassa-type mitochondrial code in R. rosacea and possibly in basidiomycetes in general. Interestingly, however, most basidiomycetes other than R. rosacea appear to strongly prefer the TGG(trp) codon, which is unusual, given the strong A + T bias in fungal mitochondrial genomes. Pairwise comparisons were performed between atp6 sequences from increasingly divergent fungal lineages, and results show that all three codon positions become saturated in substitutions after an estimated divergence time of approx 300 Ma. This means that atp6 is likely to provide phylogenetic resolution within fungal classes but not at higher taxonomic levels. Also, because of the strong A + T bias in fungal mitochondrial genomes, A/T transversions were found to be more common than any other type of substitution, resulting in transversions being about two to three times more common in most pairwise sequence comparisons. Finally, atp6 sequences were used to infer phylogenetic relationships between 27 taxa from the Boletales and 4 outgroup taxa. Analyses were performed (i) on nucleotide sequence data using parsimony (successive approximation) as well as maximum likelihood methods and (ii) on deduced amino acid sequences using distance methods based on empirical substitution probabilities. Results from the various analyses are largely concordant with each other as well as with prior analyses of partial mitochondrial large-subunit rDNA (mtLSU rDNA). 
Analysis of the combined atp6 and mtLSU rDNA sequences results in increased bootstrap support for several key branches. Relationships that have been resolved for the first time in the current analysis are discussed.

14.
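To reconstruct phylogenetic relationships within the genus Comastoma and to clarify the species relationships between Comastoma polycladum and its close relatives, this study used the Illumina high-throughput sequencing platform to perform paired-end sequencing of 12 chloroplast genomes, yielding a large number of high-quality clean reads for subsequent bioinformatic analysis. The results showed the following. (1) Genome size differed little among Comastoma species, at about 150 kb in each case; the total number of genes was 131, of which 81 were coding genes. Nucleotide diversity was lower in the IR regions than in the SC regions, and coding regions were more conserved than non-coding regions. (2) Evolutionary analysis showed that almost all coding genes were under purifying selection. (3) Codon usage bias analysis showed that 35 codons had RSCU values greater than 1, indicating that these codons are used more frequently; the various measures of codon usage bias indicated that codon bias in Comastoma species is weak. (4) Phylogenetic analysis showed that trees built from the CDS, codon-position, and intergenic spacer data sets had highly consistent topologies, with high support for most branches. These results indicate that the chloroplast genomes of C. polycladum and its close relatives show no obvious differences and that samples do not cluster by species in the phylogenetic tree; they also provide a scientific basis for subsequent population genetic studies within Comastoma.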
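The RSCU (relative synonymous codon usage) values reported above follow a standard definition: a codon's observed count divided by the count expected if all synonymous codons for that amino acid were used equally. The sketch below computes it for a set of coding sequences. It assumes the standard codon assignments (which the plastid code shares) and excludes stop codons from the calculation; the function name `rscu` and the toy input are illustrative and do not come from the study.

```python
from collections import Counter, defaultdict

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def rscu(cds_sequences):
    """Return {codon: RSCU} for every amino acid observed in the input CDSs."""
    counts = Counter()
    for cds in cds_sequences:
        cds = cds.upper().replace("U", "T")
        # Read in frame from the first base, ignoring a trailing partial codon.
        for i in range(0, len(cds) - len(cds) % 3, 3):
            codon = cds[i:i + 3]
            if codon in CODON_TABLE:  # skips codons with ambiguous bases
                counts[codon] += 1

    # Group sense codons into synonymous families, one per amino acid.
    families = defaultdict(list)
    for codon, amino_acid in CODON_TABLE.items():
        if amino_acid != "*":
            families[amino_acid].append(codon)

    values = {}
    for codons in families.values():
        total = sum(counts[c] for c in codons)
        if total == 0:
            continue  # amino acid absent from the data
        expected = total / len(codons)  # count under equal synonymous usage
        for c in codons:
            values[c] = counts[c] / expected
    return values


# Toy CDS: alanine encoded three times by GCT and once by GCC.
example = rscu(["GCTGCTGCTGCC"])
print(example["GCT"], example["GCC"], example["GCA"])  # 3.0 1.0 0.0
```

An RSCU above 1 means a codon is used more often than equal usage would predict, which is how the abstract's count of 35 such codons should be read; single-codon amino acids (Met, Trp) always score exactly 1.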

15.
The genome sequences of Phaeodactylum tricornutum, Thalassiosira pseudonana, and Cyanidioschyzon merolae have provided significant evidence for the secondary endosymbiosis of diatoms in regard to the genome. Yet little about their relationships in regard to gene regulation pattern, such as microRNA (miRNA), has been reported. Using a homology search based on genomic sequences, 13, 3, and 7 predicted miRNA genes were found in genomes from P. tricornutum, T. pseudonana, and C. merolae, respectively. Of the 23 miRNA genes, 18 had homology with animal miRNAs, implying that they are ancestral miRNAs. A phylogenetic tree based on common miRNA families shared by these three unicellular algae, higher plants, and animals showed that P. tricornutum shared most miRNAs with animals. The phylogenetic tree also showed that C. merolae shared more miRNAs with plants than did the two diatoms, and the majority of its miRNAs were shared with the two diatoms. Our results were consistent with diatoms originating from a secondary endosymbiosis.

16.
Over 3000 microbial (bacterial and archaeal) genomes have been made publicly available to date, providing an unprecedented opportunity to examine evolutionary genomic trends and offering valuable reference data for a variety of other studies such as metagenomics. The utility of these genome sequences is greatly enhanced when we have an understanding of how they are phylogenetically related to each other. Therefore, we here describe our efforts to reconstruct the phylogeny of all available bacterial and archaeal genomes. We identified 24 single-copy, ubiquitous genes suitable for this phylogenetic analysis. We used two approaches to combine the data for the 24 genes. First, we concatenated alignments of all genes into a single alignment from which a Maximum Likelihood (ML) tree was inferred using RAxML. Second, we used a relatively new approach to combining gene data, Bayesian Concordance Analysis (BCA), as implemented in the BUCKy software, in which the results of 24 single-gene phylogenetic analyses are used to generate a “primary concordance” tree. A comparison of the concatenated ML tree and the primary concordance (BUCKy) tree reveals that the two approaches give similar results, relative to a phylogenetic tree inferred from the 16S rRNA gene. After comparing the results and the methods used, we conclude that the current best approach for generating a single phylogenetic tree, suitable for use as a reference phylogeny for comparative analyses, is to perform a maximum likelihood analysis of a concatenated alignment of conserved, single-copy genes.
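The concatenation step described above, joining per-gene alignments into one alignment before tree inference, can be sketched as below. This is a minimal illustration, not the authors' pipeline: alignments are assumed to be dictionaries mapping taxon name to aligned sequence, and a taxon missing from a gene is padded with gap characters, which is one common convention. The function name and toy data are hypothetical.

```python
def concatenate_alignments(alignments, missing="-"):
    """Join per-gene alignments into a single supermatrix keyed by taxon."""
    taxa = sorted({taxon for aln in alignments for taxon in aln})
    parts = {taxon: [] for taxon in taxa}
    for aln in alignments:
        lengths = {len(seq) for seq in aln.values()}
        if len(lengths) != 1:
            raise ValueError("sequences within one alignment must have equal length")
        width = lengths.pop()
        for taxon in taxa:
            # Pad taxa absent from this gene so every row keeps the same length.
            parts[taxon].append(aln.get(taxon, missing * width))
    return {taxon: "".join(chunks) for taxon, chunks in parts.items()}


# Toy alignments for two genes; taxon C lacks gene 1.
gene_1 = {"A": "MK-L", "B": "MKVL"}
gene_2 = {"A": "GG", "B": "GA", "C": "AA"}

supermatrix = concatenate_alignments([gene_1, gene_2])
for taxon, row in supermatrix.items():
    print(taxon, row)
```

Restricting the input to single-copy, ubiquitous genes, as the abstract does, keeps this padding to a minimum, because every taxon then contributes exactly one sequence per gene.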

17.
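A budgerigar fledgling disease virus (BFDV) isolate (BFDV HBYM02), isolated for the first time in Yunmeng County, Hubei Province, was propagated in chicken embryo fibroblast (CEF) cultures; its complete genome was obtained by PCR amplification in segments and sequenced. The complete sequence of strain HBYM02 was analyzed for homology and phylogeny against the only six complete BFDV sequences in GenBank. BLAST analysis showed 98%–99% homology between HBYM02 and the other six BFDV strains, placing them in the same genotype. A phylogenetic tree was constructed with Phylip 3.5; the analysis showed that BFDV from different hosts is closely associated with host and shows no obvious correlation with geographical distribution.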

18.
Zhang YJ, Ma PF, Li DZ. PLoS ONE, 2011, 6(5): e20596

Background

Bambusoideae is the only subfamily that contains woody members in the grass family, Poaceae. In phylogenetic analyses, Bambusoideae, Pooideae and Ehrhartoideae formed the BEP clade, yet the internal relationships of this clade are controversial. The distinctive life history (infrequent flowering and predominance of asexual reproduction) of woody bamboos makes them an interesting but taxonomically difficult group. Phylogenetic analyses based on large DNA fragments could only provide a moderate resolution of woody bamboo relationships, although a robust phylogenetic tree is needed to elucidate their evolutionary history. Phylogenomics is an alternative choice for resolving difficult phylogenies.

Methodology/Principal Findings

Here we present the complete nucleotide sequences of six woody bamboo chloroplast (cp) genomes using Illumina sequencing. These genomes are similar to those of other grasses and rather conservative in evolution. We constructed a phylogeny of Poaceae from 24 complete cp genomes including 21 grass species. Within the BEP clade, we found strong support for a sister relationship between Bambusoideae and Pooideae. In a substantial improvement over prior studies, all six nodes within Bambusoideae were supported with ≥0.95 posterior probability from Bayesian inference and 5/6 nodes resolved with 100% bootstrap support in maximum parsimony and maximum likelihood analyses. We found that repeats in the cp genome could provide phylogenetic information, while caution is needed when using indels in phylogenetic analyses based on few selected genes. We also identified relatively rapidly evolving cp genome regions that have the potential to be used for further phylogenetic study in Bambusoideae.

Conclusions/Significance

The cp genome of Bambusoideae evolved slowly, and phylogenomics based on whole cp genome could be used to resolve major relationships within the subfamily. The difficulty in resolving the diversification among three clades of temperate woody bamboos, even with complete cp genome sequences, suggests that these lineages may have diverged very rapidly.

19.
A study was performed to define the population structure of Italian tomato spotted wilt virus (TSWV) isolates. Full genome sequences were assembled for six TSWV isolates found in two Italian regions (two from Latium: Lazio 17 and Tarquinia; and four from Calabria: PepCal 10, 12, 22 and 24). Nucleotide sequence identity percentages among these TSWV isolates are provided here. The six full-length genome sequences were compared with two other Italian isolates (p105 and p202/3WT) that had already been fully sequenced, as well as with full TSWV genomes retrieved from GenBank. Phylogenetic analysis, performed on concatenated sequences and on each gene of each genome segment (L, M and S), confirmed the presence of two clades, namely A-like and D-like. In particular, the phylogenetic tree based on segment L grouped all the newly sequenced TSWV isolates in the D-like clade. In the M segment phylogenetic tree, all our TSWV isolates shifted to the A-like clade. In the phylogenetic study of the distinct ORFs encoded by the S RNA segment, isolate separation was not correlated with geographical origin. In fact, in the nucleocapsid-encoding phylogenetic tree, PepCal 10 and 22 grouped in an A-like clade with p105, PepCal 12 and 24 in a D-like clade with p202/3WT, and Lazio 17 and Tarquinia in a third, well-distinct group. The NSs tree placed only PepCal 10 with p105 in the A-like clade, and PepCal 12, 22 and 24 with p202/3WT in a D-like subclade; the isolates from Latium grouped in a separate clade adjacent to the D-like isolates. Additional analysis of putative reassortment events showed that the Calabrian TSWV isolates likely originated from one reassortment event in M RNA and another in S RNA, with p105 as the major parent. Recombination events were detected in the isolates from Latium in the L and S RNAs, with Chinese isolates as the putative major parent.

20.

Background  

The construction of robust and well-resolved phylogenetic trees is important for our understanding of many, if not all, biological processes, including speciation and origin of higher taxa, genome evolution, metabolic diversification, multicellularity, origin of life styles, pathogenicity and so on. Many older phylogenies were not well supported due to insufficient phylogenetic signal present in the single or few genes used in phylogenetic reconstructions. Importantly, single gene phylogenies were not always found to be congruent. The phylogenetic signal may, therefore, be increased by enlarging the number of genes included in phylogenetic studies. Unfortunately, concatenation of many genes does not take into consideration the evolutionary history of each individual gene. Here, we describe an approach to select informative phylogenetic proteins to be used in the Tree of Life (TOL) and barcoding projects by comparing the cophenetic correlation coefficients (CCC) among individual protein distance matrices, using the fungi as an example. The method demonstrated that the quality and number of concatenated proteins are important for a reliable estimation of the TOL. Approximately 40–45 concatenated proteins seem to be needed to resolve the fungal TOL.
