共查询到20条相似文献,搜索用时 0 毫秒
1.
It is well known that phylogenetic trees derived from different protein families are often incongruent. This is explained by mapping errors and by the essential processes of gene duplication, loss, and horizontal transfer. Therefore, the problem is to derive a "consensus" tree best fitting the given set of gene trees. This work presents a new method of deriving this tree. The method is different from the existing ones, since it considers not only the topology of the initial gene trees, but also the reliability of their branches. Thereby one can explicitly take into account the possible errors in the gene trees caused by the absence of reliable models of sequence evolution, by uneven evolution of different gene families and taxonomic groups, etc. 相似文献
2.
Tofigh A Hallett M Lagergren J 《IEEE/ACM transactions on computational biology and bioinformatics / IEEE, ACM》2011,8(2):517-535
The incongruency between a gene tree and a corresponding species tree can be attributed to evolutionary events such as gene duplication and gene loss. This paper describes a combinatorial model where so-called DTL-scenarios are used to explain the differences between a gene tree and a corresponding species tree taking into account gene duplications, gene losses, and lateral gene transfers (also known as horizontal gene transfers). The reasonable biological constraint that a lateral gene transfer may only occur between contemporary species leads to the notion of acyclic DTL-scenarios. Parsimony methods are introduced by defining appropriate optimization problems. We show that finding most parsimonious acyclic DTL-scenarios is NP-hard. However, by dropping the condition of acyclicity, the problem becomes tractable, and we provide a dynamic programming algorithm as well as a fixed-parameter tractable algorithm for finding most parsimonious DTL-scenarios. 相似文献
3.
4.
Cecilia Alsmark Peter G Foster Thomas Sicheritz-Ponten Sirintra Nakjang T Martin Embley Robert P Hirt 《Genome biology》2013,14(2):R19
Background
The influence of lateral gene transfer on gene origins and biology in eukaryotes is poorly understood compared with those of prokaryotes. A number of independent investigations focusing on specific genes, individual genomes, or specific functional categories from various eukaryotes have indicated that lateral gene transfer does indeed affect eukaryotic genomes. However, the lack of common methodology and criteria in these studies makes it difficult to assess the general importance and influence of lateral gene transfer on eukaryotic genome evolution.Results
We used a phylogenomic approach to systematically investigate lateral gene transfer affecting the proteomes of thirteen, mainly parasitic, microbial eukaryotes, representing four of the six eukaryotic super-groups. All of the genomes investigated have been significantly affected by prokaryote-to-eukaryote lateral gene transfers, dramatically affecting the enzymes of core pathways, particularly amino acid and sugar metabolism, but also providing new genes of potential adaptive significance in the life of parasites. A broad range of prokaryotic donors is involved in such transfers, but there is clear and significant enrichment for bacterial groups that share the same habitats, including the human microbiota, as the parasites investigated.Conclusions
Our data show that ecology and lifestyle strongly influence gene origins and opportunities for gene transfer and reveal that, although the outlines of the core eukaryotic metabolism are conserved among lineages, the genes making up those pathways can have very different origins in different eukaryotes. Thus, from the perspective of the effects of lateral gene transfer on individual gene ancestries in different lineages, eukaryotic metabolism appears to be chimeric. 相似文献5.
Great progress has been made in understanding the genetic architecture of phenotypic variation, but it is almost entirely focused on how the genotype of an individual affects the phenotype of that same individual. However, in many species the genotype of the mother is a major determinant of the phenotype of her offspring. Therefore, a complete picture of genetic architecture must include these maternal genetic effects, but they can be difficult to identify because maternal and offspring genotypes are correlated and therefore, partially confounded. We present a conceptual framework that overcomes this challenge to separate direct and maternal effects in intact families through an analysis that we call "statistical cross-fostering." Our approach combines genotype data from mothers and their offspring to remove the confounding effects of the offspring's own genotype on measures of maternal genetic effects. We formalize our approach in an orthogonal model and apply this model to an experimental population of mice. We identify a set of six maternal genetic effect loci that explain a substantial portion of variation in body size at all ages. This variation would be missed in an approach focused solely on direct genetic effects, but is clearly a major component of genetic architecture. Our approach can easily be adapted to examine maternal effects in different systems, and because it does not require experimental manipulation, it provides a framework that can be used to understand the contribution of maternal genetic effects in both natural and experimental populations. 相似文献
6.
BACKGROUND: Lateral gene transfer (LGT) is an important evolutionary mechanism among prokaryotes. The situation in eukaryotes is less clear; the human genome sequence failed to give strong support for any recent transfers from prokaryotes to vertebrates, yet a number of LGTs from prokaryotes to protists (unicellular eukaryotes) have been documented. Here, we perform a systematic analysis to investigate the impact of LGT on the evolution of diplomonads, a group of anaerobic protists.RESULTS: Phylogenetic analyses of 15 genes present in the genome of the Atlantic Salmon parasite Spironucleus barkhanus and/or the intestinal parasite Giardia lamblia show that most of these genes originated via LGT. Half of the genes are putatively involved in processes related to an anaerobic lifestyle, and this finding suggests that a common ancestor, which most probably was aerobic, of Spironucleus and Giardia adapted to an anaerobic environment in part by acquiring genes via LGT from prokaryotes. The sources of the transferred diplomonad genes are found among all three domains of life, including other eukaryotes. Many of the phylogenetic reconstructions show eukaryotes emerging in several distinct regions of the tree, strongly suggesting that LGT not only involved diplomonads, but also involved other eukaryotic groups.CONCLUSIONS: Our study shows that LGT is a significant evolutionary mechanism among diplomonads in particular and protists in general. These findings provide insights into the evolution of biochemical pathways in early eukaryote evolution and have important implications for studies of eukaryotic genome evolution and organismal relationships. Furthermore, "fusion" hypotheses for the origin of eukaryotes need to be rigorously reexamined in the light of these results. 相似文献
7.
Laura A. Katz 《Philosophical transactions of the Royal Society of London. Series B, Biological sciences》2015,370(1678)
While there is compelling evidence for the impact of endosymbiotic gene transfer (EGT; transfer from either mitochondrion or chloroplast to the nucleus) on genome evolution in eukaryotes, the role of interdomain transfer from bacteria and/or archaea (i.e. prokaryotes) is less clear. Lateral gene transfers (LGTs) have been argued to be potential sources of phylogenetic information, particularly for reconstructing deep nodes that are difficult to recover with traditional phylogenetic methods. We sought to identify interdomain LGTs by using a phylogenomic pipeline that generated 13 465 single gene trees and included up to 487 eukaryotes, 303 bacteria and 118 archaea. Our goals include searching for LGTs that unite major eukaryotic clades, and describing the relative contributions of LGT and EGT across the eukaryotic tree of life. Given the difficulties in interpreting single gene trees that aim to capture the approximately 1.8 billion years of eukaryotic evolution, we focus on presence–absence data to identify interdomain transfer events. Specifically, we identify 1138 genes found only in prokaryotes and representatives of three or fewer major clades of eukaryotes (e.g. Amoebozoa, Archaeplastida, Excavata, Opisthokonta, SAR and orphan lineages). The majority of these genes have phylogenetic patterns that are consistent with recent interdomain LGTs and, with the notable exception of EGTs involving photosynthetic eukaryotes, we detect few ancient interdomain LGTs. These analyses suggest that LGTs have probably occurred throughout the history of eukaryotes, but that ancient events are not maintained unless they are associated with endosymbiotic gene transfer among photosynthetic lineages. 相似文献
8.
Background
When organismal phylogenies based on sequences of single marker genes are poorly resolved, a logical approach is to add more markers, on the assumption that weak but congruent phylogenetic signal will be reinforced in such multigene trees. Such approaches are valid only when the several markers indeed have identical phylogenies, an issue which many multigene methods (such as the use of concatenated gene sequences or the assembly of supertrees) do not directly address. Indeed, even when the true history is a mixture of vertical descent for some genes and lateral gene transfer (LGT) for others, such methods produce unique topologies. 相似文献9.
10.
Evolutionary analyses of the small subunit of glutamate synthase: gene order conservation,gene fusions,and prokaryote-to-eukaryote lateral gene transfers 总被引:3,自引:0,他引:3 下载免费PDF全文
Lateral gene transfer has been identified as an important mode of genome evolution within prokaryotes. Except for the special case of gene transfer from organelle genomes to the eukaryotic nucleus, only a few cases of lateral gene transfer involving eukaryotes have been described. Here we present phylogenetic and gene order analyses on the small subunit of glutamate synthase (encoded by gltD) and its homologues, including the large subunit of sulfide dehydrogenase (encoded by sudA). The scattered distribution of the sudA and sudB gene pair and the phylogenetic analysis strongly suggest that lateral gene transfer was involved in the propagation of the genes in the three domains of life. One of these transfers most likely occurred between a prokaryote and an ancestor of diplomonad protists. Furthermore, phylogenetic analyses indicate that the gene for the small subunit of glutamate synthase was transferred from a low-GC gram-positive bacterium to a common ancestor of animals, fungi, and plants. Interestingly, in both examples, the eukaryotes encode a single gene that corresponds to a conserved operon structure in prokaryotes. Our analyses, together with several recent publications, show that lateral gene transfers from prokaryotes to unicellular eukaryotes occur with appreciable frequency. In the case of the genes for sulfide dehydrogenase, the transfer affected only a limited group of eukaryotes—the diplomonads—while the transfer of the glutamate synthase gene probably happened earlier in evolution and affected a wider range of eukaryotes. 相似文献
11.
A phylogeny that allows for lateral gene transfer (LGT) can be thought of as a strictly branching tree (all of whose branches
are vertical) to which lateral branches have been added. Given that the goal of phylogenetics is to depict evolutionary history,
we should look for the best supported phylogenetic network and not restrict ourselves to considering trees. However, the obvious
extensions of popular tree-based methods such as maximum parsimony and maximum likelihood face a serious problem—if we judge
networks by fit to data alone, networks that have lateral branches will always fit the data at least as well as any network
that restricts itself to vertical branches. This is analogous to the well-studied problem of overfitting data in the curve-fitting
problem. Analogous problems often have analogous solutions and we propose to treat network inference as a case of model selection
and use the Akaike Information Criterion (AIC). Strictly tree-like networks are more parsimonious than those that postulate
lateral as well as vertical branches. This leads to the conclusion that we should not always infer LGT events whenever it
would improve our fit-to-data, but should do so only when the improved fit is larger than the penalty for adding extra lateral
branches. 相似文献
12.
Background
The glycolytic phosphoglycerate mutases exist as non-homologous isofunctional enzymes (NISE) having independent evolutionary origins and no similarity in primary sequence, 3D structure, or catalytic mechanism. Cofactor-dependent PGM (dPGM) requires 2,3-bisphosphoglycerate for activity; cofactor-independent PGM (iPGM) does not. The PGM profile of any given bacterium is unpredictable and some organisms such as Escherichia coli encode both forms.Methods/Principal Findings
To examine the distribution of PGM NISE throughout the Bacteria, and gain insight into the evolutionary processes that shape their phyletic profiles, we searched bacterial genome sequences for the presence of dPGM and iPGM. Both forms exhibited patchy distributions throughout the bacterial domain. Species within the same genus, or even strains of the same species, frequently differ in their PGM repertoire. The distribution is further complicated by the common occurrence of dPGM paralogs, while iPGM paralogs are rare. Larger genomes are more likely to accommodate PGM paralogs or both NISE forms. Lateral gene transfers have shaped the PGM profiles with intradomain and interdomain transfers apparent. Archaeal-type iPGM was identified in many bacteria, often as the sole PGM. To address the function of PGM NISE in an organism encoding both forms, we analyzed recombinant enzymes from E. coli. Both NISE were active mutases, but the specific activity of dPGM greatly exceeded that of iPGM, which showed highest activity in the presence of manganese. We created PGM null mutants in E. coli and discovered the ΔdPGM mutant grew slowly due to a delay in exiting stationary phase. Overexpression of dPGM or iPGM overcame this defect.Conclusions/Significance
Our biochemical and genetic analyses in E. coli firmly establish dPGM and iPGM as NISE. Metabolic redundancy is indicated since only larger genomes encode both forms. Non-orthologous gene displacement can fully account for the non-uniform PGM distribution we report across the bacterial domain. 相似文献13.
Even though the 16S rRNA gene is the most commonly used taxonomic marker in microbial ecology, its poor resolution is still not fully understood at the intra-genus level. In this work, the number of rRNA gene operons, intra-genomic heterogeneities and lateral transfers were investigated at a fine-scale resolution, throughout the Pseudomonas genus. In addition to nineteen sequenced Pseudomonas strains, we determined the 16S rRNA copy number in four other Pseudomonas strains by Southern hybridization and Pulsed-Field Gel Electrophoresis, and studied the intra-genomic heterogeneities by Denaturing Gradient Gel Electrophoresis and sequencing. Although the variable copy number (from four to seven) seems to be correlated with the evolutionary distance, some close strains in the P. fluorescens lineage showed a different number of 16S rRNA genes, whereas all the strains in the P. aeruginosa lineage displayed the same number of genes (four copies). Further study of the intra-genomic heterogeneities revealed that most of the Pseudomonas strains (15 out of 19 strains) had at least two different 16S rRNA alleles. A great difference (5 or 19 nucleotides, essentially grouped near the V1 hypervariable region) was observed only in two sequenced strains. In one of our strains studied (MFY30 strain), we found a difference of 12 nucleotides (grouped in the V3 hypervariable region) between copies of the 16S rRNA gene. Finally, occurrence of partial lateral transfers of the 16S rRNA gene was further investigated in 1803 full-length sequences of Pseudomonas available in the databases. Remarkably, we found that the two most variable regions (the V1 and V3 hypervariable regions) had probably been laterally transferred from another evolutionary distant Pseudomonas strain for at least 48.3 and 41.6% of the 16S rRNA sequences, respectively. In conclusion, we strongly recommend removing these regions of the 16S rRNA gene during the intra-genus diversity studies. 相似文献
14.
apTreeshape: statistical analysis of phylogenetic tree shape 总被引:3,自引:0,他引:3
apTreeshape is a R package dedicated to simulation and analysis of phylogenetic tree topologies using statistical imbalance measures. It is a companion library of the R package 'ape', which provides additional functions for reading, plotting, manipulating phylogenetic trees and for connecting to public phylogenetic tree databases. One strength of the package is to include appropriate corrections of classical shape statistics as well as new tests based on the statistical theory of likelihood ratios. 相似文献
15.
Lateral gene transfer (LGT) is an important mechanism of natural variation among prokaryotes. Over the full course of evolution, most or all of the genes resident in a given prokaryotic genome have been affected by LGT, yet the frequency of LGT can vary greatly across genes and across prokaryotic groups. The proteobacteria are among the most diverse of prokaryotic taxa. The prevalence of LGT in their genome evolution calls for the application of network-based methods instead of tree-based methods to investigate the relationships among these species. Here, we report networks that capture both vertical and horizontal components of evolutionary history among 1,207,272 proteins distributed across 329 sequenced proteobacterial genomes. The network of shared proteins reveals modularity structure that does not correspond to current classification schemes. On the basis of shared protein-coding genes, the five classes of proteobacteria fall into two main modules, one including the alpha-, delta-, and epsilonproteobacteria and the other including beta- and gammaproteobacteria. The first module is stable over different protein identity thresholds. The second shows more plasticity with regard to the sequence conservation of proteins sampled, with the gammaproteobacteria showing the most chameleon-like evolutionary characteristics within the present sample. Using a minimal lateral network approach, we compared LGT rates at different phylogenetic depths. In general, gene evolution by LGT within proteobacteria is very common. At least one LGT event was inferred to have occurred in at least 75% of the protein families. The average LGT rate at the species and class depth is about one LGT event per protein family, the rate doubling at the phylum level to an average of two LGT events per protein family. Hence, our results indicate that the rate of gene acquisition per protein family is similar at the level of species (by recombination) and at the level of classes (by LGT). The frequency of LGT per genome strongly depends on the species lifestyle, with endosymbionts showing far lower LGT frequencies than free-living species. Moreover, the nature of the transferred genes suggests that gene transfer in proteobacteria is frequently mediated by conjugation. 相似文献
16.
Michael W. Smith Da-Fei Feng Russell F. Doolittle 《Trends in biochemical sciences》1992,17(12):489-493
One of the most debated questions in the field of molecular evolution is the possible role of horizontal transfer in evolution. Of all the claims that have been made over the years, those reporting transfers between eukaryotes and prokaryotes are the most controversial. Here we present the cases for and against several such possible gene acquisitions. 相似文献
17.
Chupov VS 《Tsitologiia》2002,44(4):323-333
The lateral phylogenetic branch is S-formed, when regarded in the time-divergency degree coordinates. On the initial side of the curve, the rate of divergency development is low to increase in the middle, and to fall down again on the final side. In result, within one phylogenetic branch all the taxa appear to be typologically diverse. 相似文献
18.
19.
Reverse gyrase, an enzyme of uncertain funtion, is present in all hyperthermophilic archaea and bacteria. Previous phylogenetic studies have suggested that the gene for reverse gyrase has an archaeal origin and was transferred laterally (LGT) to the ancestors of the two bacterial hyperthermophilic phyla, Thermotogales and Aquificales. Here, we performed an in-depth analysis of the evolutionary history of reverse gyrase in light of genomic progress. We found genes coding for reverse gyrase in the genomes of several thermophilic bacteria that belong to phyla other than Aquificales and Thermotogales. Several of these bacteria are not, strictly speaking, hyperthermophiles because their reported optimal growth temperatures are below 80 degrees C. Furthermore, we detected a reverse gyrase gene in the sequence of the large plasmid of Thermus thermophilus strain HB8, suggesting a possible mechanism of transfer to the T. thermophilus strain HB8 involving plasmids and transposases. The archaeal part of the reverse gyrase tree is congruent with recent phylogenies of the archaeal domain based on ribosomal proteins or RNA polymerase subunits. Although poorly resolved, the complete reverse gyrase phylogeny suggests an ancient acquisition of the gene by bacteria via one or two LGT events, followed by its secondary distribution by LGT within bacteria. Finally, several genes of archaeal origin located in proximity to the reverse gyrase gene in bacterial genomes have bacterial homologues mostly in thermophiles or hyperthermophiles, raising the possibility that they were co-transferred with the reverse gyrase gene. Our new analysis of the reverse gyrase history strengthens the hypothesis that the acquisition of reverse gyrase may have been a crucial evolutionary step in the adaptation of bacteria to high-temperature environments. However, it also questions the role of this enzyme in thermophilic bacteria and the selective advantage its presence could provide. 相似文献