共查询到20条相似文献,搜索用时 0 毫秒
1.
How gene function evolves is a central question of evolutionary biology. It can be investigated by comparing functional genomics results between species and between genes. Most comparative studies of functional genomics have used pairwise comparisons. Yet it has been shown that this can provide biased results, as genes, like species, are phylogenetically related. Phylogenetic comparative methods should be used to correct for this, but they depend on strong assumptions, including unbiased tree estimates relative to the hypothesis being tested. Such methods have recently been used to test the “ortholog conjecture,” the hypothesis that functional evolution is faster in paralogs than in orthologs. Although pairwise comparisons of tissue specificity () provided support for the ortholog conjecture, phylogenetic independent contrasts did not. Our reanalysis on the same gene trees identified problems with the time calibration of duplication nodes. We find that the gene trees used suffer from important biases, due to the inclusion of trees with no duplication nodes, to the relative age of speciations and duplications, to systematic differences in branch lengths, and to non-Brownian motion of tissue specificity on many trees. We find that incorrect implementation of phylogenetic method in empirical gene trees with duplications can be problematic. Controlling for biases allows successful use of phylogenetic methods to study the evolution of gene function and provides some support for the ortholog conjecture using three different phylogenetic approaches. 相似文献
2.
Discovery of Novel Conserved Peptide Domains by Ortholog Comparison within Plant Multi-Protein Families 总被引:1,自引:0,他引:1
Panstruga R 《Plant molecular biology》2005,59(3):485-500
3.
判定直系同源关系的进化分析方法 总被引:1,自引:0,他引:1
如何正确判定基因之间的直系同源 (ortholog)和旁系同源 (paralog)关系 ,仍是基因组功能诠释和比较基因组学中有待更好解决的关键问题。在以前的工作中 ,曾用进化分析方法解决多基因家族的直系 /旁系同源关系的判定问题 ,现进而完整地展开判定直系同源关系的进化分析方法。从 44个同源蛋白质家族的案例观察表明 ,与流行的COG方法 (直系同源蛋白质的聚类 )比较 ,本方法能一般的判定直系同源关系以及能准确的诠释基因组的分子功能 相似文献
4.
Quality control of probe sequences is a major concern in microarray technology. The presence of poor quality probes has a negative impact on the microarray data analysis process. The Microarray Manual Curation Tool (MMCT) is a web server application that provides computational and visual means to investigate the quality of individual probes for oligo microarrays. The MMCT quality metrics assess the free energy of hybridization and the secondary structure of duplexes formed by selected targets and probes, which are specific to various microarray platforms. AVAILABILITY: http://www.nrcbioinformatics.ca/mmct. 相似文献
5.
Drouilhet L Paillisson A Bontoux M Jeanpierre E Mazerbourg S Monget P 《Molecular reproduction and development》2008,75(12):1691-1700
During folliculogenesis, oocytes accumulate maternal mRNAs in preparation for the first steps of early embryogenesis. The processing of oocyte mRNAs is ensured by heterogeneous nuclear ribonucleoproteins (hnRNPs) genes that encode RNA binding proteins implied in mRNA biogenesis, translation, alternative splicing, nuclear exportation, and degradation. In the present work, by combining phylogenetic analyses and, when available, in silico expression data, we have identified three new oocyte-expressed genes encoding RNA binding proteins by using two strategies. Firstly, we have identified mouse orthologs of the Car1 gene, known to be involved in regulation of germ cell apoptosis in C. elegans, and of the Squid gene, required for the establishment of anteroposterior polarity in the Drosophila oocyte. Secondly, we have identified, among genes whose ESTs are highly represented in oocyte libraries, a paralog of Poly(A) binding protein--Interacting Protein 2 (Paip2) gene, known to inhibit the interaction of the Poly(A)-Binding Protein with Poly(A) tails of mRNAs. For all of these genes, the expression in oocyte was verified by in situ hybridization. Overall, this work underlines the efficiency of in silico methodologies to identify new genes involved in biological processes such as oogenesis. 相似文献
6.
7.
Sridhar J Sabarinathan R Balan SS Rafi ZA Gunasekaran P Sekar K 《基因组蛋白质组与生物信息学报(英文版)》2011,9(4-5):179-182
In the past few decades, scientists from all over the world have taken a keen interest in novel functional units such as small regulatory RNAs, small open reading frames, pseudogenes, transposons, integrase binding attB/attP sites, repeat elements within the bacterial intergenic regions (IGRs) and in the analysis of those "junk" regions for genomic complexity. Here we have developed a web server, named Junker, to facilitate the in-depth analysis of IGRs for examining their length distribution, four-quadrant plots, GC percentage and repeat details. Upon selection of a particular bacterial genome, the physical genome map is displayed as a multiple loci with options to view any loci of interest in detail. In addition, an IGR statistics module has been created and implemented in the web server to analyze the length distribution of the IGRs and to understand the disordered grouping of IGRs across the genome by generating the four-quadrant plots. The proposed web server is freely available at the URL http://pranag.physics.iisc.ernet.in/junker/. 相似文献
8.
Jayavel Sridhar Radhakrishnan Sabarinathan Shanmugam Siva Balan Ziauddin Ahamed Rafi Paramasamy Gunasekaran Kanagaraj Sekar 《基因组蛋白质组与生物信息学报(英文版)》2011,(Z2)
In the past few decades, scientists from all over the world have taken a keen interest in novel functional units such as small regulatory RNAs, small open reading frames, pseudogenes, transposons, integrase binding attB/attP sites, repeat elements within the bacterial intergenic regions (IGRs) and in the analysis of those junk regions for ge- nomic complexity. Here we have developed a web server, named Junker, to facilitate the in-depth analysis of IGRs for examining their length distribution, four-quadrant... 相似文献
9.
Possible genetic fates of a gene duplicate are silencing, redundancy, subfunctionalization, or novel function. These different fates can be realized at the DNA, RNA, or protein level, and their genetic determinants are poorly understood. We explored molecular evolution of duplicated RAG-1 genes in African clawed frogs (Xenopus and Silurana) (1) to examine the fate of paralogs of this gene at the DNA level in terms of recombination, positive selection, and gene degeneration and in the absence of extensive recombination among alleles at different paralogs, (2) to test phylogenetic hypotheses about the origins of polyploid species. We found that recombination between different RAG-1 paralogs is infrequent, that degeneration of some paralogs has occurred via stop codons and frameshift mutations, and that this degeneration occurred in paralogs inherited from only one diploid progenitor species. Simulations and phylogenetic analyses of RAG-1 and mitochondrial DNA support one origin of extant tetraploids in Xenopus and at least one origin in Silurana, five allopolyploid origins of extant octoploids, and two allopolyploid origins of extant dodecaploids. In allopolyploid species, which inherit a complete genome from two different ancestors, genes inherited from the same ancestor have a longer period of coevolution than genes inherited from different ancestors. Because of this, gene ancestry could potentially influence gene fate: interacting paralogs derived from the same lower ploidy ancestor might have similar genetic destinies. 相似文献
10.
Bikram K. Parida Sandeep Panda Namrata Misra Prasanna K. Panda Barada K. Mishra 《Geomicrobiology journal》2014,31(4):299-314
Recovery of metal value, especially from low-grade ores and overburden minerals using acidophilic bacteria through the process of bioleaching is an environmentally benign and commercially scalable biotechnology. In recent years, while the “OMICS” landscape has been witnessing extensive application of computational tools to understand and interpret global biological sequence data, a dedicated bioinformatic server for analysis of bacterial information in the context of its bioleaching ability is not available. We have developed an on-line Bacterial Bioleaching Protein Finder (BBProF) System, which rapidly identifies novel proteins involved in a bacterial bioleaching process and also performs phylogenetic analysis of 16S rRNA genes. BBProF uses the features of Asynchronous Java Script and XML (AJAX) to provide an efficient and fast user experience with minimal requirement of network bandwidth. In the input module the server accepts any bacterial or archaeal complete genome sequence in RAW format and provides a list of proteins involved in the microbial leaching process. BBProF web server is integrated with the European Bioinformatics Institute (EBI) web services such as BLAST for homology search and InterProScan for functional characterization of output protein sequences. Studying evolutionary relationship of bacterial strains of interest using Muscle and ClustalW2 phylogeny web services from EBI is another key feature of our server, where 16S rRNA gene sequences are considered as input through a JQUERY interface along with the sequences present in the BBProF database library. Complete genome sequences of 24 bioleaching microorganism characterized by genomic and physiological study in the laboratory and their respective 16S rRNA gene sequences were stored in the database of the BBProF library. To our knowledge BBProF is the first integrated bioinformatic web server that demonstrates its utility in identifying potential bioleaching bacteria. We hope that the server facilitate ongoing comparative genomic studies on of bioleaching microorganisms and also assist in identification and design of novel microbial consortia that are optimally efficient bioleaching agents. 相似文献
11.
We suggest a new procedure to search for the genes with horizontal transfer events in their evolutionary history. The search is based on analysis of topology difference between the phylogenetic trees of gene (protein) groups and the corresponding phylogenetic species trees. Numeric values are introduced to measure the discrepancy between the trees. This approach was applied to analyze 40 prokaryotic genomes classified into 132 classes of orthologs. This resulted in a list of the candidate genes for which the hypothesis of horizontal transfer in evolution looks true. 相似文献
12.
Ontologies have emerged as a fast growing research topic in the area of semantic web during last decade. Currently there are 204 ontologies that are available through OBO Foundry and BioPortal. Several excellent tools for navigating the ontological structure are available, however most of them are dedicated to a specific annotation data or integrated with specific analysis applications, and do not offer flexibility in terms of general-purpose usage for ontology exploration. We developed OntoVisT, a web based ontological visualization tool. This application is designed for interactive visualization of any ontological hierarchy for a specific node of interest, up to the chosen level of children and/or ancestor. It takes any ontology file in OBO format as input and generates output as DAG hierarchical graph for the chosen query. To enhance the navigation capabilities of complex networks, we have embedded several features such as search criteria, zoom in/out, center focus, nearest neighbor highlights and mouse hover events. The application has been tested on all 72 data sets available in OBO format through OBO foundry. The results for few of them can be accessed through OntoVisT-Gallery. AVAILABILITY: The database is available for free at http://ccbb.jnu.ac.in/OntoVisT.html. 相似文献
13.
The paper clears up confusions about the concepts of orthology and paralogy, particularly in cases involving gene family expansions. The terms ‘inparalog’ and ‘outparalog’ are defined to distinguish ancient paralogs from lineage-specific ones. 相似文献
14.
Rapid Evaluation of Least-Squares and Minimum-Evolution Criteria on Phylogenetic Trees 总被引:3,自引:2,他引:3
We present fast new algorithms for evaluating trees with respectto least squares and minimum evolution (ME), the most commonlyused criteria for inferring phylogenetic trees from distancedata. The new algorithms include an optimal O(N2) time algorithmfor calculating the edge (branch or internode) lengths on atree according to ordinary or unweighted least squares (OLS);an O(N3) time algorithm for edge lengths under weighted leastsquares (WLS) including the Fitch-Margoliash method; and anoptimal O(N4) time algorithm for generalized least-squares (GLS)edge lengths (where N is the number of taxa in the tree). TheME criterion is based on the sum of edge lengths. Consequently,the edge lengths algorithms presented here lead directly toO(N2), O(N3), and O(N4) time algorithms for ME
under OLS, WLS,and GLS, respectively. All of these algorithms are as fast asor faster than any of those previously published, and the algorithmsfor OLS and GLS are the fastest possible (with respect to orderof computational complexity). A major advantage of our new methodsis that they are as well adapted to multifurcating trees asthey are to binary trees. An optimal algorithm for determiningpath lengths from a tree with given edge lengths is also developed.This
leads to an optimal O(N2) algorithm for OLS sums of squaresevaluation and corresponding O(N3) and O(N4) time algorithmsfor WLS and GLS sums of squares, respectively. The GLS algorithmis time-optimal if the covariance matrix is already inverted.The speed of each algorithm is assessed analyticallythespeed increases we calculate are confirmed by the dramatic speedincreases resulting from their implementation in PAUP* 4.0.The new algorithms enable far more extensive tree searches andstatistical evaluations (e.g., bootstrap, parametric bootstrap,or jackknife) in the same amount of time. Hopefully, the fastalgorithms for WLS and GLS will encourage the use of these criteriafor evaluating trees and their edge lengths (e.g., for approximatedivergence time estimates), since they should be more statisticallyefficient than OLS. 相似文献
15.
Phylogenetic trees inferred from sequence data often have branch lengths measured in the expected number of substitutions and therefore, do not have divergence times estimated. These trees give an incomplete view of evolutionary histories since many applications of phylogenies require time trees. Many methods have been developed to convert the inferred branch lengths from substitution unit to time unit using calibration points, but none is universally accepted as they are challenged in both scalability and accuracy under complex models. Here, we introduce a new method that formulates dating as a nonconvex optimization problem where the variance of log-transformed rate multipliers is minimized across the tree. On simulated and real data, we show that our method, wLogDate, is often more accurate than alternatives and is more robust to various model assumptions. 相似文献
16.
We present the first phylogenetic study on the widespread Middle American microhylid frog genus Hypopachus. Partial sequences of mitochondrial (12S and 16S ribosomal RNA) and nuclear (rhodopsin) genes (1275 bp total) were analyzed from 43 samples of Hypopachus, three currently recognized species of Gastrophryne, and seven arthroleptid, brevicipitid and microhylid outgroup taxa. Maximum parsimony (PAUP), maximum likelihood (RAxML) and Bayesian inference (MrBayes) optimality criteria were used for phylogenetic analyses, and BEAST was used to estimate divergence dates of major clades. Population-level analyses were conducted with the programs NETWORK and Arlequin. Results confirm the placement of Hypopachus and Gastrophryne as sister taxa, but the latter genus was strongly supported as paraphyletic. The African phrynomerine genus Phrynomantis was recovered as the sister taxon to a monophyletic Chiasmocleis, rendering our well-supported clade of gastrophrynines paraphyletic. Hypopachus barberi was supported as a disjunctly distributed highland species, and we recovered a basal split in lowland populations of Hypopachus variolosus from the Pacific versant of Mexico and elsewhere in the Mesoamerican lowlands. Dating analyses from BEAST estimate speciation within the genus Hypopachus occurred in the late Miocene/early Pliocene for most clades. Previous studies have not found bioacoustic or morphological differences among these lowland clades, and our molecular data support the continued recognition of two species in the genus Hypopachus. 相似文献
17.
本研究应用套式RT-PCR方法扩增出牛病毒性腹泻病毒VEDEVAC株编码NS3蛋白的基因,克隆至表达载体pET-30a(+),并进行测序。对瘟病毒属病毒NS3基因进行氨基酸差异性分析,显示平均P-distance为0.07,系统进化树分析表明VEDEVAC株隶属于BVDV1型。将构建成功的重组质粒转化大肠埃希氏菌Rosetta(DE3),在IPTG诱导下表达NS3重组蛋白,经Ni-NTA亲和层析纯化和尿素梯度透析后进行反应原性鉴定。Western blotting结果显示表达的重组蛋白可以与牛病毒性腹泻病毒阳性血清反应,并与猪瘟阳性血清有交叉反应,ELISA结果显示该重组蛋白具有良好的反应原性。所获得的蛋白为建立针对NS3抗体的ELISA检测方法奠定了基础。 相似文献
18.
Martin Bitomský Pavla Mldkov Robin J. Pakeman Martin Duchoslav 《Ecology and evolution》2020,10(8):3747-3757
Phylogenetic diversity quantification is based on indices computed from phylogenetic distances among species, which are derived from phylogenetic trees. This approach requires phylogenetic expertise and available molecular data, or a fully sampled synthesis‐based phylogeny. Here, we propose and evaluate a simpler alternative approach based on taxonomic coding. We developed metrics, the clade indices, based on information about clade proportions in communities and species richness of a community or a clade, which do not require phylogenies. Using vegetation records from herbaceous plots from Central Europe and simulated vegetation plots based on a megaphylogeny of vascular plants, we examined fit accuracy of our proposed indices for all dimensions of phylogenetic diversity (richness, divergence, and regularity). For real vegetation data, the clade indices fitted phylogeny‐based metrics very accurately (explanatory power was usually higher than 80% for phylogenetic richness, almost always higher than 90% for phylogenetic divergence, and often higher than 70% for phylogenetic regularity). For phylogenetic regularity, fit accuracy was habitat and species richness dependent. For phylogenetic richness and divergence, the clade indices performed consistently. In simulated datasets, fit accuracy of all clade indices increased with increasing species richness, suggesting better precision in species‐rich habitats and at larger spatial scales. Fit accuracy for phylogenetic divergence and regularity was unreliable at large phylogenetic scales, suggesting inadvisability of our method in habitats including many distantly related lineages. The clade indices are promising alternative measures for all projects with a phylogenetic framework, which can trade‐off a little precision for a significant speed‐up and simplification, such as macroecological analyses or where phylogenetic data is incomplete. 相似文献
19.
Carreras M Marco C Gianti E Eleonora G Sartori L Luca S Plyte SE Edward PS Isacchi A Antonella I Bosotti R Roberta B 《基因组蛋白质组与生物信息学报(英文版)》2005,3(1):58-60
PoInTree (Polar and Interactive Tree) is an application that allows to build, visualize, and customize phylogenetic trees in a polar, interactive, and highly flexible view. It takes as input a FASTA file or multiple alignment formats. Phylogenetic tree calculation is based on a sequence distance method and utilizes the Neighbor Joining (N J) algorithm. It also allows displaying precalculated trees of the major protein families based on Pfam classification. In PoInTree, nodes can be dynamically opened and closed and distances between genes are graphically represented. Tree root can be centered on a selected leaf. Text search mechanism, color-coding and labeling display are integrated. The visualizer can be connected to an Oracle database containing information on sequences and other biological data, helping to guide their interpretation within a given protein family across multiple species. The application is written in Borland Delphi and based on VCL Teechart Pro 6 graphical component (Steema software). 相似文献
20.
Jérôme Bourret Fanni Borvető Ignacio G. Bravo 《Journal of evolutionary biology》2023,36(10):1375-1392
Gene paralogs are copies of an ancestral gene that appear after gene or full genome duplication. When two sister gene copies are maintained in the genome, redundancy may release certain evolutionary pressures, allowing one of them to access novel functions. Here, we focused our study on gene paralogs on the evolutionary history of the three polypyrimidine tract binding protein genes (PTBP) and their concurrent evolution of differential codon usage preferences (CUPrefs) in vertebrate species. PTBP1-3 show high identity at the amino acid level (up to 80%) but display strongly different nucleotide composition, divergent CUPrefs and, in humans and in many other vertebrates, distinct tissue-specific expression levels. Our phylogenetic inference results show that the duplication events leading to the three extant PTBP1-3 lineages predate the basal diversification within vertebrates, and genomic context analysis illustrates that local synteny has been well preserved over time for the three paralogs. We identify a distinct evolutionary pattern towards GC3-enriching substitutions in PTBP1, concurrent with enrichment in frequently used codons and with a tissue-wide expression. In contrast, PTBP2s are enriched in AT-ending, rare codons, and display tissue-restricted expression. As a result of this substitution trend, CUPrefs sharply differ between mammalian PTBP1s and the rest of PTBPs. Genomic context analysis suggests that GC3-rich nucleotide composition in PTBP1s is driven by local substitution processes, while the evidence in this direction is thinner for PTBP2-3. An actual lack of co-variation between the observed GC composition of PTBP2-3 and that of the surrounding non-coding genomic environment would raise an interrogation on the origin of CUPrefs, warranting further research on a putative tissue-specific translational selection. Finally, we communicate an intriguing trend for the use of the UUG-Leu codon, which matches the trends of AT-ending codons. Our results are compatible with a scenario in which a combination of directional mutation–selection processes would have differentially shaped CUPrefs of PTBPs in vertebrates: the observed GC-enrichment of PTBP1 in placental mammals may be linked to genomic location and to the strong and broad tissue-expression, while AT-enrichment of PTBP2 and PTBP3 would be associated with rare CUPrefs and thus, possibly to specialized spatio-temporal expression. Our interpretation is coherent with a gene subfunctionalisation process by differential expression regulation associated with the evolution of specific CUPrefs. 相似文献