Similar articles
 Found 20 similar articles (search time: 31 ms)
1.
2.
Zeng LC, Han ZG, Ma WJ. FEBS Letters, 2005, 579(25): 5443-5453
The categorization of genes by structural distinctions relevant to biological characteristics is important for understanding gene functions and predicting the functional implications of uncharacterized genes. An effective and efficient strategy was needed to deal with the complexity of the large olfactomedin-like (OLF) gene family, whose members share sequence similarity but play diversified roles in many important biological processes, because simple highest-hit homology analysis gave incomplete results and led to inappropriate annotation of some uncharacterized OLF members. In light of evolutionary information that may facilitate the classification of the OLF family and the proper association of novel OLF genes with characterized homologs, we performed phylogenetic analysis on all 116 OLF proteins currently available, including two novel members cloned by our group. The OLF family segregated into seven subfamilies, and members with similar domain compositions or functional properties all fell into relevant subfamilies. Furthermore, our Northern blot analysis and previous studies revealed that the typical human OLF members in each subfamily exhibited tissue-specific expression patterns, which in turn supported the segregation of the OLF subfamilies with functional divergence. Interestingly, the phylogenetic tree topology for the OLF domains alone was almost identical to that of the full-length tree, which represents the unique phylogenetic feature of full-length OLF proteins and their particular domain compositions. Moreover, each of the major functional domains of OLF proteins kept the same phylogenetic feature in defining a similar tree topology. This indicates that the OLF domain and the various domains in the flanking non-OLF regions have coevolved and are likely to be functionally interdependent.
Expanded by a plausible scenario of gene duplication and domain coupling, the OLF family comprises seven evolutionarily and functionally distinct subfamilies, in which each member shares similar structural and functional characteristics, including the composition of coevolved and interdependent domains. The phylogenetically classified and preliminarily assessed subfamily framework may greatly facilitate the study of the OLF proteins. Furthermore, it demonstrates a feasible and reliable strategy for categorizing novel genes and predicting the functional implications of uncharacterized proteins, based on the comprehensive phylogenetic classification of the subfamilies and their relevance to preliminary functional characteristics.

3.
Conventional classification of the species in the family Mycoplasmataceae is mainly based on phenotypic criteria, which are complicated, can be difficult to measure, and can be hampered by phenotypic deviations among the isolates. The number of biochemical reactions suitable for phenotypic characterization of the Mycoplasmataceae is also very limited, and therefore the strategy for the final identification of the Mycoplasmataceae species is based on comparative serological results. However, serological testing of the Mycoplasmataceae species requires a performance panel of hyperimmune sera containing antiserum to each known species of the family and a high level of technical expertise, and it can only be properly performed by mycoplasma reference laboratories. In addition, the existence of uncultivated and fastidious Mycoplasmataceae species/isolates in clinical materials significantly complicates, or even makes impossible, the application of conventional bacteriological tests. The analysis of available genetic markers is an additional approach for the primary identification and phylogenetic classification of cultivable species and uncultivable or fastidious organisms in standard microbiological laboratories. The partial nucleotide sequences of the RNA polymerase β-subunit gene (rpoB) and the 16S-23S rRNA intergenic transcribed spacer (ITS) were determined for all known type strains and the available non-type strains of the Mycoplasmataceae species. In addition to the available 16S rRNA gene data, the ITS and rpoB sequences were used to infer phylogenetic relationships among these species and to enable identification of the Mycoplasmataceae isolates to the species level.
The comparison of the ITS and rpoB phylogenetic trees with the 16S rRNA reference phylogenetic tree revealed similar clustering patterns for the Mycoplasmataceae species, with minor discrepancies for a few species that showed higher divergence of their ITS and rpoB than their neighbor species. Overall, our results demonstrated that the ITS and the rpoB gene could be useful complementary phylogenetic markers for inferring phylogenetic relationships among the Mycoplasmataceae species and could provide useful background information for the choice of appropriate metabolic and serological tests for the final classification of isolates. In summary, three-target sequence analysis, which includes the ITS, rpoB, and 16S rRNA genes, was demonstrated to be a reliable and useful taxonomic tool for species differentiation within the family Mycoplasmataceae, based on phylogenetic relatedness and pairwise sequence similarities. We believe that this approach might also become a valuable tool for routine analysis and primary identification of new isolates in medical and veterinary microbiological laboratories.

4.
The sequence and structural analysis of cadherins allows us to find sequence determinants: a few positions in sequences whose residues are characteristic and specific for the structures of a given family. Comparison of the five extracellular domains of classic cadherins showed that they share the same sequence determinants despite only a nonsignificant sequence similarity between the N-terminal domain and the other extracellular domains. This allowed us to predict secondary structures and propose three-dimensional structures for those domains that have not been structurally analyzed previously. A new method of assigning a sequence to its proper protein family is suggested: analysis of sequence determinants. The main advantage of this method is that it is not necessary to know all or almost all residues in a sequence, as is required for traditional classification tools such as BLAST, FASTA, and HMM. Using the key positions only, that is, residues that serve as the sequence determinants, we found that all members of the classic cadherin family were unequivocally selected from among 80,000 examined proteins. In addition, we proposed a model for the secondary structure of the cytoplasmic domain of cadherins based on the principal relations between sequences and secondary structure multialignments. The patterns of the secondary structure of this domain can serve as the distinguishing characteristics of cadherins.

5.
A new classification scheme is described, based on simulated thermal melting profiles of DNA sequences. This method was applied to the classification of (a) β-globin genes from several mammalian species and (b) α-chain class II MHC genes. Comparison of the thermal melting profiles with the molecular phylogenetic trees constructed from the sequences shows that the melting-temperature-based approach is able to reproduce most of the major features of the sequence-based evolutionary tree. The melting profile method takes into account the inherent structure and dynamics of the DNA molecule, does not require sequence alignment prior to tree construction, and provides a means to verify the results experimentally. Our results therefore show that melting-profile-based classification of DNA sequences could be a useful tool for sequence analysis.

6.
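A preliminary study of using rRNA secondary-structure sequences in fungal systematics   Cited 1 time in total (self-citations: 0; citations by others: 1)
This paper uses, for the first time, nucleic acid secondary-structure features in place of nucleotide bases as the signal for inferring relationships among taxa, and constructs a structure-based phylogenetic tree for selected groups of ascomycetes. The method divides secondary-structure features into six substructure types, coded S (canonical base pair), Q (non-canonical base pair), I (single strand), B (bulge, or side loop), M (multi-branch loop), and H (hairpin). The secondary structure is then converted into a structure sequence, and the structure sequences are analyzed. The method frees rRNA analysis from base comparison alone, broadens its range of application, and offers clues to the relationship between molecular function and evolution. The results indicate that structure-sequence analysis can be used in ascomycete systematics; compared with nucleotide sequence analysis, the structural analysis appears to reflect the evolution of the ascocarp more clearly.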
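
The conversion from a secondary structure to a structure sequence can be illustrated with a deliberately simplified sketch. It assumes the structure is given in dot-bracket notation, treats only A-U and G-C as canonical pairs (so G-U counts as non-canonical here, which may differ from the paper's convention), and resolves only three of the six codes exactly: S, Q, and H. Every other unpaired position is labelled I; bulges (B) and multi-branch loops (M) are not distinguished. The function name `structure_sequence` is hypothetical.

```python
# Assumption: only Watson-Crick pairs are "canonical"; the paper may define this differently.
CANONICAL = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}


def structure_sequence(rna, dot_bracket):
    """Recode an RNA with a dot-bracket structure into S/Q/H/I codes.

    Simplification: B (bulge) and M (multi-branch loop) are not resolved;
    all unpaired positions outside hairpin loops are labelled I.
    """
    if len(rna) != len(dot_bracket):
        raise ValueError("sequence and structure must have equal length")

    # Match brackets with a stack to find each base's pairing partner.
    partner = {}
    stack = []
    for i, ch in enumerate(dot_bracket):
        if ch == "(":
            stack.append(i)
        elif ch == ")":
            j = stack.pop()
            partner[i], partner[j] = j, i

    # Paired bases become S or Q; unpaired bases start out as I.
    codes = []
    for i in range(len(rna)):
        if i in partner:
            pair = (rna[i], rna[partner[i]])
            codes.append("S" if pair in CANONICAL else "Q")
        else:
            codes.append("I")

    # A hairpin loop is an unpaired run closed directly by one base pair.
    for i, j in partner.items():
        if i < j and all(k not in partner for k in range(i + 1, j)):
            for k in range(i + 1, j):
                codes[k] = "H"
    return "".join(codes)


print(structure_sequence("AGGGAAAUCCA", ".(((...)))."))
```

The resulting strings can be aligned and compared like ordinary sequences over a six-letter alphabet, which is the step the abstract calls structure-sequence analysis.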

7.
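[Background] 16S rRNA gene sequence analysis is widely used for the classification and identification of bacteria, but it has limitations; housekeeping genes used as molecular markers have distinct advantages in phylogenetic analysis of closely related species and subspecies. [Objective] To study the ability of 16S rRNA, uvrC (excinuclease ABC, subunit C), and murE (UDP-N-acetylmuramoyl tripeptide synthase) gene sequences to discriminate among species and subspecies closely related to Lactobacillus casei. [Methods] Six L. casei strains isolated from traditional fermented milk were studied. uvrC and murE gene fragments were amplified by PCR and sequenced; together with published sequences from closely related species or subspecies of L. casei, they were used to calculate genetic distances and construct phylogenetic trees, which were compared with 16S rRNA gene sequence analysis. [Results] For Lactobacillus casei and closely related species, the phylogenetic trees built from uvrC, murE, and the concatenated genes (uvrC-murE) had topologies essentially consistent with the 16S rRNA gene result. The difference lay in sequence similarity, which was 79.00%-99.16%, 89.08%-99.20%, 76.56%-99.69%, and 99.58%-100% for uvrC, murE, uvrC-murE, and 16S rRNA, respectively. The 16S rRNA gene could not distinguish species and subspecies closely related to L. casei, whereas the housekeeping genes uvrC and murE distinguished them well, and concatenating uvrC and murE made the relationships between the test strains and the reference strains clearer. [Conclusion] The concatenated genes (uvrC-murE) can serve as an auxiliary tool to the 16S rRNA gene for rapid and accurate identification of species and subspecies closely related to L. casei.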

8.
9.
In the modern era, molecular genetic techniques are crucial in ecological studies, as well as in the classification, typing, and phylogenetic analysis of prokaryotes. These techniques mainly involve whole genome comparisons and PCR-derived experiments, including amplification of the 16S rRNA gene and various other housekeeping genes used in taxonomy, as well as MLST (multilocus sequence typing) and MLSA (multilocus sequence analysis) of different taxonomic bacterial groups. The gene encoding threonine-tRNA ligase (thrS) is potentially applicable as an identification and phylogenetic marker in bacteria. It is widely distributed in bacterial genomes and is subject to evolutionary selection pressure due to its important function in protein synthesis. In this study, specific primers were used to amplify a thrS gene fragment (~740 bp) in 36 type and 30 wild strains classified under the family Bifidobacteriaceae. The full-length gene has not yet been considered as a possible identification, classification, and phylogenetic marker in bifidobacteria. The thrS sequences revealed higher sequence variability (82.7% pairwise identity) among members of the family than that shown by 16S rRNA gene sequences (96.0%). Although discrepancies were found between the thrS-derived and previously reported whole genome phylogenetic analyses, the main phylogenetic groups of bifidobacteria were properly assigned. Most wild strains of bifidobacteria were better differentiated based on their thrS sequences than on their 16S rRNA gene identities. Phylogenetic confidence of the evaluated gene with respect to other alternative genetic markers widely used in taxonomy of bifidobacteria (fusA, GroELhsp60, pyrG, and rplB genes) was confirmed using the localized incongruence difference - Templeton analysis.
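
The pairwise identity figures quoted above (82.7% for thrS against 96.0% for 16S rRNA) are averages over sequence pairs. A minimal sketch of such a calculation follows; it assumes the sequences are already aligned to equal length and that columns containing a gap (`-`) in either sequence are skipped, which is one common convention but not necessarily the one the authors used. The function names are hypothetical.

```python
from itertools import combinations


def percent_identity(a, b):
    """Percent of identical residues over gap-free columns of two aligned sequences."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(a, b):
        if x == "-" or y == "-":
            continue  # assumption: gapped columns are ignored
        compared += 1
        matches += x == y
    return 100.0 * matches / compared if compared else 0.0


def mean_pairwise_identity(aligned):
    """Average percent identity over all unordered pairs of aligned sequences."""
    pairs = list(combinations(aligned, 2))
    return sum(percent_identity(a, b) for a, b in pairs) / len(pairs)


toy_alignment = ["ACGT", "ACGA", "ACGT"]  # made-up data for illustration
print(mean_pairwise_identity(toy_alignment))
```

A lower mean identity for a marker means more variable positions per pair, which is why a marker such as thrS can separate strains that 16S rRNA cannot.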

10.
Digital signal processing (DSP) techniques for biological sequence analysis continue to grow in popularity due to the inherent digital nature of these sequences. DSP methods have demonstrated early success in detecting coding regions in a gene. More recently, these methods have been used to establish DNA gene similarity. We present the inter-coefficient difference (ICD) transformation, a novel extension of the discrete Fourier transformation, which can be applied to any DNA sequence. The ICD method is a mathematical, alignment-free DNA comparison method that generates a genetic signature for any DNA sequence; the signature is used to generate relative measures of similarity among DNA sequences. We demonstrate our method on a set of insulin genes obtained from an evolutionarily wide range of species, and on a set of avian influenza viral sequences, which represents a set of highly similar sequences. We compare phylogenetic trees generated using our technique against trees generated using traditional alignment techniques for similarity and demonstrate that the ICD method produces a highly accurate tree without requiring an alignment prior to establishing sequence similarity.
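
The abstract does not define the ICD transformation, so it is not reproduced here. As background, the sketch below shows the standard DSP starting point it builds on: a DNA sequence is split into four 0/1 indicator sequences (one per base), each is passed through the discrete Fourier transform, and the squared magnitudes are summed. A peak at frequency index N/3 is the classic period-3 signal used to detect coding regions. The function names are hypothetical, and the plain O(N²) transform is chosen for clarity rather than speed.

```python
import cmath


def indicator_power(seq, k):
    """Sum over A, C, G, T of |X_b[k]|^2 for the DFT of each base's 0/1 indicator sequence."""
    n = len(seq)
    total = 0.0
    for base in "ACGT":
        # DFT coefficient k of the indicator sequence of `base`.
        coeff = sum(
            cmath.exp(-2j * cmath.pi * k * pos / n)
            for pos, ch in enumerate(seq)
            if ch == base
        )
        total += abs(coeff) ** 2
    return total


def power_spectrum(seq):
    """Indicator-sequence power at every frequency index 0..N-1."""
    return [indicator_power(seq, k) for k in range(len(seq))]


seq = "ATG" * 10  # toy sequence with perfect period 3
print(indicator_power(seq, len(seq) // 3))  # strong period-3 component
print(indicator_power(seq, 1))              # essentially zero
```

An alignment-free comparison then reduces each sequence to a fixed-length signature derived from such coefficients and compares signatures directly.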

11.
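Zhang Guoping, Wang Wei, Zhu Shijie, Shen Yu, Chang Hong. Sichuan Journal of Zoology (《四川动物》), 2005, 24(4): 500-506
The traditional classification of birds in the order Ciconiiformes has long been disputed, and most recent molecular systematic studies have used only a single gene, so the reliability of their conclusions needs further verification. Using separate and combined analyses of nuclear c-mos gene and mitochondrial 12S rRNA gene sequences, this paper applies molecular systematic methods to examine the phylogenetic relationships among 12 species from 6 families of Ciconiiformes. Partial nuclear c-mos sequences were determined for 6 species of Ciconiiformes and combined with c-mos and 12S rRNA gene sequences of other species from GenBank. After each data set was aligned with Clustal W, phylogenetic trees were built by maximum likelihood, neighbor joining, and maximum parsimony, with the red junglefowl as the outgroup. The trees indicate that the phylogenetic relationships among the 6 families of Ciconiiformes can be summarized as (Ardeidae, ((Threskiornithidae, Cathartidae), (Ciconiidae, (Balaenicipitidae, Scopidae)))). The relationships among the 7 genera of Ardeidae can be summarized as (Botaurus (Nycticorax (Ardeola (Ardea (Mesophoyx (Egretta, Casmerodius)))))). The trees based on each of the two single genes differ somewhat, whereas the tree based on the combined data has higher support and resolution than either single-gene tree. This indicates that building trees from combined molecular data that are genetically relatively independent gives higher reliability and resolution and is a better research approach.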

12.
Through multiple sequence alignment and phylogenetic analysis, the subgrouping of the crustacean hyperglycemic hormone (CHH) family was updated using the most complete, nonredundant sequence data set. All sequences from insects were clustered into a distinct subbranch with characters closer to CHH subfamily I. Several sequences that are controversial in their nomenclature and classification are discussed. The motif configuration of CHHs differs from that of molt-inhibiting hormone or gonad-inhibiting hormone in both the N and C termini. These two motifs approach each other in tertiary structure models, and the motif preference reveals the critical roles of these regions in functional specificity. Two types of exon organization of the CHH family genes were observed. Four-exon Chh genes were found in a wide range of pan-crustacean (crustacean and hexapod) taxa, except for the penaeid species, from which 3-exon Chh genes were reported. Meanwhile, the 3-exon structure was found in the Mih gene and Moih genes from one brachyuran species. Combining gene scanning with the exon splicing rules found in this study, we define three more novel sequences from two insect genomes. The pattern of the exon-exon junction within the mature peptide segment is preserved in all CHH family members. The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint first authors.

13.
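Phylogenetic diversity of culturable halophilic actinomycetes from the Tarim Basin, Xinjiang   Cited 3 times in total (self-citations: 0; citations by others: 3)
The diversity of 18 culturable halophilic actinomycete strains isolated from soil samples of hypersaline environments in the Tarim Basin was studied using pure-culture techniques and phylogenetic analysis based on 16S rRNA gene sequences. The results show that the 18 strains fell into three groups (Glycomycetaceae, Pseudonocardineae, and Nocardiopsaceae), and that members of four of the five validly published genera of halophilic actinomycetes were isolated. Most strains belonged to the genera Actinopolyspora (38.9%), Nocardiopsis (27.8%), and Streptomonospora (22.2%), which are the dominant groups of halophilic actinomycetes in the hypersaline environments of the Tarim Basin. Among the isolates, strain YIM 92370 showed 92% similarity to its closest species and formed an independent branch within the family Glycomycetaceae; it very probably represents a new genus of that family. The results indicate that the hypersaline environments of the Tarim Basin harbor a fairly rich phylogenetic diversity of halophilic actinomycetes, including potential new types of actinomycete resources.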

14.
15.
16.
With the development of genome sequencing, more whole genomes of microorganisms have been completed, and many methods have been introduced to reconstruct the phylogenetic tree of those microorganisms from information extracted from the whole genomes, through various ways of transforming or mapping the whole genome sequences into other forms that can describe evolutionary distance in a new way. We think it might be possible that there exists information buried in the whole genome and transferred along lineages, which remains stable and is more essential than the sequence conservation of individual genes or the arrangement of some genes of a selected set. We need to find one measurement that can involve as many phylogenetic features as possible beyond the genome sequence itself. We converted each genome sequence of the microorganisms into another linear sequence representing the functional structure of the sequence, used a new information function to calculate the discrepancy between sequences and obtain a distance matrix of the genomes, and built a phylogenetic tree with the neighbor-joining method. The resulting tree shows that the major lineages are consistent with the result based on their 16S rRNA sequences. Our method discovered one phylogenetic feature derived from the genome sequences and the encoded genes that can rebuild the phylogenetic tree correctly. The mapping of a genome sequence to its new form, representing the relative positions of the functional genes, provides a new way to measure phylogenetic relationships, and with more specific classification of gene functions the result could be more sensitive.

17.
Given the lack of direct observational data relating to transposition of Stowaway miniature inverted repeat transposable elements, phylogenetic methods may provide a means of generating data that adds to our knowledge of these elements. In a phylogenetic framework the evolutionary history of homologous elements may be traced, and the nucleotide sequence of elements at or close to the time of insertion can be reconstructed. Based on a phylogeny of the diploid species of the genus Hordeum we explore evolutionary aspects of four non-homologous groups of Stowaway elements inserted into three nuclear genes: nucellin, xylose isomerase, and barley leucine zipper 1. The data illustrate how elements starting from a high degree of sequence similarity between terminal inverted repeat regions gradually degrade, and confirm previous notions about preferential insertion at particular TA target sites. It is shown how creation of consensus sequences as estimates of ancestral elements may be positively misleading. The Stowaway family of transposable elements is often further divided into subfamilies based on sequence similarity between elements. Sequence similarity data from the elements discovered in the xylose isomerase gene, and other elements found through BLAST searches in GenBank, reveal inconsistency in the rules used for classification. In order to reflect natural groups, a classification of transposable elements must be based on phylogenetic evidence rather than raw similarity.  © The Willi Hennig Society 2009.

18.
MOTIVATION: Most molecular phylogenies are based on sequence alignments. Consequently, they fail to account for modes of sequence evolution that involve frequent insertions or deletions. Here we present a method for generating accurate gene and species phylogenies from whole genome sequence that makes use of short character string matches not placed within explicit alignments. In this work, the singular value decomposition of a sparse tetrapeptide frequency matrix is used to represent the proteins of organisms uniquely and precisely as vectors in a high-dimensional space. Vectors of this kind can be used to calculate pairwise distance values based on the angle separating the vectors, and the resulting distance values can be used to generate phylogenetic trees. Protein trees so derived can be examined directly for homologous sequences. Alternatively, vectors defining each of the proteins within an organism can be summed to provide a vector representation of the organism, which is then used to generate species trees. RESULTS: Using a large mitochondrial genome dataset, we have produced species trees that are largely in agreement with previously published trees based on the analysis of identical datasets using different methods. These trees also agree well with currently accepted phylogenetic theory. In principle, our method could be used to compare much larger bacterial or nuclear genomes in full molecular detail, ultimately allowing accurate gene and species relationships to be derived from a comprehensive comparison of complete genomes. In contrast to phylogenetic methods based on alignments, sequences that evolve by relative insertion or deletion would tend to remain recognizably similar.
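
The pipeline described above (tetrapeptide counts, singular value decomposition, angle-based distances, summed organism vectors) can be sketched with NumPy. This is a toy illustration under several assumptions, not the authors' implementation: the matrix is dense and has one row per protein, only tetrapeptides that actually occur are given columns, the retained rank is an arbitrary parameter, and the function names and example peptides are made up.

```python
import numpy as np


def tetrapeptide_matrix(proteins):
    """Rows = proteins, columns = observed tetrapeptides, entries = overlapping counts."""
    words = sorted({p[i:i + 4] for p in proteins for i in range(len(p) - 3)})
    index = {w: j for j, w in enumerate(words)}
    matrix = np.zeros((len(proteins), len(words)))
    for row, p in enumerate(proteins):
        for i in range(len(p) - 3):
            matrix[row, index[p[i:i + 4]]] += 1.0
    return matrix


def reduced_vectors(matrix, rank):
    """Represent each protein by its coordinates on the top `rank` singular directions."""
    u, s, _ = np.linalg.svd(matrix, full_matrices=False)
    return u[:, :rank] * s[:rank]


def angular_distance(a, b):
    """Angle in radians between two vectors; 0 means identical direction."""
    cos = np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))
    return float(np.arccos(np.clip(cos, -1.0, 1.0)))


def organism_vector(protein_vectors):
    """Sum the vectors of all proteins belonging to one organism."""
    return np.sum(protein_vectors, axis=0)


proteins = ["MKTAYIAKQR", "MKTAYIAKQR", "GGSGGSGGSG"]  # toy sequences
vectors = reduced_vectors(tetrapeptide_matrix(proteins), rank=2)
print(angular_distance(vectors[0], vectors[1]))  # identical proteins
print(angular_distance(vectors[0], vectors[2]))  # no shared tetrapeptides
```

The pairwise angles form a distance matrix that any distance-based tree builder, such as neighbor joining, can consume. For real proteomes a sparse matrix and a truncated SVD routine would be needed, since the full tetrapeptide space has 160,000 columns.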

19.
Chemokines are small secreted proteins with important roles in immune responses. They share a conserved three-dimensional (3D) structure, the so-called IL8-like chemokine fold, which is supported by disulfide bridges characteristic of this protein family. Sequence- and profile-based computational methods have been proficient in discovering novel chemokines by making use of their sequence-conserved cysteine patterns. However, it has recently been shown that some chemokines escaped annotation by these methods because of low sequence similarity to known chemokines and a different arrangement of cysteines in sequence and in 3D. Innovative methods that overcome the limitations of current techniques may allow the discovery of new remote homologs in the still functionally uncharacterized fraction of the human genome. We report a novel computational approach for proteome-wide identification of remote homologs of the chemokine family that uses fold recognition techniques in combination with a scaffold-based automatic mapping of disulfide bonds to define a 3D profile of the chemokine protein family. By applying our methodology to all currently uncharacterized human protein sequences, we have discovered two novel proteins that, without having significant sequence similarity to known chemokines or characteristic cysteine patterns, show strong structural resemblance to known anti-HIV chemokines. Detailed computational analysis and experimental structural investigations based on mass spectrometry and circular dichroism support our structural predictions and highlight several other chemokine-like features. The results obtained support their functional annotation as putative novel chemokines and encourage further experimental characterization. The identification of remote homologs of human chemokines may provide new insights into the molecular mechanisms causing pathologies such as cancer or AIDS, and may contribute to the development of novel treatments.
In addition, the genome-wide applicability of our methodology based on 3D protein family profiles may open up new possibilities for improving and accelerating protein function annotation processes.

20.
MOTIVATION: Many proposed statistical measures can efficiently compare biological sequences to further infer their structures, functions, and evolutionary information. They are related in spirit because all the ideas for sequence comparison try to use information on k-word distributions, a Markov model, or both. Motivated by adding k-word distributions to a Markov model directly, we investigated two novel statistical measures for sequence comparison, called wre.k.r and S2.k.r. RESULTS: The proposed measures were tested by similarity search, evaluation on functionally related regulatory sequences, and phylogenetic analysis. This offers a systematic and quantitative experimental assessment of our measures. Moreover, we compared our results with those of alignment-based and alignment-free methods. We grouped our experiments into two sets. The first, performed via ROC (receiver operating characteristic) analysis, aims at assessing the intrinsic ability of our statistical measures to search for similar sequences in a database and to discriminate functionally related regulatory sequences from unrelated sequences. The second aims at assessing how well our statistical measures perform in phylogenetic analysis. The experimental assessment demonstrates that our similarity measures, which incorporate k-word distributions into a Markov model, are more efficient.
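
The abstract names two ingredients, k-word distributions and a Markov model, without defining wre.k.r or S2.k.r, so those measures are not reproduced here. The sketch below only illustrates the ingredients: it counts k-words, fits a first-order Markov model (with add-one smoothing, an assumption) to the same sequence, and measures how far the observed k-word distribution departs from the model using relative entropy. All function names are hypothetical, and the input is assumed to contain only A, C, G, and T.

```python
from collections import Counter
from math import log

ALPHABET = "ACGT"


def kword_counts(seq, k):
    """Counts of all overlapping words of length k."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def markov1_word_prob(seq, word):
    """Probability of `word` under a first-order Markov model fitted to `seq`.

    Add-one smoothing keeps unseen letters and transitions at non-zero probability.
    """
    singles = Counter(seq)
    pairs = Counter(seq[i:i + 2] for i in range(len(seq) - 1))
    prob = (singles[word[0]] + 1) / (len(seq) + len(ALPHABET))
    for a, b in zip(word, word[1:]):
        row_total = sum(pairs[a + c] for c in ALPHABET)
        prob *= (pairs[a + b] + 1) / (row_total + len(ALPHABET))
    return prob


def relative_entropy_vs_markov(seq, k):
    """Relative entropy of the observed k-word frequencies against the Markov model."""
    counts = kword_counts(seq, k)
    total = sum(counts.values())
    return sum(
        (c / total) * log((c / total) / markov1_word_prob(seq, w))
        for w, c in counts.items()
    )


seq = "ACGTACGTTTGACCA"  # toy sequence
print(kword_counts(seq, 2).most_common(3))
print(relative_entropy_vs_markov(seq, 3))
```

A sequence-comparison measure in this family would compute such model-adjusted word statistics for two sequences and combine them into a single dissimilarity score.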
