首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
Genome sequence comparison between evolutionarily distant species revealed ultraconserved elements (UCEs) among mammals under strong purifying selection. Most of them were also conserved among vertebrates. Because they tend to be located in the flanking regions of developmental genes, they would have fundamental roles in creating vertebrate body plans. However, the evolutionary origin and selection mechanism of these UCEs remain unclear. Here we report that UCEs arose in primitive vertebrates, and gradually grew in vertebrate evolution. We searched for UCEs in two teleost fishes, Tetraodon nigroviridis and Oryzias latipes, and found 554 UCEs with 100% identity over 100 bps. Comparison of teleost and mammalian UCEs revealed 43 pairs of common, jawed-vertebrate UCEs (jUCE) with high sequence identities, ranging from 83.1% to 99.2%. Ten of them retain lower similarities to the Petromyzon marinus genome, and the substitution rates of four non-exonic jUCEs were reduced after the teleost-mammal divergence, suggesting that robust conservation had been acquired in the jawed vertebrate lineage. Our results indicate that prototypical UCEs originated before the divergence of jawed and jawless vertebrates and have been frozen as perfect conserved sequences in the jawed vertebrate lineage. In addition, our comparative sequence analyses of UCEs and neighboring regions resulted in a discovery of lineage-specific conserved sequences. They were added progressively to prototypical UCEs, suggesting step-wise acquisition of novel regulatory roles. Our results indicate that conserved non-coding elements (CNEs) consist of blocks with distinct evolutionary history, each having been frozen since different evolutionary era along the vertebrate lineage.  相似文献   

2.
Signature sequences are contiguous patterns of amino acids 10-50 residues long that are associated with a particular structure or function in proteins. These may be of three types (by our nomenclature): superfamily signatures, remnant homologies, and motifs. We have performed a systematic search through a database of protein sequences to automatically and preferentially find remnant homologies and motifs. This was accomplished in three steps: 1. We generated a nonredundant sequence database. 2. We used BLAST3 (Altschul and Lipman, Proc. Natl. Acad. Sci. U.S.A. 87:5509-5513, 1990) to generate local pairwise and triplet sequence alignments for every protein in the database vs. every other. 3. We selected "interesting" alignments and grouped them into clusters. We find that most of the clusters contain segments from proteins which share a common structure or function. Many of them correspond to signatures previously noted in the literature. We discuss three previously recognized motifs in detail (FAD/NAD-binding, ATP/GTP-binding, and cytochrome b5-like domains) to demonstrate how the alignments generated by our procedure are consistent with previous work and make structural and functional sense. We also discuss two signatures (for N-acetyltransferases and glycerol-phosphate binding) which to our knowledge have not been previously recognized.  相似文献   

3.
4.
Evolutionary conservation of kinetochore protein sequences in plants   总被引:5,自引:0,他引:5  
The evolutionary conservation of structural/functional kinetochore proteins has been studied on isolated nuclei and pro-/metaphase chromosomes of mono- and dicot plants. The cross-reactivities of antibodies against human CENPC, CENPE and CENPF, and against maize CENPCa with the centromeric regions of mitotic chromosomes of Vicia faba and/or Hordeum vulgare are shown. Putative homologs of the kinetochore protein SKP1 (suppressor of kinetochore protein 1p of yeast) were found in both species and of CBF5p (centromere binding factor 5 of yeast) in barley. Antibodies against synthetic peptides derived from partial sequences encoding these proteins were produced and recognized the centromeric regions on mitotic chromosomes as detected by indirect immunofluorescence.  相似文献   

5.
The sequences of related proteins show the alternance of conserved and variable regions. This fact is generally seen as a reverberation of 3 D constraints onto 1 D structures. Although the exact meaning of such constraints remains elusive, conserved regions can be extracted from protein chains and used to align them. We developed a program that efficiently performs this task. The program constructs symbolic motifs fitting a target subsequence present in every chain without requiring any insertion or deletion. However, a motif can be obliterated by substitutions when it is found in a sequence. The motifs formally consist in aminoacid symbols separated (and virtually preceded and followed) by a variable number of wild-card symbols. A wild-card, which can match any aminoacid of the chains (with no increment of score), represents a variable site within conserved regions. Different motifs are progressively built by substituting a wild-card with an aminoacid symbol within or beside preexisting motifs. Only those motifs showing an outstanding association of high matching score over all chains, and of low deviation between extreme scores over individual chains are selected for making the next generation. Starting with a null motif, the construction ends when no new aminoacid can be introduced into the current motifs. A surviving motif is then considered valid if it maps without ambiguity a unique region in every sequence, and the motif with highest score is finally selected. The construction of new motifs is then reinitated for the left and right parts of the sequences, after these have been split by the previously selected motif.(ABSTRACT TRUNCATED AT 250 WORDS)  相似文献   

6.
We present an algorithm to detect protein sub-structural motifs from primary sequence. The input to the algorithm is a set of aligned multiple protein sequences. It uses wavelet transforms to decompose protein sequences represented numerically by different indices (such as polarity, accessible surface area or electron-ion integration potentials of the amino acids). The numerical representation of a protein sequence has significant correlation with its biological activity, thus common motifs are expected to be observable from the wavelet spectrum. The decomposed signals are then up-sampled and similarity search techniques are used to identify similar regions across all the proteins at multiple scales. Results indicate that wavelet transform techniques are a promising approach for rapid motif detection.  相似文献   

7.
8.
9.
The eukaryotic porin superfamily consists of two families, voltage-dependent anion channel (VDAC) and Tom40, which are both located in the mitochondrial outer membrane. In Trypanosoma brucei, only a single member of the VDAC family has been described. We report the detection of two additional eukaryotic porin-like sequences in T. brucei. By bioinformatic means, we classify both as putative VDAC isoforms.  相似文献   

10.
11.
现有92株芜菁花叶病毒(TuMV)的全基因组序列已在GenBank报道,据分析报道其中58株不含重组序列。利用系统聚类法对92株TuMV的全基因组序列和58株TuMV全基因组序列的相对密码子频率RSCU值进行聚类分析。同时利用系统发育分析方法分析了这92株和58株TuMV全基因组序列。结果发现,92株芜菁花叶病毒株的密码子偏性聚类树与其系统进化树的一致度很低;而不含重组序列的58株芜菁花叶病毒株的密码子偏性聚类树与其系统进化树的一致度却非常高,且与寄生宿主类型基本对应。这表明在不存在重组的情况下,TuMV密码子频率的偏性可能是宿主内的一种选择压力,影响TuMV基因组的点突变进化方向,促使TuMV适应宿主内环境。  相似文献   

12.
MOTIVATION: Data on protein-protein interactions (PPIs) are increasing exponentially. To date, large-scale protein interaction networks are available for human and most model species. The arising challenge is to organize these networks into models of cellular machinery. As in other biological domains, a comparative approach provides a powerful basis for addressing this challenge. RESULTS: We develop a probabilistic model for protein complexes that are conserved across two species. The model describes the evolution of conserved protein complexes from an ancestral species by protein interaction attachment and detachment and gene duplication events. We apply our model to search for conserved protein complexes within the PPI networks of yeast and fly, which are the largest networks in public databases. We detect 150 conserved complexes that match well-known complexes in yeast and are coherent in their functional annotations both in yeast and in fly. In comparison with two previous approaches, our model yields higher specificity and sensitivity levels in protein complex detection. AVAILABILITY: The program is available upon request.  相似文献   

13.
Bøttger P  Pedersen L 《The FEBS journal》2005,272(12):3060-3074
The mammalian members of the inorganic phosphate (P(i)) transporter (PiT) family, the type III sodium-dependent phosphate (NaP(i)) transporters PiT1 and PiT2, have been assigned housekeeping P(i) transport functions and are suggested to be involved in chondroblastic and osteoblastic mineralization and ectopic calcification. The PiT family members are conserved throughout all kingdoms and use either sodium (Na+) or proton (H+) gradients to transport P(i). Sequence logo analyses revealed that independent of their cation dependency these proteins harbor conserved signature sequences in their N- and C-terminal ends with the common core consensus sequence GANDVANA. With the exception of 10 proteins from extremophiles all 109 proteins analyzed carry an aspartic acid in one or both of the signature sequences. We changed either of the highly conserved aspartates, Asp28 and Asp506, in the N- and C-terminal signature sequences, respectively, of human PiT2 to asparagine and analyzed P(i) uptake function in Xenopus laevis oocytes. Both mutant proteins were expressed at the cell surface of the oocytes but exhibited knocked out NaP(i) transport function. Human PiT2 is also a retroviral receptor and we have previously shown that this function can be exploited as a control for proper processing and folding of mutant proteins. Both mutant transporters displayed wild-type receptor functions implying that their overall architecture is undisturbed. Thus the presence of an aspartic acid in either of the PiT family signature sequences is critical for the Na+-dependent P(i) transport function of human PiT2. The conservation of the aspartates among proteins using either Na+- or H+-gradients for P(i) transport suggests that they are involved in H+-dependent P(i) transport as well. Current results favor a membrane topology model in which the N- and C-terminal PiT family signature sequences are positioned in intra- and extracellular loops, respectively, suggesting that they are involved in related functions on either side of the membrane. The present data are in agreement with a possible role of the signature sequences in translocation of cations.  相似文献   

14.

Background  

One of the most evident achievements of bioinformatics is the development of methods that transfer biological knowledge from characterised proteins to uncharacterised sequences. This mode of protein function assignment is mostly based on the detection of sequence similarity and the premise that functional properties are conserved during evolution. Most automatic approaches developed to date rely on the identification of clusters of homologous proteins and the mapping of new proteins onto these clusters, which are expected to share functional characteristics.  相似文献   

15.
Microvesicles (exosomes) are important mediators of intercellular communication, playing a role in immune regulation, cancer progression, and the spread of infectious agents. The biological functions of these small vesicles are dependent on their composition, which is regulated by mechanisms that are not well understood. Although numerous proteomic studies of these particles exist, little is known about their glycosylation. Carbohydrates are involved in protein trafficking and cellular recognition. Glycomic analysis may thus provide valuable insights into microvesicle biology. In this study, we analyzed glycosylation patterns of microvesicles derived from a variety of biological sources using lectin microarray technology. Comparison of the microvesicle glycomes with their parent cell membranes revealed both enrichment and depletion of specific glycan epitopes in these particles. These include enrichment in high mannose, polylactosamine, α-2,6 sialic acid, and complex N-linked glycans and exclusion of terminal blood group A and B antigens. The polylactosamine signature derives from distinct glycoprotein cohorts in microvesicles of different origins. Taken together, our data point to the emergence of microvesicles from a specific membrane microdomain, implying a role for glycosylation in microvesicle protein sorting.  相似文献   

16.
Characterization of protein primary sequences based on partial ordering   总被引:1,自引:0,他引:1  
In this paper, we present a new approach to characterize protein sequences. Based on orderings of the 20 natural amino acids which reflect some of their physico-chemical properties, we construct an augmented Hasse matrix for each protein sequence. Furthermore, the normalized leading eigenvalues of these matrices are computed and considered as invariants for the protein sequences. Finally, we make a comparison for the similarity/diversity of nine different protein sequences.  相似文献   

17.
Uricase is a peroxisomal liver enzyme that catalyzes the oxidation of uric acid to allantoin during purine catabolism. It is present in vertebrates in most species of fish, amphibians, and mammals but its enzymatic activity is absent in hominoids. We have used Western blot analysis in a comparative study to establish a homology among uricases from different species of vertebrates. Using antibodies against denatured rat liver uricase, we have been able to detect for the first time cross-reactivity with the uricase of species ranging in the evolutionary scale from fish to primates (macaque). Our results suggest that these uricases have a common evolutionary origin. Our conclusion is also supported by the fact that uricase from different species exhibits identical tissue, subcellular localization, and similarity of molecular weights. This study was extended to include human liver samples. Using the same approach but with a more sensitive detection system (alkaline phosphatase instead of peroxidase), we did not detect polypeptide species related to rat uricase in human fetal or adult liver samples, which indicates that during hominoid evolution, the mutational event responsible for the loss of uricase activity in humans precluded formation of a translatable uricase mRNA.  相似文献   

18.
A sensitive technique for protein sequence motif recognition based on neural networks has been developed. It involves three major steps. (1) At each appropriate alignment position of a set of N matched sequences, a set of N aligned oligopeptides is specified with preselected window length. N neural nets are subsequently and successively trained on N-1 amino acid spans after eliminating each ith oligopeptide. A test for recognition of each of the ith spans is performed. The average neural net recognition over N such trials is used as a measure of conservation for the particular windowed region of the multiple alignment. This process is repeated for all possible spans of given length in the multiple alignment. (2) The M most conserved regions are regarded as motifs and the oligopeptides within each are used to train intensively M individual neural networks. (3) The M networks are then applied in a search for related primary structures in a databank of known protein sequences. The oligopeptide spans in the database sequence with strongest neural net output for each of the M networks are saved and then scored according to the output signals and the proper combination that follows the expected N- to C-terminal sequence order. The motifs from the database with highest similarity scores can then be used to retrain the M neural nets, which can be subsequently utilized for further searches in the databank, thus providing even greater sensitivity to recognize distant familial proteins. This technique was successfully applied to the integrase, DNA-polymerase and immunoglobulin families.  相似文献   

19.
Insertions or deletions (indels) of amino acids residues have been recognized as an important source of genetic and structural divergence between paralogous Bcl-2 family members. However, these signature sequences have not so far been extensively investigated amongst orthologous Bcl-2 family proteins. Bcl2l10 is an antiapoptotic member of the Bcl-2 family that has evolved rapidly throughout the vertebrate lineage and which shows conserved abundant expression in eggs and oocytes. In this paper, we have unraveled two major sites of divergence between human Bcl2l10 and its vertebrate homologs. The first one provides length variation at the N-terminus (before the BH4 domain) and the second one is located between the predicted α5-α6 pore-forming helices, providing an unprecedented case in the superfamily of helix-bundled pore-forming proteins. These two particular indels were studied phylogenetically and through biochemical and cell biological techniques, including truncation and site-directed mutagenesis. While deletion of the N-terminal extension had no significant functional impact in HeLa cells, our results suggest that the human Bcl2l10 protein evolved a calcium-binding motif in its α5-α6 interhelical region by acquiring critical negatively charged residues. Considering the reliance of female eggs on calcium-dependent proteins and calcium-regulated processes and the exceptional longevity of oocytes in the primate lineage, we propose that this microstructural variation may be an adaptive feature associated with high maternal expression of this Bcl-2 family member.  相似文献   

20.
管维红 《生物信息学》2012,10(3):194-198
蛋白质序列特性的研究对于蛋白质的结构及功能具有重要意义。该文为了研究蛋白质序列是否具有混沌行为,先将蛋白质序列通过氨基酸电子离子相互作用势(electron interaction potential,EIIP)转化为时间序列,再根据混沌理论对其进行相空间重构,利用去偏自相关系数,经典G-P算法确定系统的时间延迟t和嵌入维数m,系统的最大Lyapunov指数则用改进的最大Lyapunov指数计算方法计算,其结果绝大多数为正,从而确认了蛋白质时间序列的混沌行为,并对特例进行了说明。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号