Similar articles
1.
Savill NJ  Hoyle DC  Higgs PG 《Genetics》2001,157(1):399-411
We test models for the evolution of helical regions of RNA sequences, where the base pairing constraint leads to correlated compensatory substitutions occurring on either side of the pair. These models are of three types: 6-state models include only the four Watson-Crick pairs plus GU and UG; 7-state models include a single mismatch state that combines all of the 10 possible mismatches; 16-state models treat all mismatch states separately. We analyzed a set of eubacterial ribosomal RNA sequences with a well-established phylogenetic tree structure. For each model, the maximum-likelihood values of the parameters were obtained. The models were compared using the Akaike information criterion, the likelihood-ratio test, and Cox's test. With a high significance level, models that permit a nonzero rate of double substitutions performed better than those that assume zero double substitution rate. Some models assume symmetry between GC and CG, between AU and UA, and between GU and UG. Models that relaxed this symmetry assumption performed slightly better, but the tests did not all agree on the significance level. The most general time-reversible model significantly outperformed any of the simplifications. We consider the relative merits of all these models for molecular phylogenetics.
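As a rough illustration of two of the comparison criteria named in this abstract, the sketch below computes the Akaike information criterion and a likelihood-ratio test for a pair of nested models. The function names, the log-likelihood values, and the parameter counts are all hypothetical and are not taken from the paper; the p-value shortcut applies only when the nested models differ by exactly one free parameter.

```python
import math


def aic(log_likelihood: float, n_params: int) -> float:
    """Akaike information criterion, 2k - 2 ln L; lower values are preferred."""
    return 2.0 * n_params - 2.0 * log_likelihood


def lrt_pvalue_df1(lnl_null: float, lnl_alt: float) -> float:
    """Likelihood-ratio test p-value for nested models differing by ONE parameter.

    The statistic 2 * (lnL_alt - lnL_null) is compared with a chi-square
    distribution with 1 degree of freedom, whose survival function is
    erfc(sqrt(x / 2)).
    """
    stat = 2.0 * (lnl_alt - lnl_null)
    if stat <= 0.0:
        return 1.0
    return math.erfc(math.sqrt(stat / 2.0))


# Hypothetical fits (illustrative numbers only, not results from the paper):
# a restricted model with zero double-substitution rate, and a model that
# adds one free parameter for double substitutions.
restricted = {"lnL": -5230.0, "k": 5}
with_doubles = {"lnL": -5105.0, "k": 6}

aic_restricted = aic(restricted["lnL"], restricted["k"])
aic_with_doubles = aic(with_doubles["lnL"], with_doubles["k"])
p_value = lrt_pvalue_df1(restricted["lnL"], with_doubles["lnL"])
```

With these made-up numbers the model allowing double substitutions has the lower AIC, which mirrors the direction of the reported finding but says nothing about its actual magnitude.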

2.
3.

Background  

We are interested in the problem of predicting secondary structure for small sets of homologous RNAs, by incorporating limited comparative sequence information into an RNA folding model. The Sankoff algorithm for simultaneous RNA folding and alignment is a basis for approaches to this problem. There are two open problems in applying a Sankoff algorithm: development of a good unified scoring system for alignment and folding and development of practical heuristics for dealing with the computational complexity of the algorithm.

4.
SUMMARY: Circles is a program for inferring RNA secondary structure using maximum weight matching. The program can read in an alignment in FASTA, ClustalW, or NEXUS format, compute a maximum weight matching, and export one or more secondary structures in various file formats. AVAILABILITY: The program is available at no cost from http://taxonomy.zoology.gla.ac.uk/rod/circles/ and requires Windows 95/98/NT. CONTACT: r.page@bio.gla.ac.uk
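The idea of maximum weight matching on alignment columns can be sketched as follows. This is not the Circles implementation: the weighting scheme (a count of sequences showing a Watson-Crick or GU pair at two columns) and the exhaustive bitmask search are assumptions chosen for brevity, and the search is only practical for a handful of columns. A general matching also permits crossing pairs and ignores minimum loop lengths.

```python
from functools import lru_cache

# Canonical pairs plus the GU wobble pair, in both orientations.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def pair_weights(alignment):
    """Weight of columns (i, j) = number of sequences that can pair there."""
    n = len(alignment[0])
    weights = {}
    for i in range(n):
        for j in range(i + 1, n):
            score = sum((seq[i], seq[j]) in PAIRS for seq in alignment)
            if score > 0:
                weights[(i, j)] = score
    return weights


def max_weight_matching(n, weights):
    """Exact maximum weight matching over n columns by memoized recursion.

    Returns (total_weight, pairs). Each column is used in at most one pair.
    """

    @lru_cache(maxsize=None)
    def best(mask):
        # Bit k of mask is set while column k is still available.
        if mask == 0:
            return 0, ()
        i = (mask & -mask).bit_length() - 1  # lowest available column
        rest = mask & ~(1 << i)
        top = best(rest)  # option 1: leave column i unpaired
        for j in range(i + 1, n):  # option 2: pair i with an available j
            if (rest >> j) & 1 and (i, j) in weights:
                score, pairs = best(rest & ~(1 << j))
                candidate = (score + weights[(i, j)], ((i, j),) + pairs)
                if candidate[0] > top[0]:
                    top = candidate
        return top

    return best((1 << n) - 1)


# Hand-made weights: pairing (0, 1) alone scores 4, but (0, 3) + (1, 2) scores 5.
toy_weights = {(0, 3): 3, (1, 2): 2, (0, 1): 4}
total, chosen = max_weight_matching(4, toy_weights)
```

In the toy example the greedy choice of the single heaviest edge is beaten by the two nested pairs, which is the situation a matching algorithm is meant to resolve.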

5.
Lisi V  Major F 《RNA (New York, N.Y.)》2007,13(9):1537-1545
Despite an increasing number of experimentally determined RNA structures, the gap between the number of structures and that of RNA families is still growing. To overcome this limitation, efficient and reliable RNA modeling methodologies must be developed. In order to reach this goal, here, we show how triloop sequence-structure relationships have been inferred through a systematic analysis of all triloops found in available high-resolution structures. The structural annotation of all triloops allowed us to define discrete states of the triloop's conformational space, and therefore an explicit sequence-to-structure relation. The sequence-structure relationships inferred from this explicit relation are presented in a convenient modeling table that provides a limited set of possible three-dimensional structures given any triloop sequence. The table is indexed by the two nucleotides that form the triloop's flanking base pair, since they are shown to provide the most information about the triloop three-dimensional structures. We also report the observations in the X-ray crystallographic structures of important conformational variations, which we believe might be the result of RNA dynamics.

6.
A Macintosh Hypertalk program (Hypercard ‘stack’) for use in phylogenetic comparative analysis of RNA structure is described. The program identifies covariations and compensatory changes in RNA sequence alignments, for use in the construction of secondary structure models or the identification of tertiary interactions. The results of an analysis are presented either as a list of positions in the alignment which covary, or as a 2-dimensional matrix in which potential helices in the secondary structure appear as diagonal patterns. Received on January 7, 1991; accepted on March 19, 1991
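One common way to score covariation between alignment columns is mutual information. The abstract does not say which measure the Hypercard program uses, so the sketch below is an assumed stand-in rather than a reconstruction; the function names and the four-sequence toy alignment are hypothetical.

```python
import math
from collections import Counter


def mutual_information(alignment, i, j):
    """Mutual information in bits between alignment columns i and j."""
    n = len(alignment)
    col_i = Counter(seq[i] for seq in alignment)
    col_j = Counter(seq[j] for seq in alignment)
    joint = Counter((seq[i], seq[j]) for seq in alignment)
    mi = 0.0
    for (a, b), count in joint.items():
        p_ab = count / n
        mi += p_ab * math.log2(p_ab / ((col_i[a] / n) * (col_j[b] / n)))
    return mi


def covariation_matrix(alignment):
    """Square matrix of pairwise column scores; the diagonal is set to 0."""
    length = len(alignment[0])
    return [
        [mutual_information(alignment, i, j) if i != j else 0.0 for j in range(length)]
        for i in range(length)
    ]


# Toy alignment: columns 0 and 2 change together, column 1 is invariant.
toy = ["GAC", "CAG", "AAU", "UAA"]
matrix = covariation_matrix(toy)
```

Here columns 0 and 2 score 2 bits while any pair involving the invariant column scores 0. In a real alignment, runs of high-scoring pairs (i, j), (i+1, j-1), ... along an antidiagonal would be the candidate helices described in the abstract.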

7.

Background

Analysis of genomic sequence allows characterization of genome content and organization, and access beyond gene-coding regions for identification of functional elements. BAC libraries, where relatively large genomic regions are made readily available, are especially useful for species without a fully sequenced genome and can increase genomic coverage of phylogenetic and biological diversity. For example, no butterfly genome is yet available despite the unique genetic and biological properties of this group, such as diversified wing color patterns. The evolution and development of these patterns is being studied in a few target species, including Bicyclus anynana, where a whole-genome BAC library allows targeted access to large genomic regions.

Methodology/Principal Findings

We characterize ∼1.3 Mb of genomic sequence around 11 selected genes expressed in B. anynana developing wings. Extensive manual curation of in silico predictions, also making use of a large dataset of expressed genes for this species, identified repetitive elements and protein coding sequence, and highlighted an expansion of Alcohol dehydrogenase genes. Comparative analysis with orthologous regions of the lepidopteran reference genome allowed assessment of conservation of fine-scale synteny (with detection of new inversions and translocations) and of DNA sequence (with detection of high levels of conservation of non-coding regions around some, but not all, developmental genes).

Conclusions

The general properties and organization of the available B. anynana genomic sequence are similar to the lepidopteran reference, despite the more than 140 MY divergence. Our results lay the groundwork for further studies of new interesting findings in relation to both coding and non-coding sequence: 1) the Alcohol dehydrogenase expansion with higher similarity between the five tandemly-repeated B. anynana paralogs than with the corresponding B. mori orthologs, and 2) the high conservation of non-coding sequence around the genes wingless and Ecdysone receptor, both involved in multiple developmental processes including wing pattern formation.

8.
9.
If it is assumed that the primary sequence determines the three-dimensional folded structure of a protein, then the regular folding patterns, such as alpha-helix, beta-sheet, and other ordered patterns in the three-dimensional structure must correspond to the periodic distribution of the physical properties of the amino acids along the primary sequence. An AutoRegressive Moving Average (ARMA) model method of spectral analysis is applied to analyze protein sequences represented by the hydrophobicity of their amino acids. The results for several membrane proteins of known structures indicate that the periodic distribution of hydrophobicity of the primary sequence is closely related to the regular folding patterns in a protein's three-dimensional structure. We also applied the method to the transmembrane regions of acetylcholine receptor alpha subunit and Shaker potassium channel for which no atomic resolution structure is available. This work is an extension of our analysis of globular proteins by a similar method.
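The link between hydrophobicity periodicity and folding pattern can be illustrated with a plain Fourier periodogram. This is a deliberately simpler stand-in for the ARMA spectral estimator used in the paper, not an implementation of it. The choice of the Kyte-Doolittle hydropathy scale, the function name, and the test peptide are assumptions made for the example.

```python
import cmath
import math

# Kyte-Doolittle hydropathy values (an assumed choice of scale).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def power_at_period(sequence: str, period: float) -> float:
    """Periodogram power of the mean-centred hydropathy profile at one period.

    period is in residues: about 3.6 for an alpha-helix, about 2 for a
    beta-strand with alternating exposed and buried side chains.
    """
    values = [KD[aa] for aa in sequence]
    mean = sum(values) / len(values)
    omega = 2.0 * math.pi / period
    total = sum((v - mean) * cmath.exp(-1j * omega * k) for k, v in enumerate(values))
    return abs(total) ** 2 / len(values)


# Hypothetical peptide with strictly alternating hydrophobic/charged residues.
peptide = "LKLKLKLK"
beta_like = power_at_period(peptide, 2.0)
helix_like = power_at_period(peptide, 3.6)
```

For this alternating peptide the power at period 2 dominates the power at period 3.6, as expected for a strand-like pattern. A parametric ARMA estimate would typically give sharper peaks on short sequences, which is one plausible reason to prefer it over the raw periodogram shown here.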

10.
A detailed comparative analysis of archaeal RNase P RNA structure and a comparison of the resulting structural information with that of the bacterial RNA reveals that the archaeal RNase P RNAs are strikingly similar to those of Bacteria. The differences between the secondary structure models of archaeal and bacterial RNase P RNA have largely disappeared, and even variation in the sequence and structure of the RNAs is similar in extent and type. The structure of the cruciform (P7-11) has been reevaluated on the basis of a total of 321 bacterial and archaeal sequences, leading to a model for the structure of this region of the RNA that includes an extension to P11 that consistently organizes the cruciform and adjacent highly-conserved sequences.

11.
PCR was used to amplify DNA-dependent RNA polymerase gene sequences specifically from the cyanobacterial population in a seawater sample from the Sargasso Sea. Sequencing and analysis of the cloned fragments suggest that the population in the sample consisted of two distinct clusters of Prochlorococcus-like cyanobacteria and four clusters of Synechococcus-like cyanobacteria. The diversity within these clusters was significantly different, however. Clones within each Synechococcus-like cluster were 99 to 100% identical, while each Prochlorococcus-like cluster was only 91% identical at the nucleotide level. One Prochlorococcus-like cluster was significantly more closely related to a Mediterranean Sea (surface) Prochlorococcus isolate than to the other cluster, showing the highly divergent nature of this group even in one sample. The approach described here can be used as a general method for examining cyanobacterial diversity, while an oligotrophic ocean ecosystem such as the Sargasso Sea may be an ideal model for examining diversity in relation to environmental parameters.

12.
13.
We report here the complete genomic sequence of the Chilean human isolate of Andes virus CHI-7913. The S, M, and L genome segment sequences of this isolate are 1,802, 3,641 and 6,466 bases in length, with an overall GC content of 38.7%. These genome segments code for a nucleocapsid protein of 428 amino acids, a glycoprotein precursor protein of 1,138 amino acids and an RNA-dependent RNA polymerase of 2,152 amino acids. In addition, the genome also has other ORFs coding for putative proteins of 34 to 103 amino acids. The encoded proteins have greater than 98% overall similarity with the proteins of Andes virus isolates AH-1 and Chile R123. Among other sequenced hantaviruses, CHI-7913 is more closely related to Sin Nombre virus, with an overall protein similarity of 92%. The characteristics of the encoded proteins of this isolate, such as hydrophobic domains, glycosylation sites, and conserved amino acid motifs shared with other hantaviruses and other members of the Bunyaviridae family, are identified and discussed.

14.
Elucidating the structure of RNA and RNA ensembles is essential to understand biological functions. In this work, we explored the previously uncharted reactivity of bis-chloropiperidines (B-CePs) towards RNA. We characterized at the molecular level the different adducts induced by the fast reacting compound B-CeP 1 with RNA. Following an approach based on solution thermal melting coupled with ESI mass spectrometry (STHEM-ESI), we proved the ability of B-CePs to induce inter-molecular cross-links between guanines in double stranded RNA. These results open the possibility of using B-CePs as structural probes for investigating higher-order structures, such as the kissing loop complex established by the dimerization initiation site (DIS) of the HIV-1 genome. We confirmed the potential of B-CePs to reveal the identity of RNA structures involved in long-range interactions, expecting to benefit the characterization of samples that are not readily amenable to traditional high-resolution techniques, and thus promoting the elucidation of pertinent RNA systems associated with old and new diseases.

15.
The amino acid sequence of an insect apolipoprotein, apolipophorin-III from Manduca sexta, was determined by a combination of cDNA and protein sequencing. The mature hemolymph protein consists of 166 amino acids. The cDNA also encodes an amino-terminal extension of 23 amino acids which is not represented in the mature hemolymph protein. The existence of a precursor protein was confirmed by in vitro translation of fat body mRNA. Computer-assisted comparative sequence analysis revealed the following points: 1) the protein is composed of tandemly repeating tetradecapeptide units with a high potential for forming amphiphilic helical structures. Compared to mammalian apolipoproteins, the repeat units in the insect apolipoprotein show considerable length variability; 2) the sequence has a striking resemblance to several human apolipoproteins including apoE, AIV, AI, and CI. However, the homology seems to be entirely functional since, although the insect and mammalian apoproteins contain very similar types of amino acid residues, the actual degree of sequence identity is quite low. Whether the mammalian and insect apoproteins are derived from a common ancestral amphiphilic helix-forming, lipid-binding protein, or arose by convergent evolution cannot be determined at present. This represents the first complete amino acid sequence for an insect apolipoprotein.

16.
The primary structure of rabbit 18S ribosomal RNA was determined by nucleotide sequence analysis of the RNA directly. The rabbit rRNA was specifically cleaved with T1 ribonuclease, as well as with E. coli RNase H using a Pst 1 DNA linker to generate a specific set of overlapping fragments spanning the entire length of the molecule. Both intact and fragmented 18S rRNA were end-labeled with [32P], base-specifically cleaved enzymatically and chemically and nucleotide sequences determined from long polyacrylamide sequencing gels run in formamide. This approach permitted the detection of both cistron heterogeneities and modified bases. Specific nucleotide sequences within E. coli 16S rRNA previously implicated in polyribosome function, tRNA binding, and subunit association are also conserved within the rabbit 18S rRNA. This conservation suggests the likelihood that these regions have similar functions within the eukaryotic 40S subunit.

17.
SUMMARY: MaxBench is a web-based system available for evaluating the results of sequence and structure comparison methods, based on the SCOP protein domain classification. The system makes it easy for developers to both compare the overall performance of their methods to standard algorithms and investigate the results of individual comparisons. AVAILABILITY: http://www.sanger.ac.uk/Users/lp1/MaxBench/

18.
An RNA secondary structure model is presented for the nuclear ribosomal internal transcribed spacers (ITS) based on comparative analysis of 340 sequences from the angiosperm family Asteraceae. The model based on covariation analysis agrees with structural features proposed in previous studies using mainly thermodynamic criteria and provides evidence for additional structural motifs within ITS1 and ITS2. The minimum structure model suggests that at least 20% of ITS1 and 38% of ITS2 nucleotide positions are involved in base pairing to form helices. The sequence alignment enabled by conserved structural features provides a framework for broadscale molecular evolutionary studies and the first family-level phylogeny of the Asteraceae based on nuclear DNA data. The phylogeny based on ITS sequence data is very well resolved and shows considerable congruence with relationships among major lineages of the family suggested by chloroplast DNA studies, including a monophyletic subfamily Asteroideae and a paraphyletic subfamily Cichorioideae. Combined analyses of ndhF and ITS sequences provide additional resolution and support for relationships in the family.

19.
李桢  邹红岩  邵超鹏  唐斯  王大明  程良红 《遗传》2007,29(11):1367-1372
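Using typing techniques including FLOW-SSO, PCR-SSP, and sequencing, an unknown gene related to HLA-B*270401 was found. Gene-specific primers were designed to amplify exons 2-5 of the B*27 gene separately, including introns 2-4, and the product was sequenced in both directions to analyze differences from the B*270401 sequence. The amplification product of this gene was 1 815 bp. Compared with B*270401, there were 10 base changes in exons 3 and 4, causing missense or synonymous mutations in the corresponding amino acids: base 634 A→C (codon 130 serine→arginine); 670 A→T (codon 142 threonine→serine); 683 G→T (codon 146 tryptophan→leucine); 698 A→T (codon 151 glutamic acid→valine); 774 G→C (codon 176 glutamic acid→aspartic acid); 776 C→A (codon 177 threonine→lysine); 781 C→G (codon 179 glutamine→glutamic acid); 789 G→T (codon 181 alanine, synonymous); 1 438 C→T (codon 206 glycine, synonymous); 1 449 G→C (codon 210 glycine→alanine). In the IMGT/HLA database, only three alleles of the B*27 group (B*270502 / 2706 / 2732) have submitted intron sequences. The intron 2 sequence of the unknown gene was identical to that of B*2706, showing homology with the B*27 group, but this homology was not supported in introns 3 and 4. Compared with the B*27 group genes, intron 3 showed C→G at base 106, a deletion of base 168, G→A at base 179, and G→A at base 536; intron 4 showed T→C at base 82. The intron 3 and 4 sequences were, however, identical to those of B*070201. The gene sequence has been submitted to GenBank under accession number DQ915176 and has been confirmed by the WHO as the HLA-B*2736 allele.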

20.