首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Covalent ligation studies on the human telomere quadruplex   总被引:5,自引:4,他引:1  
Qi J  Shafer RH 《Nucleic acids research》2005,33(10):3185-3192
Recent X-ray crystallographic studies on the human telomere sequence d[AGGG(TTAGGG)3] revealed a unimolecular, parallel quadruplex structure in the presence of potassium ions, while earlier NMR results in the presence of sodium ions indicated a unimolecular, antiparallel quadruplex. In an effort to identify and isolate the parallel form in solution, we have successfully ligated into circular products the single-stranded human telomere and several modified human telomere sequences in potassium-containing solutions. Using these sequences with one or two terminal phosphates, we have made chemically ligated products via creation of an additional loop. Circular products have been identified by polyacrylamide gel electrophoresis, enzymatic digestion with exonuclease VII and electrospray mass spectrometry in negative ion mode. Optimum pH for the ligation reaction of the human telomere sequence ranges from 4.5 to 6.0. Several buffers were also examined, with MES yielding the greatest ligation efficiency. Human telomere sequences with two phosphate groups, one each at the 3′ and 5′ ends, were more efficient at ligation, via pyrophosphate bond formation, than the corresponding sequences with only one phosphate group, at the 5′ end. Circular dichroism spectra showed that the ligation product was derived from an antiparallel, single-stranded guanine quadruplex rather than a parallel single-stranded guanine quadruplex structure.  相似文献   

2.
Highly prevalent putative quadruplex sequence motifs in human DNA   总被引:25,自引:14,他引:11  
We report here the results of a systematic search for the existence and prevalence of potential intramolecular G-quadruplex forming sequences in the human genome. We have also examined the tendency for particular sequences of ‘loop’ regions to occur in particular positions with respect to the G-tracts in a quadruplex. Using arithmetic ratio and probability techniques we have discovered frequent and systematic occurrence of certain sequence types, the most prominent being a potential quadruplex containing CCTGT in the first ‘loop’ position. Being able to highlight types of potential quadruplex sequences in G-rich regions is an important step in searching for biologically relevant sequences and finding their function.  相似文献   

3.
Here we report a deoxyribozyme with a unique structure that contains a two-tiered guanine quadruplex interlinked to a Watson-Crick duplex. Through in vitro selection, sequence mutation, and methylation interference, we show the presence of both the two-tiered guanine-quadruplex and two helical regions contained in the active structure of this self-phosphorylating deoxyribozyme. Interestingly, one GG element of the quadruplex is part of a hairpin loop within one of the identified helical regions. Circular dichroism analysis showed that antiparallel quadruplex formation was dependent on this helix. To our knowledge, this is the first report of a pseudoknot nucleic acid structure that involves a guanine quadruplex. Our findings indicate that guanine quadruplexes can be part of complex structural arrangements, increasing the likelihood of finding more complex guanine quadruplex arrangements in biological systems.  相似文献   

4.
We propose a new method for classifying and identifying transmembrane (TM) protein functions in proteome-scale by applying a single-linkage clustering method based on TM topology similarity, which is calculated simply from comparing the lengths of loop regions. In this study, we focused on 87 prokaryotic TM proteomes consisting of 31 proteobacteria, 22 gram-positive bacteria, 19 other bacteria, and 15 archaea. Prior to performing the clustering, we first categorized individual TM protein sequences as "known," "putative" (similar to "known" sequences), or "unknown" by using the homology search and the sequence similarity comparison against SWISS-PROT to assess the current status of the functional annotation of the TM proteomes based on sequence similarity only. More than three-quarters, that is, 75.7% of the TM protein sequences are functionally "unknown," with only 3.8% and 20.5% of them being classified as "known" and "putative," respectively. Using our clustering approach based on TM topology similarity, we succeeded in increasing the rate of TM protein sequences functionally classified and identified from 24.3% to 60.9%. Obtained clusters correspond well to functional superfamilies or families, and the functional classification and identification are successfully achieved by this approach. For example, in an obtained cluster of TM proteins with six TM segments, 109 sequences out of 119 sequences annotated as "ATP-binding cassette transporter" are properly included and 122 "unknown" sequences are also contained.  相似文献   

5.
6.
G-quadruplex structures, formed from guanine rich sequences, have previously been shown to be involved in various physiological processes including cancer-related gene expression. Furthermore, G-quadruplexes have been found in several oncogene promoter regions, and have been shown to play a role in the regulation of gene expression. The mutagenic properties of oxidative stress on DNA have been widely studied, as has the association with carcinogenesis. Guanine is the most susceptible nucleotide to oxidation, and as such, G-rich sequences that form G-quadruplexes can be viewed as potential "hot-spots" for DNA oxidation. We propose that oxidation may destabilise the G-quadruplex structure, leading to its unfolding into the duplex structure, affecting gene expression. This would imply a possible mechanism by which oxidation may impact on oncogene expression. This work investigates the effect of oxidation on two biologically relevant G-quadruplex structures through 500 ns molecular dynamics simulations on those found in the promoter regions of the c-Kit and c-Myc oncogenes. The results show oxidation having a detrimental effect on stability of the structure, substantially destabilising the c-Kit quadruplex, and with a more attenuated effect on the c-Myc quadruplex. Results are suggestive of a novel route for oxidation-mediated oncogenesis and may have wider implications for genome stability.  相似文献   

7.
Risitano A  Fox KR 《Biochemistry》2003,42(21):6507-6513
We have determined the stability of intramolecular quadruplexes that are formed by a variety of G-rich sequences, using oligonucleotides containing appropriately placed fluorophores and quenchers. The stability of these quadruplexes is compared with that of the DNA duplexes that are formed on addition of complementary C-rich oligonucleotides. We find that the linkers joining the G-tracts are not essential for folding and can be replaced with nonnucleosidic moieties, though their sequence composition profoundly affects quadruplex stability. Although the human telomere repeat sequence d[G(3)(TTAG(3))(3)] folds into a quadruplex structure, this forms a duplex in the presence of the complementary C-rich strand at physiological conditions. The Tetrahymena sequence d[G(4)(T(2)G(4))(3)], the sequence d[G(3)(T(2)G(3))(3)], and sequences related to regions of the c-myc promoter d(G(4)AG(4)T)(2) and d(G(4)AG(3)T)(2) preferentially adopt the quadruplex form in potassium-containing buffers, even in the presence of a 50-fold excess of their complementary C-rich strands, though the duplex predominates in the presence of sodium. The HIV integrase inhibitor d[G(3)(TG(3))(3)] forms an extremely stable quadruplex which is not affected by addition of a 50-fold excess of the complementary C-rich strand in both potassium- and sodium-containing buffers. Replacing the TTA loops of the human telomeric repeat with AAA causes a large decrease in quadruplex stability, though a sequence with AAA in the first loop and TTT in the second and third loops is slightly more stable.  相似文献   

8.
9.
In addition to the well-known Watson–Crick double helix, DNA can form other structures. One of them is a four-stranded quadruplex, formation of which was also acknowledged in in vivo conditions. It was suggested that the presence of quadruplexes in e.g. telomeric region has a significant biological importance. We have studied structural properties of the human telomeric quadruplex formed by G3(T2AG3)3 and related sequences, in which each guanine base was one-by-one replaced by adenine. In the next step, we have studied sequences, in which two, or even four guanines were replaced by adenine. These sequences were studied in the presence of sodium or potassium ions. Using CD spectroscopy, UV thermal stability measurements, and polyacrylamide gel electrophoresis we found that none of the substitutions hindered the formation of the antiparallel quadruplex formed by the unsubstituted sequence in sodium solutions. However, the effect of substitution differed depending on the position of the guanine replaced. The middle quartet of the antiparallel basket scaffold was the most sensitive and led to the least stable structures. With other sequences, the effect of substitution depends on the position and also on the syn/anti glycosidic bond orientation of the appropriate guanosine in the original quadruplex structure. In the case of the multiple A for G substitutions, the G3(T2AG3)3 quadruplex was most destabilized by the G:G:A:A tetrad, in which the adenosines substituted syn guanosines. Interestingly, unlike with G3(T2AG3)3, no structural transitions were observed with the A-containing analogs of the sequence when sodium ions were replaced by potassium ions. The basic quadruplex topology remained antiparallel for all modified sequences in both salts. As in vivo misincorporation of A for a G in the telomeric sequence is possible and potassium is a physiological salt, these findings may be biologically important. In our next studies, we have compared the effect of the G to A substitutions in the human telomere sequence with 8-oxoguanine substituted samples or samples containing guanine apurinic sites. Data obtained from our study show a noticeable trend: it is not the type of the lesion but the position of the modification determines the effect on the conformation and stability of the quadruplex.  相似文献   

10.
Biological aspects of DNA/RNA quadruplexes.   总被引:6,自引:0,他引:6  
R H Shafer  I Smirnov 《Biopolymers》2000,56(3):209-227
Among the many unusual conformations of DNA and RNA, quadruplex structures, based on the guanine quartet, possess several unique properties. These properties, along with the general features of guanine quadruplexes, are described in the context of possible roles for these structures in biological systems. A variety of experimental observations supporting the notion that quadruplexes are important in vivo is presented, including proteins known to specifically bind to quadruplex structures, guanine-rich DNA, and RNA sequences endowed with the potential for forming quartet-based structures in telomeres and regulatory regions, such as gene promoters, quadruplexes as DNA aptamer folding motifs arising from in vitro selection experiments, and potential chemotherapeutic, quadruplex-forming oligonucleotides. Taken together, all of these observations argue cogently not only for the presence of quadruplexes in biological systems but also for their significance in terms of their roles in various biological processes.  相似文献   

11.
The V3 loop of human immunodeficiency virus type 1 (HIV-1) is critical for coreceptor binding and is the main determinant of which of the cellular coreceptors, CCR5 or CXCR4, the virus uses for cell entry. The aim of this study is to provide a large-scale data driven analysis of HIV-1 coreceptor usage with respect to the V3 loop evolution and to characterize CCR5- and CXCR4-tropic viral phenotypes previously studied in small- and medium-scale settings. We use different sequence similarity measures, phylogenetic and clustering methods in order to analyze the distribution in sequence space of roughly 1000 V3 loop sequences and their tropism phenotypes. This analysis affords a means of characterizing those sequences that are misclassified by several sequence-based coreceptor prediction methods, as well as predicting the coreceptor using the location of the sequence in sequence space and of relating this location to the CD4+ T-cell count of the patient. We support previous findings that the usage of CCR5 is correlated with relatively high sequence conservation whereas CXCR4-tropic viruses spread over larger regions in sequence space. The incorrectly predicted sequences are mostly located in regions in which their phenotype represents the minority or in close vicinity of regions dominated by the opposite phenotype. Nevertheless, the location of the sequence in sequence space can be used to improve the accuracy of the prediction of the coreceptor usage. Sequences from patients with high CD4+ T-cell counts are relatively highly conserved as compared to those of immunosuppressed patients. Our study thus supports hypotheses of an association of immune system depletion with an increase in V3 loop sequence variability and with the escape of the viral sequence to distant parts of the sequence space.  相似文献   

12.
Kumar N  Maiti S 《Nucleic acids research》2008,36(17):5610-5622
Loop length and its composition are important for the structural and functional versatility of quadruplexes. To date studies on the loops have mainly concerned model sequences compared with naturally occurring quadruplex sequences which have diverse loop lengths and compositions. Herein, we have characterized 36 quadruplex-forming sequences from the promoter regions of various proto-oncogenes using CD, UV and native gel electrophoresis. We examined folding topologies and determined the thermodynamic profile for quadruplexes varying in total loop length (5–18 bases) and composition. We found that naturally occurring quadruplexes have variable thermodynamic stabilities (ΔG37) ranging from −1.7 to −15.6 kcal/mol. Overall, our results suggest that both loop length and its composition affect quadruplex structure and thermodynamics, thus making it difficult to draw generalized correlations between loop length and thermodynamic stability. Additionally, we compared the thermodynamic stability of quadruplexes and their respective duplexes to understand quadruplex–duplex competition. Our findings invoke a discussion on whether biological function is associated with quadruplexes with lower thermodynamic stability which undergo facile formation and disruption, or by quadruplexes with high thermodynamic stability.  相似文献   

13.
Angiogenesis, or neovascularization, is tightly controlled by positive and negative regulators, many of which reside in the extracellular matrix. We have now identified eight novel 19- to 20-residue peptides derived from the alpha4, alpha5, and alpha6 fibrils of type IV collagen, which we have designated tetrastatins, pentastatins, and hexastatins, respectively. We have shown that these endogenous peptides suppress the proliferation and migration of HUVECs in vitro. By performing clustering analyses of the sequences using sequence similarity criteria and of the experimental results using a hierarchical algorithm, we report that the clusters identified by the experimental results coincide with the sequence-based clusters, indicating a tight relationship between peptide sequence and anti-angiogenic potency. These peptides may have potential as anti-angiogenic therapeutic agents.  相似文献   

14.
15.
Bioinformatics approaches to quadruplex sequence location   总被引:1,自引:0,他引:1  
Guanine quadruplex structures are potentially useful therapeutic targets. There have been several studies attempting to locate genomic sequences which are capable of forming these structures. Since the number of potential quadruplex forming sequences which have been identified is so high, several different strategies have been applied to try and determine which of these sequences may be physiologically relevant and which sequences are most likely to form quadruplex structures. These are based on the limited structural information that is currently available and comparative analyses of the location of these sequences with respect to different genomic regions. Sequence information alone is not enough to identify regions of nucleic acid which participate in quadruplex structures, however it is the starting point for quadruplex structure discovery when complemented with further experimentation.  相似文献   

16.
G-Rich sequences found within biologically important regions of the genome have been shown to form intramolecular G-quadruplexes with varied loop lengths and sequences. Many of these quadruplexes will be distinguishable from each other on the basis of their thermodynamic stabilities and folded conformations. It has been proposed that loop lengths can strongly influence the topology and stability of intramolecular G-quadruplexes. Previous studies have been limited to the analysis of quadruplex sequences with particular loop sequences, making it difficult to make generalizations. Here, we describe an original study that aimed to elucidate the effect of loop length on the biophysical properties of G-quadruplexes in a sequence-independent context. We employed UV melting and circular dichroism spectroscopy to examine and compare the properties of 21 DNA quadruplex libraries, each comprising partially randomized loop sequences with lengths ranging from one to three nucleotides. Our work supports a number of general predictions that can be made solely on the basis of loop lengths. In particular, the results emphasize the strong influence of single-nucleotide loops on quadruplex properties. This study provides a predictive framework that may help identify or classify biologically relevant G-quadruplex-forming sequences.  相似文献   

17.
Mishra P  Pandey PN 《Bioinformation》2011,6(10):372-374
The number of amino acid sequences is increasing very rapidly in the protein databases like Swiss-Prot, Uniprot, PIR and others, but the structure of only some amino acid sequences are found in the Protein Data Bank. Thus, an important problem in genomics is automatically clustering homologous protein sequences when only sequence information is available. Here, we use graph theoretic techniques for clustering amino acid sequences. A similarity graph is defined and clusters in that graph correspond to connected subgraphs. Cluster analysis seeks grouping of amino acid sequences into subsets based on distance or similarity score between pairs of sequences. Our goal is to find disjoint subsets, called clusters, such that two criteria are satisfied: homogeneity: sequences in the same cluster are highly similar to each other; and separation: sequences in different clusters have low similarity to each other. We tested our method on several subsets of SCOP (Structural Classification of proteins) database, a gold standard for protein structure classification. The results show that for a given set of proteins the number of clusters we obtained is close to the superfamilies in that set; there are fewer singeltons; and the method correctly groups most remote homologs.  相似文献   

18.
Hundreds of thousands of putative quadruplex sequences have been found in the human genome. It is important to understand the rules that govern the stability of these intramolecular structures. In this report, we analysed sequence effects in a 3-base-long central loop, keeping the rest of the quadruplex unchanged. A first series of 36 different sequences were compared; they correspond to the general formula GGGTTTGGGHNHGGGTTTGGG. One clear rule emerged from the comparison of all sequence motifs: the presence of an adenine at the first position of the loop was significantly detrimental to stability. In contrast, adenines have no detrimental effect when present at the second or third position of the loop. Cytosines may either have a stabilizing or destabilizing effect depending on their position. In general, the correlation between the Tm or ΔG° in sodium and potassium was weak. To determine if these sequence effects could be generalized to different quadruplexes, specific loops were tested in different sequence contexts. Analysis of 26 extra sequences confirmed the general destabilizing effect of adenine as the first base of the loop(s). Finally, analysis of some of the sequences by microcalorimetry (DSC) confirmed the differences found between the sequence motifs.  相似文献   

19.
The specific function of RNA molecules frequently resides in their seemingly unstructured loop regions. We performed a systematic analysis of RNA loops extracted from experimentally determined three-dimensional structures of RNA molecules. A comprehensive loop-structure data set was created and organized into distinct clusters based on structural and sequence similarity. We detected clear evidence of the hallmark of homology present in the sequence–structure relationships in loops. Loops differing by <25% in sequence identity fold into very similar structures. Thus, our results support the application of homology modeling for RNA loop model building. We established a threshold that may guide the sequence divergence-based selection of template structures for RNA loop homology modeling. Of all possible sequences that are, under the assumption of isosteric relationships, theoretically compatible with actual sequences observed in RNA structures, only a small fraction is contained in the Rfam database of RNA sequences and classes implying that the actual RNA loop space may consist of a limited number of unique loop structures and conserved sequences. The loop-structure data sets are made available via an online database, RLooM. RLooM also offers functionalities for the modeling of RNA loop structures in support of RNA engineering and design efforts.  相似文献   

20.
MOTIVATION: Clustering of protein sequences is widely used for the functional characterization of proteins. However, it is still not easy to cluster distantly-related proteins, which have only regional similarity among their sequences. It is therefore necessary to develop an algorithm for clustering such distantly-related proteins. RESULTS: We have developed a time and space efficient clustering algorithm. It uses a graph representation where its vertices and edges denote proteins and their sequence similarities above a certain cutoff score, respectively. It repeatedly partitions the graph by removing edges that have small weights, which correspond to low sequence similarities. To find the appropriate partitions, we introduce a score combining the normalized cut and a locally minimal cut capacities. Our method is applied to the entire 40,703 human proteins in SWISS-PROT and TrEMBL. The resulting clusters shows a 76% recall (20,529 proteins) of the 26,917 classified by InterPro. It also finds relationships not found by other clustering methods. AVAILABILITY: The complete result of our algorithm for all the human proteins in SWISS-PROT and TrEMBL, and other supplementary information are available at http://motif.ics.es.osaka-u.ac.jp/Ncut-KL/  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号