首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 21 毫秒
1.
New methods are described for finding recurrent three-dimensional (3D) motifs in RNA atomic-resolution structures. Recurrent RNA 3D motifs are sets of RNA nucleotides with similar spatial arrangements. They can be local or composite. Local motifs comprise nucleotides that occur in the same hairpin or internal loop. Composite motifs comprise nucleotides belonging to three or more different RNA strand segments or molecules. We use a base-centered approach to construct efficient, yet exhaustive search procedures using geometric, symbolic, or mixed representations of RNA structure that we implement in a suite of MATLAB programs, “Find RNA 3D” (FR3D). The first modules of FR3D preprocess structure files to classify base-pair and -stacking interactions. Each base is represented geometrically by the position of its glycosidic nitrogen in 3D space and by the rotation matrix that describes its orientation with respect to a common frame. Base-pairing and base-stacking interactions are calculated from the base geometries and are represented symbolically according to the Leontis/Westhof basepairing classification, extended to include base-stacking. These data are stored and used to organize motif searches. For geometric searches, the user supplies the 3D structure of a query motif which FR3D uses to find and score geometrically similar candidate motifs, without regard to the sequential position of their nucleotides in the RNA chain or the identity of their bases. To score and rank candidate motifs, FR3D calculates a geometric discrepancy by rigidly rotating candidates to align optimally with the query motif and then comparing the relative orientations of the corresponding bases in the query and candidate motifs. Given the growing size of the RNA structure database, it is impossible to explicitly compute the discrepancy for all conceivable candidate motifs, even for motifs with less than ten nucleotides. The screening algorithm that we describe finds all candidate motifs whose geometric discrepancy with respect to the query motif falls below a user-specified cutoff discrepancy. This technique can be applied to RMSD searches. Candidate motifs identified geometrically may be further screened symbolically to identify those that contain particular basepair types or base-stacking arrangements or that conform to sequence continuity or nucleotide identity constraints. Purely symbolic searches for motifs containing user-defined sequence, continuity and interaction constraints have also been implemented. We demonstrate that FR3D finds all occurrences, both local and composite and with nucleotide substitutions, of sarcin/ricin and kink-turn motifs in the 23S and 5S ribosomal RNA 3D structures of the H. marismortui 50S ribosomal subunit and assigns the lowest discrepancy scores to bona fide examples of these motifs. The search algorithms have been optimized for speed to allow users to search the non-redundant RNA 3D structure database on a personal computer in a matter of minutes.  相似文献   

2.
3.
4.
5.
In humans and mice, the Cys2His2 zinc finger protein PRDM9 binds to a DNA sequence motif enriched in hotspots of recombination, possibly modifying nucleosomes, and recruiting recombination machinery to initiate Double Strand Breaks (DSBs). However, since its discovery, some researchers have suggested that the recombinational effect of PRDM9 is lineage or species specific. To test for a conserved role of PRDM9-like proteins across taxa, we use the Drosophila pseudoobscura species group in an attempt to identify recombination associated zinc finger proteins and motifs. We leveraged the conserved amino acid motifs in Cys2His2 zinc fingers to predict nucleotide binding motifs for all Cys2His2 zinc finger proteins in Drosophila pseudoobscura and identified associations with empirical measures of recombination rate. Additionally, we utilized recombination maps from D. pseudoobscura and D. miranda to explore whether changes in the binding motifs between species can account for changes in the recombination landscape, analogous to the effect observed in PRDM9 among human populations. We identified a handful of potential recombination-associated sequence motifs, but the associations are generally tenuous and their biological relevance remains uncertain. Furthermore, we found no evidence that changes in zinc finger DNA binding explains variation in recombination rate between species. We therefore conclude that there is no protein with a DNA sequence specific human-PRDM9-like function in Drosophila. We suggest these findings could be explained by the existence of a different recombination initiation system in Drosophila.  相似文献   

6.
With the aim of improving aqueous solubility, we designed and synthesized five N-methylpyrrole (Py)–N-methylimidazole (Im) polyamides capable of recognizing 9-bp sequences. Their DNA-binding affinities and sequence specificities were evaluated by SPR and Bind-n-Seq analyses. The design of polyamide 1 was based on a conventional model, with three consecutive Py or Im rings separated by a β-alanine to match the curvature and twist of long DNA helices. Polyamides 2 and 3 contained an 8-amino-3,6-dioxaoctanoic acid (AO) unit, which has previously only been used as a linker within linear Py–Im polyamides or between Py–Im hairpin motifs for tandem hairpin. It is demonstrated herein that AO also functions as a linker element that can extend to 2-bp in hairpin motifs. Notably, although the AO-containing unit can fail to bind the expected sequence, polyamide 4, which has two AO units facing each other in a hairpin form, successfully showed the expected motif and a KD value of 16 nM was recorded. Polyamide 5, containing a β-alanine–β-alanine unit instead of the AO of polyamide 2, was synthesized for comparison. The aqueous solubilities and nuclear localization of three of the polyamides were also examined. The results suggest the possibility of applying the AO unit in the core of Py–Im polyamide compounds.  相似文献   

7.
The conjugative transfer of bacterial F plasmids relies on TraM, a plasmid-encoded protein that recognizes multiple DNA sites to recruit the plasmid to the conjugative pore. In spite of the high degree of amino acid sequence conservation between TraM proteins, many of these proteins have markedly different DNA binding specificities that ensure the selective recruitment of a plasmid to its cognate pore. Here we present the structure of F TraM RHH (ribbon–helix–helix) domain bound to its sbmA site. The structure indicates that a pair of TraM tetramers cooperatively binds an underwound sbmA site containing 12 base pairs per turn. The sbmA is composed of 4 copies of a 5-base-pair motif, each of which is recognized by an RHH domain. The structure reveals that a single conservative amino acid difference in the RHH β-ribbon between F and pED208 TraM changes its specificity for its cognate 5-base-pair sequence motif. Specificity is also dictated by the positioning of 2-base-pair spacer elements within sbmA; in F sbmA, the spacers are positioned between motifs 1 and 2 and between motifs 3 and 4, whereas in pED208 sbmA, there is a single spacer between motifs 2 and 3. We also demonstrate that a pair of F TraM tetramers can cooperatively bind its sbmC site with an affinity similar to that of sbmA in spite of a lack of sequence similarity between these DNA elements. These results provide a basis for the prediction of the DNA binding properties of the family of TraM proteins.  相似文献   

8.
Type IV secretion system (T4SS) substrates are recruited through a translocation signal that is poorly defined for conjugative relaxases. The relaxase TrwC of plasmid R388 is translocated by its cognate conjugative T4SS, and it can also be translocated by the VirB/D4 T4SS of Bartonella henselae, causing DNA transfer to human cells. In this work, we constructed a series of TrwC variants and assayed them for DNA transfer to bacteria and human cells to compare recruitment requirements by both T4SSs. Comparison with other reported relaxase translocation signals allowed us to determine two putative translocation sequence (TS) motifs, TS1 and TS2. Mutations affecting TS1 drastically affected conjugation frequencies, while mutations affecting either motif had only a mild effect on DNA transfer rates through the VirB/D4 T4SS of B. henselae. These results indicate that a single substrate can be recruited by two different T4SSs through different signals. The C terminus affected DNA transfer rates through both T4SSs tested, but no specific sequence requirement was detected. The addition of a Bartonella intracellular delivery (BID) domain, the translocation signal for the Bartonella VirB/D4 T4SS, increased DNA transfer up to 4% of infected human cells, providing an excellent tool for DNA delivery to specific cell types. We show that the R388 coupling protein TrwB is also required for this high-efficiency TrwC-BID translocation. Other elements apart from the coupling protein may also be involved in substrate recognition by T4SSs.  相似文献   

9.

Background

Mimivirus isolated from A. polyphaga is the largest virus discovered so far. It is unique among all the viruses in having genes related to translation, DNA repair and replication which bear close homology to eukaryotic genes. Nevertheless, only a small fraction of the proteins (33%) encoded in this genome has been assigned a function. Furthermore, a large fraction of the unassigned protein sequences bear no sequence similarity to proteins from other genomes. These sequences are referred to as ORFans. Because of their lack of sequence similarity to other proteins, they can not be assigned putative functions using standard sequence comparison methods. As part of our genome-wide computational efforts aimed at characterizing Mimivirus ORFans, we have applied fold-recognition methods to predict the structure of these ORFans and further functions were derived based on conservation of functionally important residues in sequence-template alignments.

Results

Using fold recognition, we have identified highly confident computational 3D structural assignments for 21 Mimivirus ORFans. In addition, highly confident functional predictions for 6 of these ORFans were derived by analyzing the conservation of functional motifs between the predicted structures and proteins of known function. This analysis allowed us to classify these 6 previously unannotated ORFans into their specific protein families: carboxylesterase/thioesterase, metal-dependent deacetylase, P-loop kinases, 3-methyladenine DNA glycosylase, BTB domain and eukaryotic translation initiation factor eIF4E.

Conclusion

Using stringent fold recognition criteria we have assigned three-dimensional structures for 21 of the ORFans encoded in the Mimivirus genome. Further, based on the 3D models and an analysis of the conservation of functionally important residues and motifs, we were able to derive functional attributes for 6 of the ORFans. Our computational identification of important functional sites in these ORFans can be the basis for a subsequent experimental verification of our predictions. Further computational and experimental studies are required to elucidate the 3D structures and functions of the remaining Mimivirus ORFans.  相似文献   

10.
11.
Structural 3D motifs in RNA play an important role in the RNA stability and function. Previous studies have focused on the characterization and discovery of 3D motifs in RNA secondary and tertiary structures. However, statistical analyses of the distribution of 3D motifs along the RNA appear to be lacking. Herein, we present a novel strategy for evaluating the distribution of 3D motifs along the RNA chain and those motifs whose distributions are significantly non-random are identified. By applying it to the X-ray structure of the large ribosomal subunit from Haloarcula marismortui, helical motifs were found to cluster together along the chain and in the 3D structure, whereas the known tetraloops tend to be sequentially and spatially dispersed. That the distribution of key structural motifs such as tetraloops differ significantly from a random one suggests that our method could also be used to detect novel 3D motifs of any size in sufficiently long/large RNA structures. The motif distribution type can help in the prediction and design of 3D structures of large RNA molecules.  相似文献   

12.
G-quadruplex DNA is a four-stranded DNA structure formed by non-Watson-Crick base pairing between stacked sets of four guanines. Many possible functions have been proposed for this structure, but its in vivo role in the cell is still largely unresolved. We carried out a genome-wide survey of the evolutionary conservation of regions with the potential to form G-quadruplex DNA structures (G4 DNA motifs) across seven yeast species. We found that G4 DNA motifs were significantly more conserved than expected by chance, and the nucleotide-level conservation patterns suggested that the motif conservation was the result of the formation of G4 DNA structures. We characterized the association of conserved and non-conserved G4 DNA motifs in Saccharomyces cerevisiae with more than 40 known genome features and gene classes. Our comprehensive, integrated evolutionary and functional analysis confirmed the previously observed associations of G4 DNA motifs with promoter regions and the rDNA, and it identified several previously unrecognized associations of G4 DNA motifs with genomic features, such as mitotic and meiotic double-strand break sites (DSBs). Conserved G4 DNA motifs maintained strong associations with promoters and the rDNA, but not with DSBs. We also performed the first analysis of G4 DNA motifs in the mitochondria, and surprisingly found a tenfold higher concentration of the motifs in the AT-rich yeast mitochondrial DNA than in nuclear DNA. The evolutionary conservation of the G4 DNA motif and its association with specific genome features supports the hypothesis that G4 DNA has in vivo functions that are under evolutionary constraint.  相似文献   

13.
The genes encoding the ApaLI (5′-G^TGCAC-3′), NspI (5′-RCATG^Y-3′), NspHI (5′-RCATG^Y-3′), SacI (5′-GAGCT^C-3′), SapI (5′-GCTCTTCN1^-3′, 5′-^N4GAAGAGC-3′) and ScaI (5′-AGT^ACT-3′) restriction-modification systems have been cloned in E.?coli. Amino acid sequence comparison of M.ApaLI, M.NspI, M.NspHI, and M.SacI with known methylases indicated that they contain the ten conserved motifs characteristic of C5 cytosine methylases. NspI and NspHI restriction-modification systems are highly homologous in amino acid sequence. The C-termini of the NspI and NlaIII (5′-CATG-3′) restriction endonucleases share significant similarity. 5mC modification of the internal C in a SacI site renders it resistant to SacI digestion. External 5mC modification of a SacI site has no effect on SacI digestion. N4mC modification of the second base in the sequence 5′-GCTCTTC-3′ blocks SapI digestion. N4mC modification of the other cytosines in the SapI site does not affect SapI digestion. N4mC modification of ScaI site blocks ScaI digetion. A DNA invertase homolog was found adjacent to the ApaLI restriction-modification system. A DNA transposase subunit homolog was found upstream of the SapI restriction endonuclease gene.  相似文献   

14.
Stable RNAs are modular and hierarchical 3D architectures taking advantage of recurrent structural motifs to form extensive non-covalent tertiary interactions. Sequence and atomic structure analysis has revealed a novel submotif involving a minimal set of five nucleotides, termed the UA_handle motif (5′XU/ANnX3′). It consists of a U:A Watson–Crick: Hoogsteen trans base pair stacked over a classic Watson–Crick base pair, and a bulge of one or more nucleotides that can act as a handle for making different types of long-range interactions. This motif is one of the most versatile building blocks identified in stable RNAs. It enters into the composition of numerous recurrent motifs of greater structural complexity such as the T-loop, the 11-nt receptor, the UAA/GAN and the G-ribo motifs. Several structural principles pertaining to RNA motifs are derived from our analysis. A limited set of basic submotifs can account for the formation of most structural motifs uncovered in ribosomal and stable RNAs. Structural motifs can act as structural scaffoldings and be functionally and topologically equivalent despite sequence and structural differences. The sequence network resulting from the structural relationships shared by these RNA motifs can be used as a proto-language for assisting prediction and rational design of RNA tertiary structures.  相似文献   

15.
Vitamin D receptor (VDR), a nuclear receptor for 1α,25-dihydroxyvitamin D3 (1α,25(OH)2D3, 1), is a promising target for multiple clinical applications. We recently developed non-secosteroidal VDR ligands based on a carbon-containing boron cluster, 1,12-dicarba-closo-dodecaborane (p-carborane), and examined the binding of one of them to VDR by means of crystallographic analysis. Here, we utilized that X-ray structure to design novel p-carborane-based tetraol-type vitamin D analogs, and we examined the biological activities of the synthesized compounds. Structure–activity relationship study revealed that introduction of an ω-hydroxyalkoxy functionality enhanced the biological activity, and the configuration of the substituent significantly influenced the potency. Among the synthesized compounds, 4-hydroxybutoxy derivative 9a exhibited the most potent activity, which was equal to that of the secosteroidal vitamin D analog, 19-nor-1α,25-dihydroxyvitamin D3 (2).  相似文献   

16.
17.
DNA has proved to be a successful material for creation of nanoscale structures because of its inherent programmability and predictable structural features. However, the assembly of periodic three-dimensional (3D) DNA crystals is hampered by the junctions needed to connect the inherently linear Watson–Crick duplexes. Here, we examine how predictable noncanonical base pairing motifs can be used in conjunction with Watson–Crick duplexes to assemble macroscopic 3D crystals with useful nanoscale features. Parallel-stranded homopurine 5′-GGA base pairs serve as a junction region in a continuously base paired 13-mer DNA crystal (Paukstelis et al., 2004). This motif is predictable and has been used in different sequence contexts to rationally design DNA crystals with different lattice dimensions. These designed crystals have been utilized as macromolecular sieves for capturing or excluding proteins (Paukstelis, 2006). Further, we have demonstrated that a protein enzyme encapsulated in the crystal solvent channels is capable of performing catalysis. Enzyme-infused DNA crystals are capable of multiple cycles of catalysis following removal of substrate and products, and may offer potential new routes for enzyme replacement therapies or the creation of new biodegradable solid-state catalysts and sensors. A structurally similar homoparallel region, 5′-CGAA, has also been used to generate crystals that are capable of making concerted in crystallo structural transitions in response to pH perturbations (Muser & Paukstelis, 2012). These studies highlight potential uses of DNA crystals as stimuli-responsive biomaterials. Despite these successes, the ability to use noncanonical DNA motifs in crystal design is limited by both the number of available noncanonical DNA structures, and our understanding of how these structures self-assemble. To address this we have initiated a high-throughput crystallization screen of short DNA oligonucleotides to identify new noncanonical base pairing motifs and to address the broad question: How structurally diverse is DNA?  相似文献   

18.
An extremely thermostable restriction endonuclease, PspGI, was purified from Pyrococcus sp. strain GI-H. PspGI is an isoschizomer of EcoRII and cleaves DNA before the first C in the sequence 5′ ^CCWGG 3′ (W is A or T). PspGI digestion can be carried out at 65 to 85°C. To express PspGI at high levels, the PspGI restriction-modification genes (pspGIR and pspGIM) were cloned in Escherichia coli. M.PspGI contains the conserved sequence motifs of α-aminomethyltransferases; therefore, it must be an N4-cytosine methylase. M.PspGI shows 53% similarity to (44% identity with) its isoschizomer, M.MvaI from Micrococcus variabilis. In a segment of 87 amino acid residues, PspGI shows significant sequence similarity to EcoRII and to regions of SsoII and StyD4I which have a closely related recognition sequence (5′ ^CCNGG 3′). PspGI was expressed in E. coli via a T7 expression system. Recombinant PspGI was purified to near homogeneity and had a half-life of 2 h at 95°C. PspGI remained active following 30 cycles of thermocycling; thus, it can be used in DNA-based diagnostic applications.  相似文献   

19.

Background  

For many metalloproteins, sequence motifs characteristic of metal-binding sites have not been found or are so short that they would not be expected to be metal-specific. Striking examples of such metalloproteins are those containing Mg2+, one of the most versatile metal cofactors in cellular biochemistry. Even when Mg2+-proteins share insufficient sequence homology to identify Mg2+-specific sequence motifs, they may still share similarity in the Mg2+-binding site structure. However, no structural motifs characteristic of Mg2+-binding sites have been reported. Thus, our aims are (i) to develop a general method for discovering structural patterns/motifs characteristic of ligand-binding sites, given the 3D protein structures, and (ii) to apply it to Mg2+-proteins sharing <30% sequence identity. Our motif discovery method employs structural alphabet encoding to convert 3D structures to the corresponding 1D structural letter sequences, where the Mg2+-structural motifs are identified as recurring structural patterns.  相似文献   

20.
Three related polyoma virus species, designated D92 (92% the size of full-length polyoma virus DNA), D91 (91%) and D76 (76%) have been analysed and their structures compared with that of polyoma virus A2 DNA. Three independent methods (restriction endonuclease cleavage, depurination fingerprinting and DNA-DNA hybridization) were used in the analysis.The defective DNAs appear to be: (1) entirely composed of viral sequences (no host DNA sequences were detected): (2) made up in part of long continuous sequences of DNA which appear identical to sequences of A2 DNA (D92 contains continuous sequences from 1 to 72 map units on the physical map of A2 DNA; that is, it contains the entire late region and part of the early region of the viral DNA. D91 and D76 contain those same sequences except for a 1% deletion around 18 map units): (3) made up in part of rearranged viral sequences.Several interesting features were noted about the rearranged sequences present in the defective DNAs. Sequences from the region around 67 map units were found linked to other (non-contiguous) regions of the DNA. Sequences from about 72 map units were linked to sequences from about 1 map unit. Multiple copies of sequences from 67 to 72 map units (from around the origin of DNA replication) were found (4 copies in D91 and D92, and 2 copies in D76).  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号