Similar articles
 20 similar articles found (search time: 0 ms)
1.
A new method which predicts internal exon sequences in human DNA has been developed. The method is based on a splice site prediction algorithm that uses the linear discriminant function to combine information about significant triplet frequencies of various functional parts of splice site regions and preferences of oligonucleotides in protein coding and intron regions. The accuracy of our splice site recognition function is 97% for donor splice sites and 96% for acceptor splice sites. For exon prediction, we combine in a discriminant function the characteristics describing the 5'-intron region, donor splice site, coding region, acceptor splice site and 3'-intron region for each open reading frame flanked by GT and AG base pairs. The accuracy of precise internal exon recognition on a test set of 451 exon and 246693 pseudoexon sequences is 77% with a specificity of 79%. The recognition quality computed at the level of individual nucleotides is 89% for exon sequences and 98% for intron sequences. This corresponds to a correlation coefficient for exon prediction of 0.87. The precision of this approach is better than other methods and has been tested on a larger data set. We have also developed a means for predicting exon-exon junctions in cDNA sequences, which can be useful for selecting optimal PCR primers.
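
The candidate-generation step mentioned in the abstract (open reading frames flanked by AG and GT dinucleotides) can be illustrated with a minimal Python sketch. This is not the authors' algorithm: the function names, the `min_len` parameter, and the rule "open in at least one frame" are assumptions made for illustration, and the discriminant-function scoring that the abstract describes is not reproduced here.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def has_open_frame(segment):
    """Return True if at least one of the three reading frames has no stop codon."""
    for frame in range(3):
        codons = (segment[i:i + 3] for i in range(frame, len(segment) - 2, 3))
        if not any(codon in STOP_CODONS for codon in codons):
            return True
    return False


def candidate_internal_exons(seq, min_len=30):
    """Yield (start, end) for segments preceded by AG and followed by GT.

    Assumption: a segment qualifies if it is at least min_len long and
    stays open (no stop codon) in at least one reading frame.
    """
    seq = seq.upper()
    # An exon starts right after an acceptor AG and ends right before a donor GT.
    acceptors = [i + 2 for i in range(len(seq) - 1) if seq[i:i + 2] == "AG"]
    donors = [i for i in range(len(seq) - 1) if seq[i:i + 2] == "GT"]
    for start in acceptors:
        for end in donors:
            if end - start >= min_len and has_open_frame(seq[start:end]):
                yield (start, end)
```

In a real predictor each candidate would then be scored, for example by a linear discriminant over splice-site and coding features, which is where nearly all pseudoexons would be rejected.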

2.
3.
4.
Medenbach J  Seiler M  Hentze MW 《Cell》2011,145(6):902-913
Analysis of the regulation of msl-2 mRNA by Sex lethal (SXL), which is critical for dosage compensation in Drosophila, has uncovered a mode of translational control based on common 5' untranslated region elements, upstream open reading frames (uORFs), and interaction sites for RNA-binding proteins. We show that SXL binding downstream of a short uORF imposes a strong negative effect on major reading frame translation. The underlying mechanism involves increasing initiation of scanning ribosomes at the uORF and augmenting its impediment to downstream translation. Our analyses reveal that SXL exerts its effect controlling initiation, not elongation or termination, at the uORF. Probing the generality of the underlying mechanism, we show that the regulatory module that we define experimentally functions in a heterologous context, and we identify natural Drosophila mRNAs that are regulated via this module. We propose that protein-regulated uORFs constitute a systematic principle for the regulation of protein synthesis.
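
For readers unfamiliar with uORFs, the following Python sketch shows one simple way to locate them in a 5' untranslated region. It is an illustration only, not the screening procedure used in the study: the function name `find_uorfs` is hypothetical, and it assumes a uORF is an ATG followed by an in-frame stop codon that lies entirely within the supplied 5' UTR sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_uorfs(utr5):
    """Return (start, end) pairs for ATG...stop ORFs contained in a 5' UTR.

    Coordinates are 0-based, and end is exclusive and includes the stop codon.
    """
    utr5 = utr5.upper()
    uorfs = []
    for start in range(len(utr5) - 2):
        if utr5[start:start + 3] != "ATG":
            continue
        # Walk codon by codon from the ATG until the first in-frame stop.
        for pos in range(start + 3, len(utr5) - 2, 3):
            if utr5[pos:pos + 3] in STOP_CODONS:
                uorfs.append((start, pos + 3))
                break
    return uorfs
```

A search for mRNAs regulated through the module described above would additionally need to look for a protein-binding site downstream of each uORF, which this sketch does not attempt.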

5.
6.
7.
Identification of functional open reading frames in chloroplast genomes   Cited by: 7 (self-citations: 0; citations by others: 7)
K H Wolfe  P M Sharp 《Gene》1988,66(2):215-222
We have used a rapid computer dot-matrix comparison method to identify all DNA regions which have been evolutionarily conserved between the completely sequenced chloroplast genomes of tobacco and a liverwort. Analysis of these regions reveals 74 homologous open reading frames (ORFs) which have been conserved as to length and amino acid sequence; these ORFs also have an excess of nucleotide substitutions at silent sites of codons. Since the nonfunctional parts of these genomes have become saturated with mutations and show no sequence similarity whatsoever, the homologous ORFs are almost certainly functional. A further four pairs of ORFs show homology limited to only a short part of their putative gene products. Amino acid sequence identities range between 50 and 99%; some chloroplast proteins are seen to be among the most slowly evolving of all known proteins. A search of the nucleotide and amino acid sequence databanks has revealed several previously unidentified genes in chloroplast sequences from other species, but no new homologies to prokaryotic genes.
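
The general idea of a dot-matrix comparison can be sketched in a few lines of Python. This is a generic windowed dot plot, not the specific rapid method the authors used: the `window` and `min_matches` parameters are assumed values chosen for illustration, and the quadratic loop would be far too slow for whole chloroplast genomes.

```python
def dot_matrix(seq_a, seq_b, window=10, min_matches=8):
    """Return (i, j) positions where two windows share enough identical bases.

    A dot is recorded when seq_a[i:i+window] and seq_b[j:j+window] agree
    at min_matches or more positions.
    """
    dots = []
    for i in range(len(seq_a) - window + 1):
        window_a = seq_a[i:i + window]
        for j in range(len(seq_b) - window + 1):
            window_b = seq_b[j:j + window]
            matches = sum(a == b for a, b in zip(window_a, window_b))
            if matches >= min_matches:
                dots.append((i, j))
    return dots
```

Conserved regions show up as diagonal runs of dots; runs that coincide with open reading frames in both genomes would be the candidates for homologous ORFs.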

8.
9.
10.
11.
Long Open Reading Frames (ORFs) in antisense DNA strands have been reported in the literature as being rare events. However, an extensive analysis of the GenBank database revealed that a substantial number of genes from several species contain an in-phase ORF in the antisense strand that overlaps the coding sequence of the sense strand entirely, or even extends beyond it. The findings described in this paper show that this is a frequent, non-random phenomenon, which is primarily dependent on codon usage, and to a lesser extent on gene size and GC content. Examination of the sequence database for several prokaryotic and eukaryotic organisms demonstrates that coding sequences with in-phase, 100% overlapping antisense ORFs are present in every genome studied so far.
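
The dependence on codon usage can be made concrete with a short Python sketch. It assumes that "in-phase" means the antisense strand is read on the same codon boundaries as the sense coding sequence; that reading, and the function names, are assumptions for illustration and not taken from the paper. Under this assumption, the antisense frame contains a stop codon only where the sense strand uses TTA, CTA, or TCA, whose reverse complements are TAA, TAG, and TGA.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")
STOP_CODONS = {"TAA", "TAG", "TGA"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def has_full_antisense_orf(cds):
    """True if the in-phase antisense frame has no stop codon over the whole CDS."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    antisense = reverse_complement(cds)
    return all(
        antisense[i:i + 3] not in STOP_CODONS
        for i in range(0, len(antisense), 3)
    )
```

A gene that avoids the three sense codons listed above therefore carries a fully overlapping antisense ORF regardless of its length.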

12.
13.
A method is presented for construction of randomized open reading frame sequences (ORFs) and gene libraries containing them. The building blocks for the ORFs were 75 bp long DNA fragments generated by cloning sequences from a single synthetic oligonucleotide preparation by bridge mutagenesis. The fragments had the property that, regardless of their orientation in the ligated product, the ORF of the construct was maintained. The heterogeneity of the ORFs resulted from the random ligation of 2000 different DNA fragments. The randomized ORFs were cloned downstream from the lac promoter in a multicopy plasmid in Escherichia coli. To test the method, a library of 10^6 clones was constructed.
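
The orientation-independence property can be illustrated with a small Python simulation. It rests on an assumption, not on details from the abstract: a fragment is treated as "orientation safe" when its length is a multiple of 3 (75 bp is 25 codons) and it has no stop codon in frame 0 in either orientation. The function names and the toy 9 bp fragments in the usage note are hypothetical.

```python
import random

COMPLEMENT = str.maketrans("ACGT", "TGCA")
STOP_CODONS = {"TAA", "TAG", "TGA"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def frame0_is_open(seq):
    """True if reading from position 0 meets no stop codon."""
    return all(seq[i:i + 3] not in STOP_CODONS for i in range(0, len(seq) - 2, 3))


def is_orientation_safe(fragment):
    """True if the fragment keeps the frame open in both orientations."""
    fragment = fragment.upper()
    return (
        len(fragment) % 3 == 0
        and frame0_is_open(fragment)
        and frame0_is_open(reverse_complement(fragment))
    )


def random_orf(fragments, n_pieces, rng):
    """Ligate n_pieces randomly chosen fragments, each in a random orientation."""
    pieces = []
    for _ in range(n_pieces):
        piece = rng.choice(fragments).upper()
        if rng.random() < 0.5:
            piece = reverse_complement(piece)
        pieces.append(piece)
    return "".join(pieces)
```

For example, `random_orf(["GCTGCAGCC", "GGCGCCGGA"], 5, random.Random(0))` returns a 45 bp sequence that stays open in frame 0, because both toy fragments pass `is_orientation_safe`.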

14.
Comprehensive open reading frame (ORF) clone collections, ORFeomes, are key components of functional genomics projects. When recombinational cloning systems are used to capture ORFs in master clones, these DNA sequences can be easily transferred into a variety of expression plasmids, each designed for a specific assay. Depending on downstream applications, an ORF is cloned either with or without a stop codon at its original position, referred to as closed or open configuration, respectively. The former is preferred when the encoded protein is produced in its native form or with an amino-terminal tag; the latter is obligatory when the protein is produced as a fusion with a carboxyl-terminal tag. We developed a streamlined protocol for high-throughput, simultaneous cloning of both open and closed ORF entry clones with the Gateway recombinational cloning system. The protocol is straightforward to set up in large-scale ORF cloning projects, and is cost-effective, because the initial ORF amplification and the cloning in a pDONR vector are performed only once to obtain the two ORF configurations. We illustrated its implementation for the isolation and validation of 346 Arabidopsis ORF entry clones.
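
The distinction between the closed and open configurations is simple to express at the sequence level. The Python sketch below only illustrates the definitions given in the abstract (with or without the stop codon at its original position); it does not model the Gateway cloning protocol itself, and the function name is hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def orf_configurations(orf):
    """Return the closed (with stop) and open (stop removed) forms of an ORF."""
    orf = orf.upper()
    if len(orf) % 3 != 0 or orf[-3:] not in STOP_CODONS:
        raise ValueError("expected an in-frame ORF ending in a stop codon")
    return {"closed": orf, "open": orf[:-3]}
```

The open form is the one that can be fused in frame to a carboxyl-terminal tag, since translation continues past the end of the ORF.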

15.
16.
17.
We have analyzed existing methodologies and created novel methodologies for the automatic assignment of S-adenosylmethionine (AdoMet)-dependent methyltransferase functionality to genomic open reading frames based on predicted protein sequences. A large class of the AdoMet-dependent methyltransferases shares a common binding motif for the AdoMet cofactor in the form of a seven-strand twisted beta-sheet; this structural similarity is mirrored in a degenerate sequence similarity that we refer to as methyltransferase signature motifs. These motifs are the basis of our assignments. We find that simple pattern matching based on the motif sequence is of limited utility and that a new method of "sensitized matrices for scoring methyltransferases" (SM2) produced with modified versions of the MEME and MAST tools gives greatly improved results for the Saccharomyces cerevisiae yeast genome. From our analysis, we conclude that this class of methyltransferases makes up approximately 0.6-1.6% of the genes in the yeast, human, mouse, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana, and Escherichia coli genomes. We provide lists of unidentified genes that we consider to have a high probability of being methyltransferases for future biochemical analyses.

18.
19.
20.
The DNA repair protein RecA of Mycobacterium tuberculosis contains an intein, a self-splicing protein element. We have employed this Mtu recA intein to create a selection system for successful intein splicing by inserting it into a kanamycin-resistance gene so that functional antibiotic resistance can only be restored upon protein splicing. We then proceeded to develop an ORFTRAP, i.e., a selection system for the cloning of open reading frames (ORFs). The ORFTRAP exploits the self-splicing properties of inteins (which depend on full-length in-frame translation of a precursor protein) by allowing protein splicing to occur when DNA fragments encoding ORFs are inserted into the Mtu recA intein, whereas DNA fragments containing non-ORFs are selected against. Regions of the Mtu recA intein that tolerate the insertion of additional amino acids were identified by Bgl II linker scanning mutagenesis, and a respective construct was chosen as the ORFTRAP. To test the maximum insert size that could be cloned into ORFTRAP, DNA fragments of increasing length from the Listeria monocytogenes hly gene as well as a genomic library of Haemophilus influenzae were inserted and it was found that the longest permissive inserts were 425 bp and 251 bp, respectively. The H. influenzae ORFTRAP library also demonstrated the strength (strong selection power) and weakness (insertion of very small fragments) of the system. Further modifications should make the ORFTRAP useful for protein expression, epitope mapping, and antigen screening.
