Similar articles
 20 similar articles found
1.
2.
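The recent discovery of small open reading frames (sORFs) with peptide-coding capacity in non-coding RNAs has sparked interest in this long-neglected class of genomic elements, and sORFs have quickly become a major research focus. Because of their low expression levels, low abundance, and short sequences, effective methods and data resources for studying peptide-coding sORFs are still lacking. Existing studies concentrate on a few eukaryotic model organisms, and very little work has addressed prokaryotes, which are widespread in nature. The discovery of peptide-coding sORFs therefore poses a serious challenge to genome annotation under today's demand for precision. Against this background, we first systematically studied the distribution and functional characteristics of peptide-coding sORFs shorter than 100 amino acids in more than 80 prokaryotes of different types, and compared the sequence composition, distribution, and evolutionary characteristics of sORFs in different length ranges. The results show that peptide-coding sORFs are ubiquitous in prokaryotic genomes; as sequence length decreases, sequence complexity decreases and the biological functions performed become more concentrated. On this basis, and in light of the current state of research on peptide-coding sORFs, we summarize the open problems and challenges in the field, laying a theoretical foundation for future research on peptide-coding sORFs.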
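As a rough illustration of what enumerating sORFs shorter than 100 amino acids can look like, the sketch below scans all six reading frames of a DNA string. It is not the pipeline used in the study: the function names, the restriction to ATG as the only start codon, and the coordinate convention are assumptions made here for illustration (prokaryotes also use alternative start codons such as GTG and TTG, which this sketch ignores).

```python
# Illustrative sketch only: a naive six-frame scan, not the study's pipeline.
# Assumptions (not from the abstract): ATG is the only start codon, the
# standard stop codons apply, and the input contains only A/C/G/T.

STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_sorfs(seq, max_aa=100, start_codon="ATG"):
    """List ORFs encoding fewer than max_aa amino acids on both strands.

    Each hit is (strand, start, end, aa_length). start and end are 0-based,
    end-exclusive offsets on the scanned strand, and end includes the stop
    codon. aa_length counts codons from the start codon up to, but not
    including, the stop codon. Start codons inside an already open ORF are
    ignored, so only the longest ORF per stop codon is reported.
    """
    hits = []
    strands = (("+", seq.upper()), ("-", reverse_complement(seq)))
    for strand, s in strands:
        for frame in range(3):
            orf_start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if orf_start is None:
                    if codon == start_codon:
                        orf_start = i
                elif codon in STOP_CODONS:
                    aa_length = (i - orf_start) // 3
                    if aa_length < max_aa:
                        hits.append((strand, orf_start, i + 3, aa_length))
                    orf_start = None
    return hits
```

Calling `find_sorfs(genome_sequence)` returns every complete ORF below the length cutoff; ORFs that run off the end of the sequence without a stop codon are not reported. A real analysis would also need evidence of expression or conservation, since most hits from such a scan are expected to be spurious.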

3.
Molecular characterization of cytopathogenic (cp) bovine viral diarrhea virus (BVDV) strain CP Rit, a temperature-sensitive strain widely used for vaccination, revealed that the viral genomic RNA is about 15.2 kb long, which is about 2.9 kb longer than that of noncytopathogenic (noncp) BVDV strains. Molecular cloning and nucleotide sequencing of parts of the genome resulted in the identification of a duplication of the genomic region encoding nonstructural proteins NS3, NS4A, and part of NS4B. In addition, a nonviral sequence was found directly upstream of the second copy of the NS3 gene. The 3′ part of this inserted sequence encodes an N-terminally truncated ubiquitin monomer. This is remarkable since all described cp BVDV strains with ubiquitin coding sequences contain at least one complete ubiquitin monomer. The 5′ region of the nonviral sequence did not show any homology to cellular sequences identified thus far in cp BVDV strains. Databank searches revealed that this second cellular insertion encodes part of ribosomal protein S27a. Further analyses included molecular cloning and nucleotide sequencing of the cellular recombination partner. Sequence comparisons strongly suggest that the S27a and the ubiquitin coding sequences found in the genome of CP Rit were both derived from a bovine mRNA encoding a hybrid protein with the structure NH2-ubiquitin-S27a-COOH. Polyprotein processing in the genomic region encoding the N-terminal part of NS4B, the two cellular insertions, and NS3 was studied by a transient-expression assay. The respective analyses showed that the S27a-derived polypeptide, together with the truncated ubiquitin, served as processing signal to yield NS3, whereas the truncated ubiquitin alone was not capable of mediating the cleavage. Since the expression of NS3 is strictly correlated with the cp phenotype of BVDV, the altered genome organization leading to expression of NS3 most probably represents the genetic basis of cytopathogenicity of CP Rit.

4.
5.
The gene encoding human hemojuvelin (HJV) is one of the genes that, when mutated, can cause juvenile hemochromatosis, an early-onset inherited disorder associated with iron overload. The 5′ untranslated region of the human HJV mRNA has two upstream open reading frames (uORFs), with 28 and 19 codons formed by two upstream AUGs (uAUGs) sharing the same in-frame stop codon. Here we show that these uORFs decrease the translational efficiency of the downstream main ORF in HeLa and HepG2 cells. Indeed, ribosomal access to the main AUG is conditioned by the strong uAUG context, which results in the first uORF being translated most frequently. The main ORF is then reached by ribosomes that resume scanning after uORF translation. Furthermore, the amino acid sequences of the uORF-encoded peptides also reinforce the translational repression of the main ORF. Interestingly, when iron levels increase, translational repression is relieved specifically in hepatic cells. The upregulation of protein levels occurs along with phosphorylation of the eukaryotic initiation factor 2α. Nevertheless, our results support a model in which the increasing recognition of the main AUG is mediated by a tissue-specific factor that promotes uORF bypass. These results support a tight HJV translational regulation involved in iron homeostasis.

6.
Anabaena sp. strain PCC 7120, widely studied, has 145 annotated transposase genes that are part of transposable elements called insertion sequences (ISs). To determine the entirety of the ISs, we aligned transposase genes and their flanking regions; identified the ISs' possible terminal inverted repeats, usually flanked by direct repeats; and compared IS-interrupted sequences with homologous sequences. We thereby determined both ends of 87 ISs bearing 110 transposase genes in eight IS families (http://www-is.biotoul.fr/) and in a cluster of unclassified ISs, and of hitherto unknown miniature inverted-repeat transposable elements. Open reading frames were then identified to which ISs contributed and others (some encoding proteins of predictable function, including protein kinases and restriction endonucleases) that were interrupted by ISs. Anabaena sp. ISs were often more closely related to exogenous than to other endogenous ISs, suggesting that numerous variant ISs were not degraded within PCC 7120 but transferred from without. This observation leads to the expectation that further sequencing projects will extend this and similar analyses. We also propose an adaptive role for poly(A) sequences in ISs. Insertion sequences (ISs) are transposable elements found in prokaryotic and eukaryotic genomes (17). A fully functional bacterial IS comprises one or more transposase genes, ends that are often inverted repeats (IRs), and, between the transposase genes and the ends, sequences termed linkers (32). Diverse bacterial ISs have been classified, and a searchable database of ISs has been constructed (ISfinder [http://www-is.biotoul.fr/]) (28). Miniature inverted-repeat transposable elements (MITEs) and even smaller mobile elements lack their own transposases and are also found in Anabaena spp. (11, 12, 33). Anabaena sp. strain PCC 7120 (also known as Nostoc sp. [25], here denoted Anabaena sp.) is widely used to study the patterned differentiation of dinitrogen-fixing cells called heterocysts. Transposition of ISs in Anabaena sp. has been documented (1, 7-9). We earlier reported, with few details, three genes that are intercepted by ISs in Anabaena sp. (23). We here describe the approach more extensively, organize the ISs of Anabaena sp., and present our efforts to identify Anabaena sp. open reading frames (ORFs) interrupted or contributed to by ISs.

7.
Identification of Open Reading Frames in Schizosaccharomyces pombe cDNAs (cited by: 1; self-citations: 0; citations by others: 1)
A total of 214 non-overlapping cDNA clones from Schizosaccharomyces pombe were selected and completely sequenced. The clones not previously reported were divided into the following three groups: 1) homologous to Saccharomyces cerevisiae genes (139 clones); 2) homologous to genes from other organisms but not to those from Sac. cerevisiae (4 clones); and 3) no similar sequences (40 clones). Among the 31 sequences identical to those in the public databases, 4 genes have regions corresponding to introns. Protein sequences which had homologs both in budding yeast and mammals were compared with those from Sac. cerevisiae and mammals. The search revealed that the evolutionary distances among these species are similar at least with genes of this category.

8.
9.
An analytical model based on the statistical properties of Open Reading Frames (ORFs) of eubacterial genomes such as codon composition and sequence length of all reading frames was developed. This new model predicts the average length, maximum length as well as the length distribution of the ORFs of 70 species with GC contents varying between 21% and 74%. Furthermore, the number of annotated genes is predicted with good agreement. However, the ORF length distribution in the five alternative reading frames shows interesting deviations from the predicted distribution. In particular, long ORFs appear more often than expected statistically. The unexpected depletion of stop codons in these alternative open reading frames cannot completely be explained by a biased codon usage in the +1 frame. While it is unknown if the stop codon depletion has a biological function, it could be due to a protein coding capacity of alternative ORFs exerting a selection pressure which prevents the fixation of stop codon mutations. The comparison of the analytical model with bacterial genomes, therefore, leads to a hypothesis suggesting novel gene candidates which can now be investigated in subsequent wet lab experiments.
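To make the idea of a statistical null model for ORF length concrete, the sketch below uses the simplest possible stand-in, which is not the model developed in the paper. It assumes independent, identically distributed nucleotides with P(G) = P(C) = gc/2 and P(A) = P(T) = (1 - gc)/2, so that the number of sense codons before the first stop codon follows a geometric distribution. All function names are hypothetical.

```python
# Illustrative null model only; the paper's analytical model is based on
# observed codon composition and is not reproduced here.
# Assumption: i.i.d. nucleotides with P(G)=P(C)=gc/2 and P(A)=P(T)=(1-gc)/2.


def stop_codon_probability(gc):
    """Probability that a random codon is TAA, TAG or TGA."""
    if not 0.0 <= gc < 1.0:
        raise ValueError("gc must be in [0, 1)")
    at = (1.0 - gc) / 2.0   # P(A) = P(T)
    g = gc / 2.0            # P(G) = P(C)
    # TAA contributes at^3; TAG and TGA each contribute at^2 * g.
    return at ** 3 + 2.0 * at ** 2 * g


def expected_orf_length(gc):
    """Mean number of sense codons before the first stop (geometric mean)."""
    p = stop_codon_probability(gc)
    return (1.0 - p) / p


def prob_orf_at_least(k, gc):
    """Probability that an open frame runs for at least k sense codons."""
    p = stop_codon_probability(gc)
    return (1.0 - p) ** k
```

Under this assumption, GC-rich genomes have fewer random stop codons and therefore longer chance ORFs, for example `expected_orf_length(0.74)` is larger than `expected_orf_length(0.21)`. An excess of long ORFs relative to `prob_orf_at_least` is the kind of deviation the abstract describes for the alternative reading frames.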

10.
11.
KaKs_Calculator 3.0 is an updated toolkit that is capable of calculating selective pressure on both coding and non-coding sequences. Similar to the nonsynonymous/synonymous substitution rate ratio for coding sequences, selection on non-coding sequences can be quantified as the ratio of non-coding nucleotide substitution rate to synonymous substitution rate of adjacent coding sequences. As tested on empirical data, KaKs_Calculator 3.0 is effective at detecting the strength and mode of sele...

12.
IS1 is one of the smallest transposable elements found in bacteria (768 bp). It contains eight overlapping open reading frames (ORFs) greater than 50 codons, designated insA to insG and insB'. To determine which of the ORFs actually code for proteins involved in transposition, we have introduced amber codons into each ORF by site-directed mutagenesis which make neutral changes in the overlapping ORFs. Each mutant IS1 was then tested for its ability to mediate cointegrate formation in Su+ and Su- backgrounds. The mutant elements were also tested for trans-complementation in an IS1-free Salmonella background. Our results show that the products of the insA and insB genes are the only ones essential for cointegrate formation. We suggest that other ORFs may, however, encode accessory proteins.

13.
Phosphoethanolamine N-methyltransferase (PEAMT) is involved in choline biosynthesis in plants. The 5′ untranslated region (UTR) of several PEAMT genes was found to contain an upstream open reading frame (uORF). We generated transgenic Arabidopsis calli that expressed a chimeric gene constructed by fusing the 5′ UTR of the Arabidopsis PEAMT gene (AtNMT1) upstream of the β-glucuronidase gene. When the cells were treated with choline, the AtNMT1 uORF was found to be involved in lowering chimeric gene mRNA levels and repressing translation of the downstream β-glucuronidase gene in the calli. Further, we discuss the role of the uORF.

14.

15.
The Kaposi's sarcoma-associated herpesvirus (KSHV) ORF36 protein kinase is translated as a downstream gene from the ORF35-37 polycistronic mRNA via a unique mechanism involving short upstream open reading frames (uORFs) located in the 5′ untranslated region. Here, we confirm that ORF35-37 is functionally dicistronic during infection and demonstrate that mutation of the dominant uORF restricts KSHV replication. Leaky scanning past the uORFs facilitates ORF35 expression, while a reinitiation mechanism after translation of the uORFs enables ORF36 expression.

16.
17.
18.
19.
A fully mature mRNA is usually associated with a reference open reading frame encoding a single protein. Yet, mature mRNAs contain unconventional alternative open reading frames (AltORFs) located in untranslated regions (UTRs) or overlapping the reference ORFs (RefORFs) in non-canonical +2 and +3 reading frames. Although recent ribosome profiling and footprinting approaches have suggested the significant use of unconventional translation initiation sites in mammals, direct evidence of large-scale alternative protein expression at the proteome level is still lacking. To determine the contribution of alternative proteins to the human proteome, we generated a database of predicted human AltORFs revealing a new proteome mainly composed of small proteins with a median length of 57 amino acids, compared to 344 amino acids for the reference proteome. We experimentally detected a total of 1,259 alternative proteins by mass spectrometry analyses of human cell lines, tissues and fluids. In plasma and serum, alternative proteins represent up to 55% of the proteome and may be a potential unsuspected new source for biomarkers. We observed constitutive co-expression of RefORFs and AltORFs from endogenous genes and from transfected cDNAs, including tumor suppressor p53, and provide evidence that out-of-frame clones representing AltORFs are mistakenly rejected as false positives in cDNA screening assays. Functional importance of alternative proteins is strongly supported by significant evolutionary conservation in vertebrates, invertebrates, and yeast. Our results imply that coding of multiple proteins in a single gene by the use of AltORFs may be a common feature in eukaryotes, and confirm that translation of unconventional ORFs generates an as yet unexplored proteome.

20.
The mitochondrial genome of 23 wild-type strains belonging to three different species of the filamentous fungus Podospora was examined. Among the 15 optional sequences identified are two intronic reading frames, nad1-i4-orf1 and cox1-i7-orf2. We show that the presence of these sequences was strictly correlated with tightly clustered nucleotide substitutions in the adjacent exon. This correlation applies to the presence or absence of closely related open reading frames (ORFs), found at the same genetic locations, in all the Pyrenomycete genera examined. The recent gain of these optional ORFs in the evolution of the genus Podospora probably accounts for such sequence differences. In the homoplasmic progeny from heteroplasmons constructed between Podospora strains differing by the presence of these optional ORFs, nad1-i4-orf1 and cox1-i7-orf2 appeared highly invasive. Sequence comparisons in the nad1-i4 intron of various strains of the Pyrenomycete family led us to propose a scenario of its evolution that includes several events of loss and gain of intronic ORFs. These results strongly reinforce the idea that group I intronic ORFs are mobile elements and that their transfer, and concomitant modification of the adjacent exon, could participate in the modular evolution of mitochondrial genomes.


Copyright©北京勤云科技发展有限公司  京ICP备09084417号