首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The DNA sequences of the entire coding regions of the A and C type variable surface protein genes from Paramecium tetraurelia, stock 51 have been determined. The 8151 nucleotide open reading frame of the A gene contains several tandem repeats of 210 nucleotides within the central portion of the molecule as well as a periodic structure defined by cysteine residues. The 6699 nucleotide open reading frame of the C gene does not contain any identifiable tandem repeats or internal similarity but maintains a periodicity based on the cysteine residue spacing. The deduced amino acid sequences encoded by the two genes are most similar within the 600 amino-terminal and 600 carboxyl-terminal amino acid residues, the central portions show only limited sequence similarity. We conclude that internal repeats are not a conserved feature of variable surface proteins in Paramecium and discuss the possible importance of the regular pattern of cysteine residues.  相似文献   

2.
We introduce a novel approach for the detection of possible mutations leading to a reading frame (RF) shift in a gene. Deletions and insertions of DNA coding regions are considerable events for genes because an RF shift results in modifications of the extensive region of amino acid sequence coded by a gene. The suggested method is based on the phenomenon of triplet periodicity (TP) in coding regions of genes and its relative resistance to substitutions in DNA sequence. We attempted to extend 326 933 regions of continuous TP found in genes from the KEGG databank by considering possible insertions and deletions. We revealed totally 824 genes where such extension was possible and statistically significant. Then we generated amino acid sequences according to active (KEGG''s) and hypothetically ancient RFs in order to find confirmation of a shift at a protein level. Consequently, 64 sequences have protein similarities only for ancient RF, 176 only for active RF, 3 for both and 581 have no protein similarity at all. We aimed to have revealed lower bound for the number of genes in which a shift between RF and TP is possible. Further ways to increase the number of revealed RF shifts are discussed.  相似文献   

3.
Normalized nucleotide and amino acid contents of complete genome sequences can be visualized as radar charts. The shapes of these charts depict the characteristics of an organism’s genome. The normalized values calculated from the genome sequence theoretically exclude experimental errors. Further, because normalization is independent of both target size and kind, this procedure is applicable not only to single genes but also to whole genomes, which consist of a huge number of different genes. In this review, we discuss the applications of the normalization of the nucleotide and predicted amino acid contents of complete genomes to the investigation of genome structure and to evolutionary research from primitive organisms to Homo sapiens. Some of the results could never have been obtained from the analysis of individual nucleotide or amino acid sequences but were revealed only after the normalization of nucleotide and amino acid contents was applied to genome research. The discovery that genome structure was homogeneous was obtained only after normalization methods were applied to the nucleotide or predicted amino acid contents of genome sequences. Normalization procedures are also applicable to evolutionary research. Thus, normalization of the contents of whole genomes is a useful procedure that can help to characterize organisms.  相似文献   

4.
对我国西藏小反刍兽疫病毒野生株China/Tib/Gej/07-30进行基质蛋白(M)和融合蛋白(F)基因序列测定,并进行分子生物学特征分析。首先应用逆转录聚合酶链式反应扩增出M和F基因片段,对聚合酶链式反应产物进行直接测序,然后对测定的核苷酸和推测的氨基酸序列进行比较分析。China/Tib/Gej/07-30的M基因由1483个核苷酸组成,编码335个氨基酸,与其他分离株核苷酸和氨基酸序列同源性分别为92.4%~97.7%和97.0%~98.2%。F基因由2411个核苷酸组成,编码546个氨基酸,与其他分离株核苷酸和氨基酸序列同源性分别为85.5%~96.1%和94.3%~98.2%。China/Tib/Gej/07-30的F蛋白含有信号肽序列和跨膜结构域,序列高度变异。F蛋白第104~108位和第109~133位氨基酸位点分别是高度保守的裂解位点和融合肽结构域。F蛋白还含有序列高度保守的三个七肽重复区。China/Tib/Gej/07-30的M基因3′端的非编码区(UTR)长度为443个核苷酸,GC含量高达68.4%,与其他PPRV毒株的同源性为82.4%~93.5%。China/Tib/Gej/07-30的F基因5′UTR区长度为634个核苷酸,GC含量高达70.0%,与其他PPRV毒株序列相似性为76.2%~91.7%。  相似文献   

5.
The cytochrome oxidase subunit II gene has been localized in the mitochondrial genome of Oenothera berteriana and the nucleotide sequence has been determined. The coding sequence contains 777 bp and, unlike the corresponding gene in Zea mays, is not interrupted by an intron. No TGA codon is found within the open reading frame. The codon CGG, as in the maize gene, is used in place of tryptophan codons of corresponding genes in other organisms. At position 742 in the Oenothera sequence the TGG of maize is changed into a CGG codon, where Trp is conserved as the amino acid in other organisms. Homologous sequences occur more than once in the mitochondrial genome as several mitochondrial DNA species hybridize with DNA probes of the cytochrome oxidase subunit II gene.  相似文献   

6.
We conducted classification for 472,288 regions of triplet periodicity found in 578,868 genes from release 29 of KEGG databank. A new concept of triplet periodicity class and a measure of similarity between them are introduced. Totally 2520 classes were created that contain 94% of found triplet periodicity. For 92% of triplet periodicity regions contained in classes an identical linkage of triplet periodicity to reading frame is observed. For the rest triplet periodicity cases a shift between reading frame of a gene and reading frame common for majority of genes contained in a class of triplet periodicity was observed. These periodicity regions were encoded into hypothetical amino acid sequences in accordance with reading frame built by triplet periodicity class. By BLAST program it was shown that 2660 hypothetical amino acid sequences have statistically significant similarity with proteins from UniProt databank. We suppose that 8% of triplet periodicity regions that joined classes mutated by means of reading frame shift. Created classes of triplet periodicity can be used for identification of coding regions of genes as well as for searching for mutations arisen from reading frame shift.  相似文献   

7.
An algorithm for prediction of the exon-intron structure of higher eukaryotic genes is suggested. The algorithm is based on comparison of genomic sequences of homologous genes from different species. It uses the fact that protein-coding sequences evolve slower than noncoding regions. Unlike the existing comparison methods, the proposed algorithm, which is a modified version of splicing alignment, compares not nucleotide but amino acid sequences, which increases its sensitivity. Conservation of the exon-intron structures of the compared genes is not assumed. The algorithm is implemented in the program Pro-Gen. The testing of the algorithm demonstrated that it can be successfully applied to prediction of vertebrate genes, and in some cases, for more distant comparisons (e.g., vertebrates and insects or nematodes). Thus, the program can be used for prediction of human genes by comparison with genes of model organisms: mouse, fugu, drosophila, and nematode. The algorithm overcomes deficiencies of the existing methods, both statistical (insufficient reliability) and similarity-based (inapplicability to completely new genes).  相似文献   

8.
In viruses an increased coding ability is provided by overlapping genes, in which two alternative open reading frames (ORFs) may be translated to yield two distinct proteins. The identification of signature sequences in overlapping genes is a topic of particular interest, since additional out-of-frame coding regions can be nested within known genes. In this work, a novel feature peculiar to overlapping coding regions is presented. It was detected by analysis of a sample set of 21 virus genomic sequences and consisted in the repeated occurrence of a cluster of basic amino acid residues, encoded by a frame, combined to a stretch of acidic residues, encoded by the corresponding overlapping frame. A computer scan of an additional set of virus sequences demonstrated that this feature is common to several other known overlapping ORFs and led to prediction of a novel overlapping gene in hepatitis G virus (HGV). The occurrence of a bifunctional coding region in HGV was also supported by its extremely lower rate of synonymous nucleotide substitutions compared to that observed in the other gene regions of the HGV genome. Analysis of the amino acid sequence that was deduced from the putative overlapping gene revealed a high content of basic residues and the presence of a nuclear targeting signal; these characteristics suggest that a core-like protein may be expressed by this novel ORF. Received: 21 July 1999 / Accepted: 26 October 1999  相似文献   

9.
A 1500 bp fragment of the Aspergillus nidulans mitochondrial genome contains genes for arginine and asparagine tRNAs, an unassigned reading frame, and the structural gene for ATPase subunit 6. The tRNA genes possess 66% nucleotide homology and possibly originated by a relatively recent duplication event. The unassigned reading frame displays a low level of homology with the human URF A6L. The predicted amino acid sequence of the A-nidulans ATPase subunit 6 gene is 40% homologous to the yeast polypeptide and includes the short, highly conserved regions also present in the equivalent subunits from other mitochondrial systems and from Escherichia coli.  相似文献   

10.
西瓜花叶病毒中国分离株全基因组核苷酸序列测定   总被引:3,自引:0,他引:3  
西瓜花叶病毒(Watermelon mosaic virus,WMV)是马铃薯Y病毒属(Potyvirus)成员,主要危害西瓜和甜瓜,引起花叶病。在田间,该病害主要由蚜虫以非持久性方式传播。西瓜和甜瓜花叶病在国内陕西、山东、云南、辽宁、山西、新疆、河南和黑龙江等地广泛发生[1-6]。从20世纪80年代中期开始发生,逐渐上升为普遍发生的主要病害。我国大部分地区因西瓜和甜瓜病毒病造成的损失为30%~50%,甚至会绝产,西瓜花叶病毒已经成为制约西瓜和甜瓜高产稳产最主要的因素之一[7]。到目前为止,多数工作集中在对西瓜和甜瓜病毒病的鉴定,在分子生物学上仅限于对CP基因…  相似文献   

11.
Sequences in the human genome with homology to the murine mammary tumor virus (MMTV) pol gene were isolated from a human phage library. Ten clones with extensive pol homology were shown to define five separate loci. These loci share common sequences immediately adjacent to the pol-like segments and, in addition, contain a related repeat element which bounds this region. This organization is suggestive of a proviral structure. We estimate that the human genome contains 30 to 40 copies of these pol-related sequences. The pol region of one of the cloned segments (HM16) and the complete MMTV pol gene were sequenced and compared. The nucleotide homology between these pol sequences is 52% and is concentrated in the terminal regions. The MMTV pol gene contains a single long open reading frame encoding 899 amino acids and is demarcated from the partially overlapping putative gag gene by termination codons and a shift in translational reading frame. The pol sequence of HM16 is multiply terminated but does contain open reading frames which encode 370, 105, and 112 amino acid residues in separate reading frames. We deduced a composite pol protein sequence for HM16 by aligning it to the MMTV pol gene and then compared these sequences with other retroviral pol protein sequences. Conserved sequences occur in both the amino and carboxyl regions which lie within the polymerase and endonuclease domains of pol, respectively.  相似文献   

12.
We determined the nucleotide sequence of a region between the gag and pol genes of a replication-competent proviral clone of a human T-cell leukemia virus type I (HTLV-I) from MT-2 cells. This region overlapping the gag and pol genes contains an open reading frame with a different phase from others. The deduced amino acid sequences show significant homology with the known protease gene of other retroviruses, and harbors highly conserved amino acid sequences that are well conserved in other retroviral protease domains. These results indicate that this open reading frame encodes a HTLV-I protease.  相似文献   

13.
测定了蓖麻蚕Samia cynthia ricini线粒体基因组(mtDNA)含完整的细胞色素氧化酶亚基Ⅲ(COX3)、tRNA-Gly和部分的NADH亚基Ⅲ(ND3)基因的DNA片段序列。COX3基因编码框包含789个核苷酸,编码262个氨基酸的蛋白质。通过同源性比较,发现COX3基因的3′端比5′端要保守,其编码的蛋白在C端有两个保守序列存在。COX3的下游为66 bp的tRNA-Gly基因。蓖麻蚕的COX3与家蚕COX3同源性最高,核苷酸和氨基酸序列同源性分别是80.2%和85.6%。根据COX3氨基酸序列进行了12种无脊椎动物的分子进化树分析,认为在采用线粒体基因序列进行分子进化分析时,应该综合考虑物种的繁殖模式及生态特点。  相似文献   

14.
In order to amplify the complete genome of enJSRV from the strain of Inner Mongolia (enJSRV-NM), we used enJSRV-specific and JSRV-specific DNA probes in dot blot hybridization. Seven pairs of primers were designed based on Genbank sequences. Seven fragments were obtained by PCR and were cloned into the PMD19- T vectors. The recombinant plasmids were sequenced and analyzed. The results showed that the genome was 7 942 bp in length and contained four overlapping open reading frames corresponding to the gag, pro, pol and env genes as well as an additional open reading frame (orf-x) that overlaps the 3' end of the pol gene. The nucleotide acid sequences of the enJSRV-NM loci were compared with the sequences of South Africa enJS56A1 strain (Accession No. AF153615) and USA JSRV21 strain (Accession No. AF105220). The nucleotide acid identities were 99.2% and 92.3% respectively. Two zinc fingers were found in the NC region in the predicted amino acid sequence. However, the YXXM motif, which is a reliable molecular marker for the infectious exogenous virus, was not found in the TM region. It was found that the enJSRV-NM region was 90%-98% identical at the amino acid level to its exogenous infectious counterparts in most of the retroviral genome. This is the first nucleotide sequence of enJSRV reported in P.R China. The resource work has provided a wide range of information useful not only for expression genomics and annotation of genomic DNA sequence, but also for further research on the clinical diagnosis of OPA.  相似文献   

15.
A 5.2 kb PstI restriction fragment containing the atpA gene cluster of the plastic genome of the centric diatom Odontella sinensis was cloned. Sequencing revealed a reading frame of 561 bp separating the genes atpF and atpA, which is preceded by a putative ribosome binding site. The third nucleotide of the codon for the last amino acid of atpF is the first nucleotide of the initiation codon of the 561 bp reading frame. The amino acid sequence deduced from the nucleotide sequence of this gene (ntpD) is colinear with δ subunits of different F0F1-ATPases and shows an overall sequence homology of up to 35% when compared with the sequences of cyanobacteria and Cyanophora paradoxa. The results are discussed in context with the evolution of chloroplasts of the chlorophyll-a + b and -a + c lineages, respectively.  相似文献   

16.
In order to amplify the complete genome of enJSRV from the strain of Inner Mongolia (enJSRV-NM), we used enJSRV-specific and JSRV-specific DNA probes in dot blot hybridization. Seven pairs of primers were designed based on Genbank sequences. Seven fragments were obtained by PCR and were cloned into the PMD19-T vectors. The recombinant plasmids were sequenced and analyzed. The results showed that the genome was 7 942 bp in length and contained four overlapping open reading frames corresponding to the gag, pro, pol and env genes as well as an additional open reading frame (orf-x) that overlaps the 3' end of the pol gene. The nucleotide acid sequences of the enJSRV-NM loci were compared with the sequences of South Africa enJS56A1 strain (Accession No.AF153615) and USA JSRV21 strain (Accession No. AF105220). The nucleotide acid identities were 99.2% and 92.3% respectively. Two zinc fingers were found in the NC region in the predicted amino acid sequence. However, the YXXM motif, which is a reliable molecular marker for the infectious exogenous virus, was not found in the TM region. It was found that the enJSRV-NM region was 90%-98% identical at the amino acid level to its exogenous infectious counterparts in most of the retroviral genome. This is the first nucleotide sequence of enJSRV reported in P.R China. The resource work has provided a wide range of information useful not only for expression genomics and annotation of genomic DNA sequence, but also for further research on the clinical diagnosis of OPA.  相似文献   

17.
The nucleotide sequence of the gag gene of feline leukemia virus and its flanking sequences were determined and compared with the corresponding sequences of two strains of feline sarcoma virus and with that of the Moloney strain of murine leukemia virus. A high degree of nucleotide sequence homology between the feline leukemia virus and murine leukemia virus gag genes was observed, suggesting that retroviruses of domestic cats and laboratory mice have a common, proximal evolutionary progenitor. The predicted structure of the complete feline leukemia virus gag gene precursor suggests that the translation of nonglycosylated and glycosylated gag gene polypeptides is initiated at two different AUG codons. These initiator codons fall in the same reading frame and are separated by a 222-base-pair segment which encodes an amino terminal signal peptide. The nucleotide sequence predicts the order of amino acids in each of the individual gag-coded proteins (p15, p12, p30, p10), all of which derive from the gag gene precursor. Stable stem-and-loop secondary structures are proposed for two regions of viral RNA. The first falls within sequences at the 5' end of the viral genome, together with adjacent palindromic sequences which may play a role in dimer linkage of RNA subunits. The second includes coding sequences at the gag-pol junction and is proposed to be involved in translation of the pol gene product. Sequence analysis of the latter region shows that the gag and pol genes are translated in different reading frames. Classical consensus splice donor and acceptor sequences could not be localized to regions which would permit synthesis of the expected gag-pol precursor protein. Alternatively, we suggest that the pol gene product (RNA-dependent DNA polymerase) could be translated by a frameshift suppressing mechanism which could involve cleavage modification of stems and loops in a manner similar to that observed in tRNA processing.  相似文献   

18.
19.
A family of interspersed repetitive elements, RlBamHI, with sequence similarity to the transposase of hAT DNA transposons, occurs in genomes of eight western Palaearctic water frog taxa and the brown frog Rana dalmatina , but was not detected in Xenopus laevis or Salamandra salamandra . RlBamHI elements are not tandemly arrayed, are dispersed across all chromosomes although not uniformly distributed, and based on dot-blot hybridizations may constitute as much as approximately 10% of the genomes of Rana lessonae , Rana ridibunda , and Rana perezi , but only approximately 1% of that of Rana saharica . Eleven nucleotide sequences of a 572 bp fragment from the nine taxa are very similar (pairwise differences 0.4–8.1% for nucleotides, 0.6–18.2% for amino acids), and all share a single open reading frame across the whole RlBamHI fragment. The reading frame is maintained despite several indels, most of which are multiples of 3 bp, but a pair of which in one species alters and, after 13 codons, restores the reading frame. It is possible that the reading frame is selectively maintained, suggesting recent or even present transposition capacity. The amino acid sequences encoded by RlBamHI elements, but not the nucleotide sequences themselves, reveal the similarity of RlBamHI to members of the widespread hAT superfamily of DNA transposons. Amino acid sequence comparisons permitted no convincing phylogenetic placement of RlBamHI among 32 representative hAT transposons across organisms, probably reflecting both the genetic divergence of RlBamHI elements and the paucity of aligned residues available for analysis. Among DNA transposons, a genomic fraction of 10% is extraordinarily high. Phylogenetic analyses of Rana RlBamHI sequences match poorly with independent genetic and molecular phylogenies; the elements compared probably are not orthologous, which renders their sequences inadequate for reconstructing organismal phylogenetic histories.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号