Similar articles
1.
2.
The numbers of potential response regulator genes were determined from the complete and annotated genome sequences of Archaea and Bacteria. The numbers of each class of response regulators are shown for each organism, determined principally from BLASTP searches, but with reference to the gene category lists where available. The survey shows that for Bacteria there is a link between the total number of potential response regulator genes and both the genome complexity (number of potential protein-coding genes) and the organism's lifestyle/habitat. Increasingly complex lifestyles and genome complexities are matched by an increase in the average number of potential response regulator genes per genome, indicating that a higher degree of complexity requires a higher level of control of gene expression and cellular activity. Detailed results of this study are available online at and.

3.
Zhang QY, Xiao F, Xie J, Li ZQ, Gui JF. Journal of Virology, 2004, 78(13): 6982-6994
Lymphocystis diseases in fish throughout the world have been extensively described. Here we report the complete genome sequence of lymphocystis disease virus isolated in China (LCDV-C), an LCDV isolated from cultured flounder (Paralichthys olivaceus) with lymphocystis disease in China. The LCDV-C genome is 186,250 bp, with a base composition of 27.25% G+C. Computer-assisted analysis revealed 240 potential open reading frames (ORFs) and 176 nonoverlapping putative viral genes, which encode polypeptides ranging from 40 to 1,193 amino acids. The coding density is 67%, and the average length of each ORF is 702 bp. A search of the GenBank database using the 176 individual putative genes revealed 103 homologues to the corresponding ORFs of LCDV-1 and 73 potential genes that were not found in LCDV-1 and other iridoviruses. Among the 73 genes, there are 8 genes that contain conserved domains of cellular genes and 65 novel genes that do not show any significant homology with the sequences in public databases. Although a certain degree of similarity between putative gene products of LCDV-C and corresponding proteins of LCDV-1 was revealed, no colinearity was detected when their ORF arrangements and coding strategies were compared to each other, suggesting that a high degree of genetic rearrangement has occurred between them. A large number of tandem and overlapping repeated sequences were also observed in the LCDV-C genome. The deduced amino acid sequence of the major capsid protein (MCP) presents the highest identity to those of LCDV-1 and other iridoviruses among the LCDV-C gene products. Furthermore, a phylogenetic tree was constructed based on the multiple alignments of nine MCP amino acid sequences. Interestingly, LCDV-C and LCDV-1 were clustered together, but their amino acid identity is much less than that in other clusters. The unexpected levels of divergence between their genomes in size, gene organization, and gene product identity suggest that LCDV-C and LCDV-1 should not belong to the same species and that LCDV-C should be considered a species distinct from LCDV-1.

4.
Of 30 baculovirus genomes that have been sequenced to date, the only nonlepidopteran baculoviruses include the dipteran Culex nigripalpus nucleopolyhedrovirus and two hymenopteran nucleopolyhedroviruses that infect the sawflies Neodiprion lecontei (NeleNPV) and Neodiprion sertifer (NeseNPV). This study provides a complete sequence and genome analysis of the nucleopolyhedrovirus that infects the balsam fir sawfly Neodiprion abietis (Hymenoptera, Symphyta, Diprionidae). The N. abietis nucleopolyhedrovirus (NeabNPV) is 84,264 bp in size, with a G+C content of 33.5%, and contains 93 predicted open reading frames (ORFs). Eleven predicted ORFs are unique to this baculovirus, 10 ORFs have a putative sequence homologue in the NeleNPV genome but not the NeseNPV genome, and 1 ORF (neab53) has a putative sequence homologue in the NeseNPV genome but not the NeleNPV genome. Specific repeat sequences are coincident with major genome rearrangements that distinguish NeabNPV and NeleNPV. Genes associated with these repeat regions encode a common amino acid motif, suggesting that they are a family of repeated contiguous gene clusters. Lepidopteran baculoviruses, similarly, have a family of repeated genes called the bro gene family. However, there is no significant sequence similarity between the NeabNPV and bro genes. Homologues of early-expressed genes such as ie-1 and lef-3 were absent in NeabNPV, as they are in the previously sequenced hymenopteran baculoviruses. Analyses of ORF upstream sequences identified potential temporally distinct genes on the basis of putative promoter elements.

5.
Sixty-two DNA sequences for the coding regions of omega-secalin (ω-secalin) genes have been characterized from rye (Secale cereale L.), hexaploid and octoploid triticale (× Triticosecale Wittmack), and wheat (Triticum aestivum L.) 1BL/1RS translocation line. Only 19 out of the 62 ω-secalin gene sequences were full-length open reading frames (ORFs), which can be expressed as functional proteins. The other 43 DNA sequences were pseudogenes, as their ORFs were interrupted by one or a few stop codons or frameshift mutations. The 19 ω-secalin genes have a typical primary structure, which is different from wheat gliadins. There was no cysteine residue in ω-secalin proteins, and the potential celiac disease (CD) toxic epitope (PQQP) was identified to appear frequently in the repetitive domains. The ω-secalin genes from various cereal species shared high homology in their gene sequences. The ω-secalin gene family has involved fewer variations after the integration of the rye R chromosome or whole genome into the wheat or triticale genome. The higher Ka/Ks ratio (i.e. non-synonymous to synonymous substitutions per site) in ω-secalin pseudogenes than in ω-secalin ORFs indicates that the pseudogenes may be subject to a reduced selection pressure. Based on the conserved sequences of ω-secalin genes, it will be possible to manipulate the expression of this gene family in rye, triticale, or wheat 1BL/1RS translocation lines, to reduce its negative effects on grain quality.

6.
Gilligan P, Brenner S, Venkatesh B. Gene, 2002, 294(1-2): 35-44
The compact genome of the pufferfish, Fugu rubripes, has been proposed as a 'reference' genome to aid in annotating and analysing the human genome. We have annotated and compared 85 kb of Fugu sequence containing 17 genes with its homologous loci in the human draft genome and identified three 'novel' human genes that were missed or incompletely predicted by the previous gene prediction methods. Two of the novel genes contain zinc finger domains and are designated ZNF366 and ZNF367. They map to human chromosomes 5q13.2 and 9q22.32, respectively. The third novel gene, designated C9orf21, maps to chromosome 9q22.32. This gene is unique to vertebrates, and the protein encoded by it does not contain any known domains. We could not find human homologs for two Fugu genes, a novel chemokine gene and a kinase gene. These genes are either specific to teleosts or lost in the human lineage. The Fugu-human comparison identified several conserved non-coding sequences in the promoter and intronic regions. These sequences, conserved during 450 million years of vertebrate evolution, are likely to be involved in gene regulation. The 85 kb Fugu locus is dispersed over four human loci, occupying about 1.5 Mb. Contiguity is conserved in the human genome between six out of 16 Fugu gene pairs. These contiguous chromosomal segments should share a common evolutionary history dating back to the common ancestor of mammals and teleosts. We propose contiguity as strong evidence to identify orthologous genes in distant organisms. This study confirms the utility of the Fugu as a supplementary tool to uncover and confirm novel genes and putative gene regulatory regions in the human genome.

7.
Primary sequence patterns based on known conserved sites in eukaryotic protein kinases were used to search for eukaryotic-like protein kinase sequences in a six-frame translation of the bacterial subsection of GenBank. This search identified a previously unrecognized eukaryotic-like protein kinase gene in three related methanogenic archaebacteria, Methanococcus vannielii, M. voltae, and M. thermolithotrophicus. The proposed coding sequences are located in orthologous open reading frames (ORFs): ORF547, ORF294, and ORF114, respectively. The C-terminus of the ORFs contains 9 of the 11 subdomains characteristically conserved within the eukaryotic protein kinase catalytic domain. The N-terminus of the ORFs is similar to a putative glycoprotease in Pasteurella haemolytica and its homologue in Escherichia coli, the orfX gene. This is the first report of a eukaryotic-like protein kinase sequence observed in Archaebacteria.
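
The search strategy described in this abstract (translate a nucleotide sequence in all six reading frames, then scan each translation for a conserved protein pattern) can be sketched roughly as follows. This is a minimal illustration, not the authors' method: the function names are hypothetical, and the glycine-rich loop consensus G-x-G-x-x-G is used only as one well-known conserved protein kinase site, since the abstract does not say which patterns were actually searched.

```python
import re
from itertools import product

# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna: str) -> str:
    """Translate complete codons only; unknown codons become 'X'."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def six_frame_translation(dna: str) -> dict:
    """Map frame labels (+1..+3, -1..-3) to protein translations."""
    dna = dna.upper()
    frames = {}
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(seq[offset:])
    return frames


def find_motif(dna: str, pattern: str) -> list:
    """Return (frame, 0-based protein position, match) for each motif hit."""
    hits = []
    for frame, protein in six_frame_translation(dna).items():
        for m in re.finditer(pattern, protein):
            hits.append((frame, m.start(), m.group()))
    return hits


# Illustrative pattern only (an assumption): glycine-rich loop G-x-G-x-x-G.
GLY_LOOP = r"G.G..G"
example_dna = "ATGGGTGCTGGTGCTTTTGGTAAAGTT"  # made-up sequence for the demo
hits = find_motif(example_dna, GLY_LOOP)
```

A real search of this kind would use several patterns and a whole database division; the sketch only shows the mechanics for a single short sequence.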

8.
A "gene-island" sequencing strategy has been developed that expedites the targeted acquisition of orthologous gene sequences from related species for comparative genome analysis. A 152-kb bacterial artificial chromosome (BAC) clone from sorghum (Sorghum bicolor) encoding phytochrome A (PHYA) was fully sequenced, revealing 16 open reading frames with a gene density similar to many regions of the rice (Oryza sativa) genome. The sequences of genes in the orthologous region of the maize (Zea mays) and rice genomes were obtained using the gene-island sequencing method. BAC clones containing the orthologous maize and rice PHYA genes were identified, sheared, subcloned, and probed with the sorghum PHYA-containing BAC DNA. Sequence analysis revealed that approximately 75% of the cross-hybridizing subclones contained sequences orthologous to those within the sorghum PHYA BAC and less than 25% contained repetitive and/or BAC vector DNA sequences. The complete sequence of four genes, including up to 1 kb of their promoter regions, was identified in the maize PHYA BAC. Nine orthologous gene sequences were identified in the rice PHYA BAC. Sequence comparison of the orthologous sorghum and maize genes aided in the identification of exons and conserved regulatory sequences flanking each open reading frame. Within genomic regions where micro-colinearity of genes is absolutely conserved, gene-island sequencing is a particularly useful tool for comparative analysis of genomes between related species.

9.
A novel bacteriophage that infects Shigella sonnei was isolated from the Gap River in Korea, and its phenotypic and genomic characteristics were investigated. The virus, called SP18, showed morphology characteristic of the family Myoviridae, and phylogenetic analysis of major capsid gene (gp23) sequences classified it as a T4-like phage. Based on host spectrum analysis, it is lytic to S. sonnei, but not to Shigella flexneri, Shigella boydii or members of the genera Escherichia and Salmonella. Pyrosequencing of the SP18 bacteriophage genome revealed a sequence 170 kb in length. In total, 286 ORFs and 3 tRNA genes were identified, and 259 ORFs showed similarity (BLASTP e-value < 0.001) to genes of other bacteriophages. The results from comparative genomic analysis indicated that the enterophage JS98, isolated from human stool, is the closest relative of SP18. Based on phylogenetic analysis of gp23 protein-coding sequences, dot plot comparison and BLASTP analysis of genomes, SP18 and JS98 appear to be closely related to T4-even phages. However, several insertions, deletions, and duplications indicate differences between SP18 and JS98. Comparison of duplicated gp24 genes and the soc gene showed that duplication events are responsible for the differentiation and evolution of T4-like bacteriophages.

10.
Within the framework of an international Bacillus subtilis genome sequencing project, we have determined a 36-kb sequence covering the region between the gntZ and trnY genes. In addition to five genes sequenced and characterized previously, 27 putative protein coding sequences (open reading frames; ORFs) were identified. A homology search for the newly identified ORFs revealed that six of them had similarities to known proteins. It is notable that new ORFs belonging to the response-regulator aspartate phosphatase (Rap) and its regulator (Phr) families, and to the response regulator and sensory kinase families of two-component signal transduction systems, have been identified. Furthermore, we found that a non-coding sequence of some 180 bp, which might be a remnant of an ancient IS element, is preserved in at least five loci of the B. subtilis genome.

11.
In this study, comparative high resolution genetic mapping of the GA-insensitive dwarfing gene sdw3 of barley revealed highly conserved macrosynteny of the target region on barley chromosome 2HS with rice chromosome 7L. A rice contig covering the sdw3-orthologous region was identified and subsequently exploited for marker saturation of the target interval in barley. This was achieved by (1) mapping of rice markers from the orthologous region of the rice genetic map, (2) mapping of rice ESTs that had been physically localized on the rice contig, or (3) mapping of barley ESTs that show strong sequence similarity to coding sequences present in the rice contig. Finally, the sdw3 gene was mapped to an interval of 0.55 cM in barley, corresponding to a physical distance of about 252 kb in rice, after employing orthologous EST-derived rice markers. Three putative ORFs were identified in this interval in rice, which exhibited significant sequence similarity to known signal regulator genes from different species. These ORFs can serve as starting points for the map-based isolation of the sdw3 gene from barley. Communicated by R. Hagemann

12.
The complete sequence of the genome of an aerobic hyper-thermophilic crenarchaeon, Aeropyrum pernix K1, which optimally grows at 95°C, has been determined by the whole genome shotgun method with some modifications. The entire length of the genome was 1,669,695 bp. The authenticity of the entire sequence was supported by restriction analysis of long PCR products, which were directly amplified from the genomic DNA. As the potential protein-coding regions, a total of 2,694 open reading frames (ORFs) were assigned. By similarity search against public databases, 633 (23.5%) of the ORFs were related to genes with putative function and 523 (19.4%) to the sequences registered but with unknown function. All the genes in the TCA cycle except for that of alpha-ketoglutarate dehydrogenase were included, and instead of the alpha-ketoglutarate dehydrogenase gene, the genes coding for the two subunits of 2-oxoacid:ferredoxin oxidoreductase were identified. The remaining 1,538 ORFs (57.1%) did not show any significant similarity to the sequences in the databases. Sequence comparison among the assigned ORFs suggested that a considerable number of ORFs were generated by sequence duplication. The RNA genes identified were a single 16S–23S rRNA operon, two 5S rRNA genes and 47 tRNA genes including 14 genes with intron structures. All the assigned ORFs and RNA coding regions occupied 89.12% of the whole genome. The data presented in this paper are available on the internet homepage (http://www.mild.nite.go.jp).
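
The ORF categories reported in this abstract can be checked with a few lines of arithmetic. The sketch below uses only the counts quoted above; the variable and function names are purely illustrative. It confirms that the three similarity classes account for all 2,694 ORFs and recomputes the quoted percentages, plus the average genome length per ORF as a derived figure the abstract does not state.

```python
TOTAL_ORFS = 2694        # ORFs assigned in the A. pernix K1 genome
GENOME_BP = 1_669_695    # genome length in base pairs

# Counts per similarity class, as quoted in the abstract.
orf_classes = {
    "putative function": 633,
    "registered, unknown function": 523,
    "no significant similarity": 1538,
}


def percentages(counts: dict, total: int) -> dict:
    """Share of each class in percent, rounded to one decimal place."""
    return {name: round(100 * n / total, 1) for name, n in counts.items()}


shares = percentages(orf_classes, TOTAL_ORFS)

# Derived figure (not in the abstract): mean genome length per assigned ORF.
bp_per_orf = GENOME_BP / TOTAL_ORFS

for name, share in shares.items():
    print(f"{name}: {share}%")
print(f"about {bp_per_orf:.0f} bp of genome per ORF")
```

The three shares sum to 100% because the classes are mutually exclusive and exhaustive in the abstract's own counts.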

13.
Mizuno Takeshi. DNA Research, 1997, 4(2): 161-168
Bacteria have devised sophisticated His-Asp phosphorelay signaling systems for eliciting a variety of adaptive responses to their environment, which are generally referred to as the "two-component regulatory system." The widespread occurrence of the His-Asp phosphorelay signaling in both prokaryotes and eukaryotes implies that it is a powerful device for a wide variety of adaptive responses of cells to their environment. The two-component signal transducers contain one or more of three common and characteristic phosphotransfer signaling domains, named the "transmitter, receiver, and histidine-containing phosphotransfer (HPt) domains." The recently determined entire genomic sequence of Escherichia coli allowed us to compile systematically a complete list of genes encoding such two-component signal transduction proteins. The results of such an effort, made in this study, revealed that at least 62 open reading frames (ORFs) were identified as putative members of the two-component signal transducers in this single species. Among them, 32 were identified as response regulators and 23 were identified as orthodox sensory kinases. In addition, E. coli has five hybrid sensory kinases. The precise location of each ORF was mapped on a physical map of the entire E. coli genome. All of these ORFs were then compiled and annotated extensively.

14.
The nucleotide sequence of the genome of equine arteritis virus (EAV) was determined from a set of overlapping cDNA clones and was found to contain eight open reading frames (ORFs). ORFs 2 through 7 are expressed from six 3'-coterminal subgenomic mRNAs, which are transcribed from the 3'-terminal quarter of the viral genome. A number of these ORFs are predicted to encode structural EAV proteins. The organization and expression of the 3' part of the EAV genome are remarkably similar to those of coronaviruses and toroviruses. The 5'-terminal three-quarters of the genome contain the putative EAV polymerase gene, which also shares a number of features with the corresponding gene of corona- and toroviruses. The gene contains two large ORFs, ORF1a and ORF1b, with an overlap region of 19 nucleotides. The presence of a "shifty" heptanucleotide sequence in this region and a downstream RNA pseudoknot structure indicate that ORF1b is probably expressed by ribosomal frameshifting. The frameshift-directing potential of the ORF1a/ORF1b overlap region was demonstrated by using a reporter gene. Moreover, the predicted ORF1b product was found to contain four domains which have been identified in the same relative positions in coronavirus and torovirus ORF1b products. The sequences of the EAV and coronavirus ORF1a proteins were found to be much more diverged. The EAV ORF1a product contains a putative trypsinlike serine protease motif. Our data indicate that EAV, presently considered a togavirus, is evolutionarily related to viruses from the coronaviruslike superfamily.

15.
The genome sequence of Thermotoga maritima revealed that 24% of its open reading frames (ORFs) showed the highest similarity scores to archaeal genes in BLAST analyses. Here we screened 16 strains from the genus Thermotoga and other related Thermotogales for the occurrence of two of these "archaeal" genes: the gene encoding the large subunit of glutamate synthase (gltB) and the myo-inositol 1P synthase gene (ino1). Both genes were restricted to the Thermotoga species within the Thermotogales. The distribution of the two genes, along with results from phylogenetic analyses, showed that they were acquired from Archaea during the divergence of the Thermotogales. Database searches revealed that three other bacteria (Dehalococcoides ethenogenes, Sinorhizobium meliloti, and Clostridium difficile) possess archaeal-type gltBs, and the phylogenetic analyses confirmed at least two lateral gene transfer (LGT) events between Bacteria and Archaea. These LGT events were also strongly supported by gene structure data, as the three domains in bacterial-type gltB are homologous to three independent ORFs in Archaea and Bacteria with archaeal-type gltBs. The ino1 gene has a scattered distribution among Bacteria, and apart from the Thermotoga strains it is found only in Aquifex aeolicus, D. ethenogenes, and some high-G+C Gram-positive bacteria. Phylogenetic analysis of the ino1 sequences revealed three highly supported prokaryotic clades, all containing a mixture of archaeal and bacterial sequences, and suggested that all bacterial ino1 genes had been recruited from archaeal donors. The Thermotoga strains and A. aeolicus acquired this gene independently from different archaeal species. Although transfer of genes from hyperthermophilic Archaea may have facilitated the evolution of bacterial hyperthermophily, between-domain transfers also affect mesophilic species. For hyperthermophiles, we hypothesize that LGT may be as much a consequence as the cause of adaptation to hyperthermophily.

16.
In contrast to mammals, the evolution of MHC genes in birds appears to be characterized by high rates of gene duplication and concerted evolution. To further our understanding of the evolution of passerine MHC genes, we have isolated class II B sequences from two species of New Zealand robins, the South Island robin (Petroica australis australis), and the endangered Chatham Island black robin (Petroica traversi). Using an RT-PCR based approach we isolated four transcribed class II B MHC sequences from the black robin, and eight sequences from the South Island robin. RFLP analysis indicated that all class II B loci were contained within a single linkage group. Analysis of 3'-untranslated region sequences enabled putative orthologous loci to be identified in the two species, and indicated that multiple rounds of gene duplication have occurred within the MHC of New Zealand robins. The orthologous relationships are not retained within the coding region of the gene; instead, the sequences group within species. A number of putative gene conversion events were identified across the length of our sequences that may account for this. Exon 2 sequences are highly diverse and appear to have diverged under balancing selection. It is also possible that gene conversion involving short stretches of sequence within exon 2 adds to this diversity. Our study is the first report of putative orthologous MHC loci in passerines, and provides further evidence for the importance of gene duplication and gene conversion in the evolution of the passerine MHC. Nucleotide sequence data reported in this paper are available in the GenBank database under the accession numbers AY258333–AY258335, AY428561–AY428570, and AY530534–AY530535.

17.
18.
19.
The nucleotide sequences of three independent fragments (designated no. 3, 4, and 9; each 15–20 kb in size) of the genome of alkaliphilic Bacillus sp. C-125 cloned in a λ phage vector have been determined. Thirteen putative open reading frames (ORFs) were identified in sequenced fragment no. 3 and 11 ORFs were identified in no. 4. Twenty ORFs were also identified in fragment no. 9. All putative ORFs were analyzed in comparison with the BSORF database and non-redundant protein databases. The functions of 5 ORFs in fragment no. 3 and 3 ORFs in fragment no. 4 were suggested by their significant similarities to known proteins in the database. Among the 20 ORFs in fragment no. 9, the functions of 11 ORFs were similarly suggested. Most of the annotated ORFs in the DNA fragments of the genome of alkaliphilic Bacillus sp. C-125 were conserved in the Bacillus subtilis genome. The organization of ORFs in the genome of strain C-125 was found to differ from the order of genes in the chromosome of B. subtilis, although some gene clusters (ydh, yqi, yer, and yts) were conserved as operon units, as in B. subtilis. Received: April 17, 1998 / Accepted: June 23, 1998

20.