首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
3.
Mycoplasma hyopneumoniae is the causative agent of porcine enzootic pneumonia and a major factor in the porcine respiratory disease complex. A clear understanding of the mechanisms of pathogenesis does not exist, although it is clear that M. hyopneumoniae adheres to porcine ciliated epithelium by action of a protein called P97. Previous studies have shown variation in the gene encoding the P97 cilium adhesin in different strains of M. hyopneumoniae, but the extent of genetic variation among field strains across the genome is not known. Since M. hyopneumoniae is a worldwide problem, it is reasonable to expect that a wide range of genetic variability may exist given all of the different breeds and housing conditions. This variation may impact the overall virulence of a single strain. Using microarray technology, this study examined the potential variation of 14 field strains compared to strain 232, on which the array was based. Genomic DNA was obtained, amplified with TempliPhi, and labeled indirectly with Alexa dyes. After genomic hybridization, the arrays were scanned and data were analyzed using a linear statistical model. The results indicated that genetic variation could be detected in all 14 field strains but across different loci, suggesting that variation occurs throughout the genome. Fifty-nine percent of the variable loci were hypothetical genes. Twenty-two percent of the lipoprotein genes showed variation in at least one field strain. A permutation test identified a location in the M. hyopneumoniae genome where there is spatial clustering of variability between the field strains and strain 232.  相似文献   

4.
The 4 202 353 bp genome of the alkaliphilic bacterium Bacillus halodurans C-125 contains 4066 predicted protein coding sequences (CDSs), 2141 (52.7%) of which have functional assignments, 1182 (29%) of which are conserved CDSs with unknown function and 743 (18.3%) of which have no match to any protein database. Among the total CDSs, 8.8% match sequences of proteins found only in Bacillus subtilis and 66.7% are widely conserved in comparison with the proteins of various organisms, including B.subtilis. The B.halodurans genome contains 112 transposase genes, indicating that transposases have played an important evolutionary role in horizontal gene transfer and also in internal genetic rearrangement in the genome. Strain C-125 lacks some of the necessary genes for competence, such as comS, srfA and rapC, supporting the fact that competence has not been demonstrated experimentally in C-125. There is no paralog of tupA, encoding teichuronopeptide, which contributes to alkaliphily, in the C-125 genome and an ortholog of tupA cannot be found in the B.subtilis genome. Out of 11 σ factors which belong to the extracytoplasmic function family, 10 are unique to B.halodurans, suggesting that they may have a role in the special mechanism of adaptation to an alkaline environment.  相似文献   

5.
The complete genomic sequence of an intracellular bacterial pathogen, Mycoplasma penetrans HF-2 strain, was determined. The HF-2 genome consists of a 1 358 633 bp single circular chromosome containing 1038 predicted coding sequences (CDSs), one set of rRNA genes and 30 tRNA genes. Among the 1038 CDSs, 264 predicted proteins are common to the Mycoplasmataceae sequenced thus far and 463 are M.penetrans specific. The genome contains the two-component system but lacks the essential cellular gene, uridine kinase. The relatively large genome of M.penetrans HF-2 among mycoplasma species may be accounted for by both its rich core proteome and the presence of a number of paralog families corresponding to 25.4% of all CDSs. The largest paralog family is the p35 family, which encodes surface lipoproteins including the major antigen, P35. A total of 44 genes for p35 and p35 homologs were identified and 30 of them form one large cluster in the chromosome. The genetic tree of p35 paralogs suggests the occurrence of dynamic chromosomal rearrangement in paralog formation during evolution. Thus, M.penetrans HF-2 may have acquired diverse repertoires of antigenic variation-related genes to allow its persistent infection in humans.  相似文献   

6.
The completed genome of Methanococcus jannaschii, including the main chromosome and two extra-chromosomal elements, predicts a proteome comprised of 1783 proteins. How many of those proteins are expressed at any given time and the relative abundance of the expressed proteins, however, cannot be predicted solely from the genome sequence. Two-dimensional gel electrophoresis coupled with peptide mass spectrometry is being used to identify the proteins expressed by M. jannaschii cells grown under different conditions as part of an effort to correlate protein expression with regulatory mechanisms. Here we describe the identification of 170 of the most abundant proteins found in total lysates of M. jannaschii grown under optimal fermentation conditions. To optimize the number of proteins detected, two different protein specific stains (Coomassie Blue R250 or silver nitrate) and two different first dimension separation methods (isoelectric focusing or nonequilibrium pH gradient electrophoresis) were used. Thirty-two percent of the proteins identified are annotated as hypothetical (21% conserved hypothetical and 11% hypothetical), 21% are enzymes involved in energy metabolism, 12% are proteins required for protein synthesis, and the remainder include proteins necessary for intermediary metabolism, cell division, and cell structure. Evidence of post-translational modification of numerous M. jannaschii proteins has been found, as well as indications of incomplete dissociation of protein-protein complexes. These results demonstrate the complexity of proteome analysis even when dealing with a relatively simple genome.  相似文献   

7.
Knowledge of rice genome brings new dimensions to the management of abiotic stresses; however, gene sequences in the rice genome are yet to be assigned structure and function. Hydrogen peroxide, salicylates and jasmonates act as signal molecules in plants employing common machinery to manage abiotic stress. The present work is primarily focused to assign a structurefunction relationship by modeling of the hypothetical proteins of SA-JA signaling pathway known in Arabidopsis thaliana and compare them with corresponding proteins in rice in silico. Thirteen known gene sequences with their encoded proteins for SA/JA pathway in model plant A. thaliana were obtained and similar gene sequences from rice were retrieved at NCBI. Five rice gene sequences Os09g0392100, Os03g0233200, OsJ_33269, OsJ_23610 and Os01g0194300 resulted in hypothetical protein products with unknown structure and function. Modeling and comparison of 5 proteins from rice and Arabidopsis showed 73 - 98% identity with acceptable RMSD values of 0.6 - 1.7 upon superimposition. Results suggest conserved nature of these proteins during evolution. The hypothetical protein from rice contains similar functional protein domain as that in A. thaliana and therefore are likely to perform similar functions in rice. There is a cross talk between the genes in SA/JA pathway wherein Os09g0392100 or EDS1, Os03g0233200 or PR5, OsJ_33269 or PAD4 and OsJ_23610 or SFD-1 activates the pathway and Os01g0194300 or NPR1 inhibit the pathway. Further investigation through wet-lab experiments are in progress to look into suppression/activation of the genes of SAJA signaling in rice plants exposed to abiotic stress.  相似文献   

8.
The solution structure of MTH1175, a 124-residue protein from the archaeon Methanobacterium thermoautotrophicum has been determined by NMR spectroscopy. MTH1175 is part of a family of conserved hypothetical proteins (COG1433) with unknown functions which contains multiple paralogs from all complete archaeal genomes and the archaeal gene-rich bacterium Thermotoga maritima. Sequence similarity indicates this protein family may be related to the nitrogen fixation proteins NifB and NifX. MTH1175 adopts an α/β topology with a single mixed β-sheet, and contains two flexible loops and an unstructured C-terminal tail. The fold resembles that of Ribonuclease H and similar proteins, but differs from these in several respects, and is not likely to have a nuclease activity. This revised version was published online in July 2006 with corrections to the Cover Date.  相似文献   

9.
We report here the complete genomic sequence of the Chilean human isolate of Andes virus CHI-7913. The S, M, and L genome segment sequences of this isolate are 1,802, 3,641 and 6,466 bases in length, with an overall GC content of 38.7%. These genome segments code for a nucleocapsid protein of 428 amino acids, a glycoprotein precursor protein of 1,138 amino acids and a RNA-dependent RNA polymerase of 2,152 amino acids. In addition, the genome also has other ORFs coding for putative proteins of 34 to 103 amino acids. The encoded proteins have greater than 98% overall similarity with the proteins of Andes virus isolates AH-1 and Chile R123. Among other sequenced Hantavirus, CHI-7913 is more closely related to Sin Nombre virus, with an overall protein similarity of 92%. The characteristics of the encoded proteins of this isolate, such as hydrophobic domains, glycosylation sites, and conserved amino acid motifs shared with other Hantavirus and other members of the Bunyaviridae family, are identified and discussed.  相似文献   

10.
11.
The complete nucleotide sequences of the genomes of the type 2 ( P712 , Ch, 2ab ) and type 3 (Leon 12a1b ) poliovirus vaccine strains were determined. Comparison of the sequences with the previously established genome sequence of type 1 (LS-c, 2ab ) poliovirus vaccine strain revealed that 71% of the nucleotides in the genome RNAs were common, that the 5' and 3' termini of the genomes were highly homologous, and that more than 80% of the nucleotide differences in the coding region occurred in the third letter position of in-phase codons, resulting in a low frequency of amino acid difference. These results strongly suggested that the serotypes of poliovirus derived from a common prototype. A comparison of the amino acid sequences predicted from the genome sequences showed highest variation in the capsid protein region, whereas non-structural proteins are highly conserved. Initiation of polyprotein synthesis occurs in all three strains more than 740 nucleotides downstream from the 5' end. An analysis of the non-coding region suggests that small peptides that could potentially originate from this region are conserved. The amino acid sequences immediately surrounding the cleavage signals, however, show a higher than average degree of variation. The analysis of the amino acid sequences of the capsid protein VP1 of all serotypes has led to the prediction of potential antigenic sites on the virion involved in neutralization.  相似文献   

12.
The Hsp90 family of proteins in Arabidopsis thaliana   总被引:6,自引:0,他引:6       下载免费PDF全文
The 90-kDa heat shock protein (Hsp90) is an essential molecular chaperone in eukaryotic cells, with key roles in the folding and activation of proteins involved in signal transduction and control of the cell cycle. A search for Hsp90 sequences in the Arabidopsis thaliana genome revealed that this family includes 7 members. The AtHsp90-1 through AtHsp90-4 proteins constitute the cytoplasmic subfamily, whereas the AtHsp90-5, AtHsp90-6, and AtHsp90-7 proteins are predicted to be within the plastidial, mitochondrial, and endoplasmic reticulum compartments, respectively. The deduced amino acid sequences of each of the cytoplasmic proteins contains the highly conserved C-terminal pentapeptide MEEVD. All of the AtHsp90 sequences include a conserved adenosine triphosphate-binding domain, whereas only the cytoplasmic and endoplasmic reticulum-resident sequences include an adjacent charged linker domain that is common in mammalian and yeast sequences. The occurrence of multiple AtHsp90 proteins in the cytoplasm and of family members in other subcellular compartments suggests a range of specific functions and target polypeptides.  相似文献   

13.
14.
A hypothetical protein is predicted to be expressed from an open reading frame without known experimental evidence of translation. They constitute a substantial fraction of proteomes. Domain extraction from these hypothetical sequences helps to search for protein coding genes for protein structural and functional annotation. We describe the analysis of prediction data in a sequence dataset of hypothetical protein orthologs of Pongo abelii (orangutan) and Sus scrofa (pig). It should be noted that these orangutan-pig orthologs are also non-homologous to human proteins. These predicted data find application in the genome wide annotation of proteins in poorly understood genomes.

Abbreviations

PDB - Protein Data Bank, DEG - Database of Essential Genes, CDD - Conserved Domain Database, IUCN - International Union for Conservation of Nature.  相似文献   

15.
16.
Members of the Structural Maintenance of Chromosome (SMC) family have long been of interest to molecular and evolutionary biologists for their role in chromosome structural dynamics, particularly sister chromatid cohesion, condensation, and DNA repair. SMC and related proteins are found in all major groups of living organisms and share a common structure of conserved N and C globular domains separated from the conserved hinge domain by long coiled-coil regions. In eukaryotes there are six paralogous proteins that form three heterodimeric pairs, whereas in prokaryotes there is only one SMC protein that homodimerizes. From recently completed genome sequences, we have identified SMC genes from 34 eukaryotes that have not been described in previous reports. Our phylogenetic analysis of these and previously identified SMC genes supports an origin for the vertebrate meiotic SMC1 in the most recent common ancestor since the divergence from invertebrate animals. Additionally, we have identified duplicate copies due to segmental duplications for some of the SMC paralogs in plants and yeast, mainly SMC2 and SMC6, and detected evidence that duplicates of other paralogs were lost, suggesting differential evolution for these genes. Our analysis indicates that the SMC paralogs have been stably maintained at very low copy numbers, even after segmental (genome-wide) duplications. It is possible that such low copy numbers might be selected during eukaryotic evolution, although other possibilities are not ruled out.  相似文献   

17.
Mycoplasma pulmonis is a wall-less eubacterium belonging to the Mollicutes (trivial name, mycoplasmas) and responsible for murine respiratory diseases. The genome of strain UAB CTIP is composed of a single circular 963 879 bp chromosome with a G + C content of 26.6 mol%, i.e. the lowest reported among bacteria, Ureaplasma urealyticum apart. This genome contains 782 putative coding sequences (CDSs) covering 91.4% of its length and a function could be assigned to 486 CDSs whilst 92 matched the gene sequences of hypothetical proteins, leaving 204 CDSs without significant database match. The genome contains a single set of rRNA genes and only 29 tRNAs genes. The replication origin oriC was localized by sequence analysis and by using the G + C skew method. Sequence polymorphisms within stretches of repeated nucleotides generate phase-variable protein antigens whilst a recombinase gene is likely to catalyse the site-specific DNA inversions in major M.pulmonis surface antigens. Furthermore, a hemolysin, secreted nucleases and a glyco-protease are predicted virulence factors. Surprisingly, several of the genes previously reported to be essential for a self-replicating minimal cell are missing in the M.pulmonis genome although this one is larger than the other mycoplasma genomes fully sequenced until now.  相似文献   

18.
Chagas disease (CD) causes the highest burden of parasitic diseases in the Western Hemisphere and is therefore a priority for drug research and development. Platelet-activating factor (PAF) causes the CD parasite Trypanosoma cruzi to differentiate, which suggests that the parasite may express PAF receptors. Here, we explored the T. cruzi proteome for PAF receptor-like proteins. From a total of 23,000 protein sequences, we identified 29 hypothetical proteins that are predicted to have seven transmembrane domains (TMDs), which is the main characteristic of the G protein-coupled receptors (GPCRs), including the PAF receptor. The TMDs of these sequences were independently aligned with domains from 25 animal PAF receptors and the sequences were analysed for conserved residues. The conservation score mean values for the TMDs of the hypothetical proteins ranged from 31.7-44.1%, which suggests that if the putative T. cruzi PAF receptor is among the sequences identified, the TMDs are not highly conserved. These results suggest that T. cruzi contains several GPCR-like proteins and that one of these GPCRs may be a PAF receptor. Future studies may further validate the PAF receptor as a target for CD chemotherapy.  相似文献   

19.
Desulfitobacterium strains have the ability to dechlorinate halogenated compounds under anaerobic conditions by dehalorespiration. The complete genome of the tetrachloroethene (PCE)-dechlorinating strain Desulfitobacterium hafniense Y51 is a 5,727,534-bp circular chromosome harboring 5,060 predicted protein coding sequences. This genome contains only two reductive dehalogenase genes, a lower number than reported in most other dehalorespiring strains. More than 50 members of the dimethyl sulfoxide reductase superfamily and 30 paralogs of the flavoprotein subunit of the fumarate reductase are encoded as well. A remarkable feature of the genome is the large number of O-demethylase paralogs, which allow utilization of lignin-derived phenyl methyl ethers as electron donors. The large genome reveals a more versatile microorganism that can utilize a larger set of specialized electron donors and acceptors than previously thought. This is in sharp contrast to the PCE-dechlorinating strain Dehalococcoides ethenogenes 195, which has a relatively small genome with a narrow metabolic repertoire. A genomic comparison of these two very different strains allowed us to narrow down the potential candidates implicated in the dechlorination process. Our results provide further impetus to the use of desulfitobacteria as tools for bioremediation.  相似文献   

20.
The Mycobacterium tuberculosis genome contains about 4000 genes, of which approximately a third code for proteins of unknown function or are classified as conserved hypothetical proteins. We have determined the three-dimensional structure of one of these, the rv0216 gene product, which has been shown to be essential for M. tuberculosis growth in vivo. The structure exhibits the greatest similarity to bacterial and eukaryotic hydratases that catalyse the R-specific hydration of 2-enoyl coenzyme A. However, only part of the catalytic machinery is conserved in Rv0216 and it showed no activity for the substrate crotonyl-CoA. The structure of Rv0216 allows us to assign new functional annotations to a family of seven other M. tuberculosis proteins, a number if which are essential for bacterial survival during infection and growth.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号