首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 18 毫秒
1.
B. L. Hibner  W. D. Burke    T. H. Eickbush 《Genetics》1991,128(3):595-606
The multigene families that encode the chorion (eggshell) of the silk moth, Bombyx mori, are closely linked on one chromosome. We report here the isolation and characterization of two segments, totaling 102 kb of genomic DNA, containing the genes expressed during the early period of choriogenesis. Most of these early genes can be divided into two multigene families, ErA and ErB, organized into five divergently transcribed ErA/ErB gene pairs. Nucleotide sequence identity in the major coding regions of the ErA genes was 96%, while nucleotide sequence identity for the ErB major coding regions was only 63%. Selection pressure on the encoded proteins cannot explain this difference in the level of sequence conservation between the ErA and ErB gene families, since when only fourfold redundant codon positions are considered, the divergence within the ErA genes is 8%, while the divergence within the ErB genes (corrected for multiple substitutions at the same site) is 110%. The high sequence identity of the ErA major exons can be explained by sequence exchange events similar to gene conversion localized to the major exon of the ErA genes. These gene conversions are correlated with the presence of clustered copies of the nucleotide sequence GGXGGX, encoding paired glycine residues. This sequence has previously been correlated with gradients of gene conversion that extend throughout the coding and noncoding regions of the High-cysteine (Hc) chorion genes of B. mori. We suggest that the difference in the extent of the conversion tracts in these gene families reflects a tendency for these recombination events to become localized over time to the protein encoding regions of the major exons.  相似文献   

2.
3.
Analyses of 55 individual and 31 concatenated protein data sets encoded in Reclinomonas americana and Marchantia polymorpha mitochondrial genomes revealed that current methods for constructing phylogenetic trees are insufficiently sensitive (or artifact-insensitive) to ascertain the sister of mitochondria among the current sample of eight alpha-proteobacterial genomes using mitochondrially-encoded proteins. However, Rhodospirillum rubrum came as close to mitochondria as any alpha-proteobacterium investigated. This prompted a search for methods to directly compare eukaryotic genomes to their prokaryotic counterparts to investigate the origin of the mitochondrion and its host from the standpoint of nuclear genes. We examined pairwise amino acid sequence identity in comparisons of 6,214 nuclear protein-coding genes from Saccharomyces cerevisiae to 177,117 proteins encoded in sequenced genomes from 45 eubacteria and 15 archaebacteria. The results reveal that approximately 75% of yeast genes having homologues among the present prokaryotic sample share greater amino acid sequence identity to eubacterial than to archaebacterial homologues. At high stringency comparisons, only the eubacterial component of the yeast genome is detectable. Our findings indicate that at the levels of overall amino acid sequence identity and gene content, yeast shares a sister-group relationship with eubacteria, not with archaebacteria, in contrast to the current phylogenetic paradigm based on ribosomal RNA. Among eubacteria and archaebacteria, proteobacterial and methanogen genomes, respectively, shared more similarity with the yeast genome than other prokaryotic genomes surveyed.  相似文献   

4.
We describe a vertebrate hyaluronan and proteoglycan binding link protein gene family (HAPLN), consisting of four members including cartilage link protein. The encoded proteins share 45-52% overall amino acid identity. In contrast to the average sequence identity between family members, the sequence conservation between vertebrate species was very high. Human and mouse link proteins share 81-96% amino acid sequence identity. Two of the four link protein genes (HAPLN2 and HAPLN4) were restricted in expression to the brain/central nervous system, while one of the four genes (HAPLN3) was widely expressed. Genomic structures revealed that all four HAPLN genes were similar in exon-intron organization and were also similar in genomic organization to the 5' exons for the CSPG core protein genes. Strikingly, all four HAPLN genes were located immediately adjacent to the four CSPG core protein genes creating four pairs of CSPG-HAPLN genes within the mammalian genome. Furthermore, the two brain-specific HAPLN genes (HAPLN2 and HAPLN4) were physically linked to the brain-specific CSPG genes encoding brevican and neurocan, respectively. The tight physical association of the HAPLN and CSPG genes supports a hypothesis that the first HAPLN gene arose as a partial gene duplication event from an ancestral CSPG gene. There is some degree of coordinated expression of each gene pair. Collectively, the four HAPLN genes are expressed by most tissue types, reflecting the fundamental importance of the hyaluronan-dependent extracellular matrix to tissue architecture and function in vertebrate species. Comparison of the genomic structures for the HAPLN, CSPG genes and other members of the link module superfamily provide strong support for a common evolutionary origin from an ancestral gene containing one link module encoding exon.  相似文献   

5.
The nucleotide sequence of the 1794-bp fragment containing the crtD gene from Rhodobacter sphaeroides 2.4.1 encoding for methoxyneurosporene dehydrogenase has been determined. A 63% sequence identity was found when compared with the nucleotide sequence of the crtD gene from Rhodobacter capsulatus. A putative regulatory palindromic motif present in the crtD gene from R. capsulatus also exists in this gene from R. sphaeroides. The translated open reading frame of the crtD gene of R. sphaeroides has identified a polypeptide of 495 amino acids which shares a 56% sequence identity with the same CrtD protein of R. capsulatus. The N- and C-termini of these CrtD proteins present a high degree of similarity with the N- and C-termini of other carotenoid dehydrogenases including those encoded by crtI genes. This is in good agreement with the previously hypothesized homology between CrtI and CrtD proteins.  相似文献   

6.
The complete nucleotide sequence of the Neurospora crassa trp-3 gene-encoding tryptophan synthetase has been determined; we present an analysis of its structure. A comparison of the deduced amino acid sequence of the trp-3 polypeptide with its homologs in Saccharomyces cerevisiae (encoded by the TRP5 gene) and Escherichia coli (encoded by the trpA and trpB genes) shows that the A and B domains (amino acid segments homologous to the trpA and trpB polypeptides, respectively) of the N. crassa and yeast polypeptides are in the same order (NH2-A-B-COOH). This arrangement is the reverse of the gene order characteristic of all prokaryotes that have been examined. N. crassa tryptophan synthetase has strong homology to the yeast TRP5 polypeptide (A domains have 54% identity; B domains have 75% identity), and somewhat weaker homology to the E. coli trpA and trpB polypeptides (A domains have 31% identity; B domains have 50% identity). The two domains of the N. crassa polypeptide are linked by a connector of 54-amino acid residues that has less than 25% identity to the 45-residue connector of the yeast polypeptide, although secondary structure analysis predicts both connectors would be alpha-helical. In contrast to the yeast TRP5 gene, which has no introns, the trp-3 coding region is interrupted by two introns 77 and 71 nucleotides in length. Both introns are located near the 5'-end of the gene and therefore not near the segment encoding the connector.  相似文献   

7.
During the synthesis of fatty acids and their utilization in plastids, fatty acyl moieties are linked to acyl carrier protein (ACP). In contrast to previously cloned organ-specific ACP isoforms, we have now isolated a cDNA clone for a potentially constitutive ACP isoform from a spinach root library. Identity between the amino acid sequence encoded by this cDNA and N-terminal sequence data for ACP-II protein from spinach leaf indicates that the root cDNA encodes ACP-II. The deduced amino acid sequence for ACP-II shows 62% identity with spinach leaf ACP-I. Southern analysis suggests that multiple ACP genes or pseudogenes occur in the spinach genome. High-stringency northern blot analysis and RNase protection studies confirm that, within the region encoding the mature ACP-II, the cloned ACP sequence is expressed in leaves and seeds as well as in roots. Quantitative RNase protection data indicate that the ratio of ACP-I and ACP-II mRNA sequences in leaf is similar to the ratio of the two proteins.  相似文献   

8.
We have compared the recF genes from Escherichia coli K-12, Salmonella typhimurium, Pseudomonas putida, and Bacillus subtilis at the DNA and amino acid sequence levels. To do this we determined the complete nucleotide sequence of the recF gene from Salmonella typhimurium and we completed the nucleotide sequence of recF gene from Pseudomonas putida begun by Fujita et al. (1). We found that the RecF proteins encoded by these two genes contain respectively 92% and 38% amino acid identity with the E. coli RecF protein. Additionally, we have found that the S. typhimurium and P. putida recF genes will complement an E. coli recF mutant, but the recF gene from Bacillus subtilis [showing about 20% identity with E. coli (2)] will not. Amino acid sequence alignment of the four proteins identified four highly conserved regions. Two of these regions are part of a putative phosphate binding loop. In one region (position 36), we changed the lysine codon (which is essential for ATPase, GTPase and kinase activity in other proteins having this phosphate binding loop) to an arginine codon. We then tested this mutation (recF4101) on a multicopy plasmid for its ability to complement a recF chromosomal mutation and on the E. coli chromosome for its effect on sensitivity to UV irradiation. The strain with recF4101 on its chromosome is as sensitive as a null recF mutant strain. The strain with the plasmid-borne mutant allele is however more UV resistant than the null mutant strain. We conclude that lysine-36 and possibly a phosphate binding loop is essential for full recF activity. Lastly we made two chimeric recF genes by exchanging the amino terminal 48 amino acids of the S. typhimurium and E. coli recF genes. Both chimeras could complement E. coli chromosomal recF mutations.  相似文献   

9.
The ubiquitin-activating enzyme (E1) gene family in Arabidopsis thaliana   总被引:6,自引:1,他引:5  
Conjugation of multiple ubiquitins serves as a committed step in the degradation of a variety of intracellular eukaryotic proteins by the 26S proteasome. Conjugates are formed via a three-enzyme cascade; the initial step requires ubiquitin-activating enzyme (E1), which couples ubiquitin activation to ATP hydrolysis. Previously, we showed that many higher plants contain multiple E1 proteins and described several E1 genes from wheat. To facilitate understanding of the roles of the different plant E1s, we characterized the E1 gene and protein family from Arabidopsis thaliana . Arabidopsis E1s are encoded by two genes ( AtUBA1 and AtUBA2 ) that synthesize approximately 123-kDa proteins with 81% amino acid sequence identity to each other and 44–75% sequence identity with confirmed E1s from other organisms. Like other E1 proteins, AtUBA1 and 2 contain a cysteine residue in the putative active site for forming the ubiquitin thiol-ester intermediate. Enzymatic analysis of the corresponding proteins expressed in Escherichia coli demonstrated that both proteins activate ubiquitin in an ATP-dependent reaction and transfer the activated ubiquitin to a variety of Arabidopsis E2s with near equal specificity. Expression studies by quantitative RT-PCR and histochemistry with transgenic plants containing AtUBA promoter-β-glucuronidase-coding region fusions showed that the AtUBA1 and 2 genes are co-expressed in most, if not all, Arabidopsis tissues and cells. Collectively, the data indicate that E1 proteins, and presumably the rest of the ubiquitin pathway, are present throughout Arabidopsis . They also show that the AtUBA1 and 2 genes are not differentially expressed nor do they encode E1s with dramatically distinct enzymatic properties.  相似文献   

10.
A 65-kb region of DNA from Streptomyces viridochromogenes Tü57, containing genes encoding proteins involved in the biosynthesis of avilamycins, was isolated. The DNA sequence of a 6.4-kb fragment from this region revealed four open reading frames (ORF1 to ORF4), three of which are fully contained within the sequenced fragment. The deduced amino acid sequence of AviM, encoded by ORF2, shows 37% identity to a 6-methylsalicylic acid synthase from Penicillium patulum. Cultures of S. lividans TK24 and S. coelicolor CH999 containing plasmids with ORF2 on a 5.5-kb PstI fragment were able to produce orsellinic acid, an unreduced version of 6-methylsalicylic acid. The amino acid sequence encoded by ORF3 (AviD) is 62% identical to that of StrD, a dTDP-glucose synthase from S. griseus. The deduced amino acid sequence of AviE, encoded by ORF4, shows 55% identity to a dTDP-glucose dehydratase (StrE) from S. griseus. Gene insertional inactivation experiments of aviE abolished avilamycin production, indicating the involvement of aviE in the biosynthesis of avilamycins.  相似文献   

11.
Mouse glandular kallikreins are encoded by a family of closely linked genes which are located on chromosome 7 at a site corresponding to the genetically defined Tam-1, Prt-4, and Prt-5 loci. We have characterized 24 kallikrein genes by genomic cloning and restriction mapping of 310 kilobase pairs of BALB/c mouse DNA. Most of these genes are highly homologous, have the same exon/intron organization, and are linked in clusters of up to 11 genes. Partial sequence analysis of the kallikrein genes has facilitated identification of those members of the family for which protein sequence data exist and assignment of those which are pseudogenes or encode proteins of unknown function. We find that a maximum of 14 mouse kallikrein genes have the potential to encode functional proteins.  相似文献   

12.
Genes encoding fowlpox virus (FWPV) structural proteins have been identified mainly by sequence homology with those from vaccinia virus (VACV), but little is known about the encoded proteins. Production of monoclonal antibodies (MAbs) against Poxine and HP1-440 (Munich) clone FP9 allowed the identification of three immunodominant FWPV proteins: the 39-kDa core protein (encoded by FPV168, homologous to VACV A4L), a 30- and 35-kDa protein doublet, and an abundant 63-kDa protein. The 30- and 35-kDa proteins are nonglycosylated, antigenically related proteins present in the intracellular mature virus membrane and localizing closely with the viral factories. N-terminal sequencing identified the 35-kDa protein as encoded by FPV140 (the FWPV homolog of VACV H3L). The 63-kDa protein forms covalently linked dimers and oligomers. It remained mainly insoluble upon detergent treatment of purified virus but did not localize closely with the viral factory. N-terminal sequencing was unsuccessful, suggesting N-terminal blocking. CNBr digestion generated a peptide encoded by FPV191, predicted to encode one of two FWPV A-type inclusion (ATI) proteins. The characteristics of the 63-kDa protein were inconsistent with published observations on cowpox or VACV ATI proteins (it appears to be essential). The 63-kDa protein, however, shares characteristics with both VACV p4c virus occlusion and 14-kDa fusion proteins. Gene assignment at the poxvirus ATI locus (between VACV A24R and A28L) is complicated by sequence redundancies and variations, often due to deletions and multiple frameshift mutations. The identity of FPV191 in relation to genes at this locus is discussed.  相似文献   

13.
14.
C. Yue  M. Osier  C. P. Novotny    R. C. Ullrich 《Genetics》1997,145(2):253-260
This paper concerns the manner in which combinatorial mating proteins of the fungus, Schizophyllum commune, recognize one another to form complexes that regulate target gene expression. In Schizophyllum, tightly linked Y and Z mating-type genes do not promote development in the combinations present in haploid strains (i.e., self combinations). When the Y and Z genes from two different mating types are brought together by the fusion of two haploid cells, the Y and Z proteins from different mating types recognize one another as nonself, form a complex and activate development. Several Y and Z alleles are present in the population and all nonself combinations of Y and Z alleles are equally functional. We have made chimeric genes among Y1, Y3, Y4 and Y5 and examined their mating-type specificities by transformation and mating tests. These studies show that the specificity of Y protein recognized by Z protein is encoded within a short region of N-terminal amino acids. The critical region is not precisely the same in each Y protein and in each Y-Z protein interaction. For Y3 protein compared with Y4 protein, the critical residues are in an N-terminal region of 56 amino acids (residues 17-72), with 40% identity and 65% similarity. Two-hybrid studies show that: the first 144 amino acids of Y4 protein are sufficient to bind Z3 and Z5 proteins, but not Z4 protein, and proteins deleted of the Y4 specificity region do not bind Z3, Z4 or Z5 protein. Thus the specificity determinant of the Y protein is essential for protein-protein recognition, Y-Z protein binding and mating activity.  相似文献   

15.
The nucleotide sequence of a Salmonella typhimurium DNA segment of 549 base pairs which encompasses the operator-promoter of the pts operon, the entirety of the ptsH gene, encoding HPr of the phosphotransferase system (PTS), the first 29 nucleotides of the ptsI gene, encoding Enzyme I of the PTS, and the intercistronic region between the ptsH and ptsI genes was determined and compared with the corresponding sequence from Escherichia coli (De Reuse et al., 1985). The two sequences showed 91% overall identity, with some regions showing sequence conservation and others exhibiting relative divergence. Two open reading frames were identified in both species: one encoded HPr on the 'sense' strand (255 nucleotides; 12 nucleotide differences, no amino acid differences); the other, on the anti-sense strand, consisted of 291 nucleotides (13 nucleotide differences, 13 amino acid differences). While HPr bears a net negative charge, the putative protein encoded by the open reading frame on the anti-sense strand is strongly basic. Computer analyses of HPr proteins from five different bacterial genera revealed four regions which show strong sequence identity and therefore are presumed to be critical for maintenance of biological activity. Two of these regions were specific to Gram-positive bacteria. Proposed functions for each of these regions are discussed. Relative evolutionary distances between the HPr proteins were also computed.  相似文献   

16.
We have isolated and structurally characterized genomic DNA and cDNA sequences encoding ribulose-1,5-bisphosphate carboxylase/oxygenase (Rbu-P2 carboxylase) activase from barley (Hordeum vulgare L.). Three Rbu-P2 carboxylase activase (Rca) polypeptides are encoded in the barley genome by two closely linked, tandemly oriented nuclear genes (RcaA and RcaB); cDNAs encoding each of the three Rbu-P2 carboxylase activase polypeptides were isolated from cDNA libraries of barley leaf mRNA. RcaA produces two mRNAs, which encode polypeptides of 42 and 46 kDa, by an alternative splicing mechanism identical to that previously reported for spinach and Arabidopsis Rca genes (Werneke, J.M., Chatfield, J.M., and Ogren, W. L. (1989) Plant Cell 1, 815-825). RcaB is transcribed to produce a single mRNA, which encodes a mature peptide of 42 kDa. Genomic Southern blots indicate that RcaA and RcaB represent the entire Rbu-P2 carboxylase activase gene family in barley. The genes share 80% nucleotide sequence identity, and the 42-kDa polypeptides encoded by RcaA and RcaB share 87% amino acid sequence identity. Coding regions of the two barley Rca genes are separated by 1 kilobase pair of flanking DNA. DNA sequence motifs similar to those thought to control light-regulated gene expression in other nuclear-encoded plastid polypeptide genes are found at the 5' end of both barley Rca genes. Probes specific to three mRNAs were used to determine the relative contribution each species makes to the total Rca mRNA pool.  相似文献   

17.
PnPMA1, a gene encoding a putative P-type plasma membrane H(+)-ATPase, has been isolated by differential screening of a Phytophthora nicotianae germinated cyst cDNA library. PnPMA1 is differentially expressed during pathogen asexual development with a more than 10-fold increase in expression in germinated cysts, the stage at which plant infection is initiated, compared to vegetative or sporulating hyphae or motile zoospores. PnPMA1 proteins are encoded by two closely linked genes that have no introns and encode identical proteins having 1,068 amino acid residues and a molecular mass of 116.3kDa. PnPMA1 shows moderate identity (30-50%) to plant and fungal plasma membrane H(+)-ATPases and weak identity to other P-type cation-transporting ATPases. PnPMA1 contains all the catalytic domains characteristic of H(+)-ATPases but also has a distinct domain of approximately 155 amino acids that forms a putative cytoplasmic loop between transmembrane domains 8 and 9, a feature that is not present in PMA1 proteins from other organisms. Polyclonal antibodies raised against the 155 residue domain were shown by immunogold labelling to react with a protein in the plasma membrane of P. nicotianae germinated cysts but not with the plasma membrane of motile zoospores. Genetic complementation experiments demonstrated that the P. nicotianae PnPMA1 is functional in yeast, Saccharomyces cerevisiae.  相似文献   

18.
19.
Opium poppy latex contains a group of laticifer-specific, low-molecular-weight polypeptides called major latex proteins (MLPs). Here we describe a new member of the MLP gene family (gMLP 22) which shares 79.6% nucleotide and 84.6% amino acid sequence identity with previously isolated clones. DNA gel blot analysis indicates that the MLPs are encoded by at least eight genes which are divided into two subfamilies. The expression pattern for each subfamily, as seen in RNA gel blots, appears to be identical and laticifer-specific.  相似文献   

20.
To find potential targets of novel antimicrobial agents, we identified essential genes of Streptococcus pneumoniae using comparative genomics and allelic replacement mutagenesis. We compared the genome of S. pneumoniae R6 with those of Bacillus subtilis, Enterococcus faecalis, Escherichia coli, and Staphylococcus aureus, and selected 693 candidate target genes with > 40% amino acid sequence identity to the corresponding genes in at least two of the other species. The 693 genes were disrupted and 133 were found to be essential for growth. Of these, 32 encoded proteins of unknown function, and we were able to identify orthologues of 22 of these genes by genomic comparisons. The experimental method used in this study is easy to perform, rapid and efficient for identifying essential genes of bacterial pathogens.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号