Similar literature
20 similar articles found
1.
2.
3.
M Levine  G M Rubin  R Tjian 《Cell》1984,38(3):667-673
Several human DNA sequences were isolated by virtue of homology to a highly conserved region that has been identified in a number of homeotic genes in Drosophila. Structural analysis of the human DNAs indicates that two separate and distinct regions sharing a high degree of homology with the homeo box sequences of Drosophila are separated by only 5 kb in the human genome. Sequence determination of these regions reveals that both human DNA sequences contain a region capable of coding for 61 amino acids, which shares greater than 90% homology with the peptide sequences specified by the homeo box domain of the Drosophila homeotic genes Antennapedia, fushi tarazu, and Ultrabithorax. By contrast, the human DNA sequences lying outside of the 190 nucleotide homeo box region share virtually no sequence homology, either with the flanking sequences of the other human clones or with flanking regions of the known Drosophila homeotic genes.

4.
The large subunit of eukaryotic ribosomes contains acidic phosphoproteins which are related to L7/L12 from Escherichia coli. In the brine shrimp Artemia these proteins are designated eL12 and eL12'. We have isolated cDNA clones for these proteins from a cDNA bank that was constructed by the use of size-fractionated poly(A)-rich RNA (8-10S fraction) from Artemia and a synthetic oligonucleotide as primer. Clones containing DNA sequences coding for eL12 and eL12' were characterized by hybrid-selected translation and DNA sequencing. The proteins eL12 and eL12' share an identical peptide of 22 amino acids at their carboxy termini whereas the remaining part of the protein shows little sequence homology. The nucleotide sequences show a different codon use for the amino acids in the common carboxy terminus, thereby excluding a common exon coding for this part of both proteins. Despite the differences in amino acid sequence in the major part of eL12 and eL12', the proteins have a considerable degree of homology on the basis of the distribution of hydrophobic and hydrophilic amino acids over the polypeptide chains, in agreement with a related folding and function of both proteins. Relative levels of mRNA coding for eL12, eL12' and elongation factor 1 alpha were determined during the development of Artemia from a dormant cyst to a nauplius. The data show a coordinate expression of the genes for EF-1 alpha and both ribosomal proteins, excluding a differential expression of the genes for these related ribosomal proteins during embryogenesis. Analysis of the gene copy number for eL12 and eL12' indicates the presence of a few genes for each protein.

5.
This paper describes the structure of a 70-kb porcine gene for nuclear factor I, including its promoter region, comprising a total of 11 exons. Different mRNAs that we have isolated as cDNAs from both porcine liver and human HeLa cells presumably are generated from this gene by differential splicing events. One cDNA species from porcine liver that lacks exon 9 carries coding information for a protein of 439 amino acids. The in vitro translated protein displays all the properties of an NFI-like protein with high affinity toward the sequence element TGG(N)6GCCAA, as shown by gel shift analysis, and no or little affinity toward CCAAT box containing sequences. Cotranslation experiments with full-length and truncated variants of the protein demonstrate that it binds as a dimer to its cognate DNA recognition sequence. Its DNA-binding domain which is retained in all cDNA clones was mapped by deletion analysis to the 250 N-terminal amino acids of the protein. No structural homologies are observed between this protein and other known DNA-binding proteins; instead, the protein contains a novel alpha-helical sequence motif consisting of several lysine residues spaced at intervals of seven amino acids which we have termed the "lysine helix". The C-terminal portion of the protein derived from full-length cDNAs encodes a short amino acid sequence which is identical with the heptapeptide repeat CT7 observed in the C-terminal domain of the largest subunits of yeast and mouse RNA polymerase II. This region is removed by differential splicing in some of the NFI/CTF cDNAs and thus may be of functional significance.

6.
7.
8.
The nucleotide sequence of the recA gene of Thiobacillus ferrooxidans has been determined. No SOS box characteristic of LexA-regulated promoters could be identified in the 196-bp region upstream from the coding region. The cloned T. ferrooxidans recA gene was expressed in Escherichia coli from both the lambda pR and lac promoters. It was not expressed from the 2.2-kb of T. ferrooxidans DNA preceding the gene. The T. ferrooxidans recA gene specifies a protein of 346 amino acids that has 66% and 69% homology to the RecA proteins of E. coli and Pseudomonas aeruginosa, respectively. Most amino acids that have been identified as being of functional importance in the E. coli RecA protein are conserved in the T. ferrooxidans RecA protein. Although some amino acids that have been associated with proteolytic activity have been substituted, the cloned protein has retained protease activity towards the lambda and E. coli LexA repressors.

9.
10.
We have isolated and determined the nucleotide sequence and genomic organization of the genes encoding Ly-3.1 and Ly-3.2. These genes span approximately 14 kb on chromosome 6 and consist of six exons and five introns. The exons correlate roughly with the putative functional domains, namely, a leader exon, a variable and joining region-like exon, a hinge region-like exon, a transmembrane exon, and two intracytoplasmic exons. There is no intervening sequence between V- and J-like gene segments, indicating that rearrangement is not necessary for the expression of the Ly-3 gene. In the 5'-flanking region there is neither a "TATA" box nor a "CAAT" box; however, three "GC" boxes are located upstream of the ATG initiator codon. There are short stretches of sequence homologous to 5'-flanking sequences of the Ly-2 gene. In addition, the sequence CTCTGTGGCA at -748 exhibits homology to the enhancer core sequence of the human Ig H chain and TCR genes. Comparison of the nucleotide sequence corresponding to the extracellular portion between Ly-3.1 and Ly-3.2 revealed a single base difference which results in an amino acid substitution. Therefore it is likely that this amino acid difference is responsible for the previously defined Ly-3 allotypes.

11.
To study the regulation of hair differentiation, a murine genomic clone, gUHS-SER-M16, was isolated that contained two members of the family of serine-rich ultra high sulfur protein genes. One of the genes, gUHS-SER-1, encodes 230 amino acids with 40% cysteine and 23% serine; the other gene, gUHS-SER-2, encodes 223 amino acids with 41% cysteine and 21% serine. The similarity between the two genes is 73%, and both have several 10-amino acid repeats within their coding regions. In the prospective promoter region, there are several regions of similarity including a "TATA" box, with neither gene having a "CAT" box. At the 3' untranslated region, there is no similarity, and thus a fragment from this region was used as a hybridization probe for RNA dot-blots and for in situ hybridizations. The RNA dot-blot showed elevated levels of mRNA during the active phases of hair growth and low levels during the resting phases. In situ hybridizations show that mRNA for the ultra high sulfur protein gene is found during the active phases of the hair cycle not only in the medulla and the inner root sheath of the forming hair but also in upper layers of the epidermis of skin.

12.
B F Lang 《The EMBO journal》1984,3(9):2129-2136
The DNA sequence of the second intron in the mitochondrial gene for subunit 1 of cytochrome oxidase (cox1), and the 3' part of the structural gene have been determined in Schizosaccharomyces pombe. Comparing the presumptive amino acid sequence of the 3' regions of the cox1 genes in fungi reveals similarly large evolutionary distances between Aspergillus nidulans, Saccharomyces cerevisiae and S. pombe. The comparison of exon sequences also reveals a stretch of only low homology and of general size variation among the fungal and mammalian genes, close to the 3' ends of the cox1 genes. The second intron in the cox1 gene of S. pombe contains an open reading frame, which is contiguous with the upstream exon and displays all characteristics common to class I introns. Three findings suggest a recent horizontal gene transfer of this intron from an Aspergillus type fungus to S. pombe. (i) The intron is inserted at exactly the same position of the cox1 gene, where an intron is also found in A. nidulans. (ii) Both introns contain the highest amino acid homology between the intronic unassigned reading frames of all fungi identified so far (70% identity over a stretch of 253 amino acids). However, in the most homologous region, a GC-rich sequence is inserted in the A. nidulans intron, flanked by two direct repeats of 5 bp. The 37-bp insert plus 5 bp of direct repeat amounts to an extra 42 bp in the A. nidulans intron. (iii) TGA codons are the preferred tryptophan codons compared with TGG in all mitochondrial protein coding sequences of fungi and mammalia. (ABSTRACT TRUNCATED AT 250 WORDS)

13.
The mitochondrial genomes of cytoplasmic "petite" (rho-) mutants of Saccharomyces cerevisiae have been used to sequence the cytochrome b gene. A continuous sequence of 6.2 kilobase pairs has been obtained from 71.4 to 80.2 units of the wild type map. This region contains all the cytochrome b mutations previously assigned to the cob1 and cob2 genetic loci. Analysis of the DNA sequence has revealed that in the strain D273-10B, the cytochrome b gene is composed of three exons. The longest exon (b1) codes for the first 252 to 253 amino acids from the NH2-terminal end of the protein. The next two exons (b2 and b3) code for 16 to 18 and 115 to 116 amino acids, respectively. The complete cytochrome b polypeptide chain consists of 385 amino acids. Based on the amino acid composition, the yeast protein has a molecular weight of 44,000. The three exon regions of the cytochrome b gene are separated by two introns. The intron between b1 and b2 is 1414 nucleotides long and contains a reading frame that is continuous with the reading frame of exon b1. This intron sequence is potentially capable of coding for another protein of 384 amino acid residues. The second intron is 733 nucleotides long. This sequence is rich in A + T and includes a G + C cluster that may be involved in processing of the cytochrome b messenger. The organization of the cytochrome b region in S. cerevisiae D273-10B is somewhat less complex than has been reported for other yeast strains in which exon b1 appears to be further fragmented into three smaller exons.

14.
The HPR1 gene has been cloned by complementation of the hyperrecombination phenotype of hpr1-1 strains by using a color assay system. HPR1 is a gene that is in single copy on chromosome IV of Saccharomyces cerevisiae, closely linked to ARO1, and it codes for a putative protein of 752 amino acids (molecular mass, 88 kilodaltons). Computer searches revealed homology (48.8% conserved homology; 24.8% identity) with the S. cerevisiae TOP1 gene in an alpha-helical stretch of 129 amino acids near the carboxy-terminal region of both proteins. The ethyl methanesulfonate-induced hpr1-1 mutation is a single-base change that produces a stop codon at amino acid 559 coding for a protein that lacks the carboxy-terminal TOP1 homologous region. Haploid strains carrying deletions of the HPR1 gene show a slightly reduced mitotic growth rate and extremely high rates of intrachromosomal excision recombination (frequency, 10 to 15%) but have an undetectable effect on rDNA recombination. Double-null mutants hpr1 top1 grow very poorly. We conclude that Hpr1 is a novel eucaryotic protein, mutation of which causes an increase in mitotic intrachromosomal excision recombination, and that it may be functionally related to an activity of the topoisomerase I protein.

15.
A seven-generation family with 30 members affected by highly variable autosomal dominant zonular pulverulent cataracts has been previously described. We have localized the cataracts to a 19-cM interval on chromosome 2q33-q35 including the gamma-crystallin gene cluster. Maximum lod scores are 4.56 (theta=0.02) with D2S157, 3.66 (theta=0.12) with D2S72, and 3.57 (theta=0.052) with CRYG. Sequencing and allele-specific oligonucleotide analysis of the pseudo gammaE-crystallin promoter region from individuals in the pedigree suggest that activation of the gammaE-crystallin pseudo gene is unlikely to cause the cataracts in the family. In addition, base changes in the TATA box but not the Sp1-binding site have been found in unaffected controls and can be excluded as a sole cause of cataracts. In order to investigate the underlying genetic mechanism of cataracts in this family further, exons of the highly expressed gammaC- and gammaD-crystallin genes have been sequenced. The gammaD-crystallin gene shows no abnormalities, but a 5-bp duplication within exon 2 of the gammaC-crystallin gene has been found in one allele of each affected family member and is absent from both unaffected family members and unaffected controls. This mutation disrupts the reading frame of the gammaC-crystallin coding sequence and is predicted to result in the synthesis of an unstable gammaC-crystallin with 38 amino acids of the first "Greek key" motif followed by 52 random amino acids. This finding suggests that the appropriate association of mutant betagamma-crystallins into oligomers is not necessary to cause cataracts and may give us new insights into the genetic mechanism of cataract formation.
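The frameshift reasoning in the abstract above (a 5-bp duplication disrupts the reading frame because 5 is not a multiple of 3) can be illustrated with a minimal Python sketch. The function names and the toy sequence are hypothetical and chosen only for illustration; the sequence is not taken from the gammaC-crystallin gene.

```python
def shifts_reading_frame(indel_length_bp):
    """An insertion or deletion shifts the frame unless its length is a multiple of 3."""
    return indel_length_bp % 3 != 0


def codons(seq):
    """Split a sequence into complete triplets, dropping any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def duplicate_segment(seq, start, length):
    """Return seq with seq[start:start+length] repeated in tandem right after itself."""
    segment = seq[start:start + length]
    return seq[:start + length] + segment + seq[start + length:]


# Hypothetical toy coding sequence (assumption, not real crystallin sequence).
toy = "ATGGCCAAGTTCGGAGACCTG"
mutant = duplicate_segment(toy, 6, 5)  # tandem 5-bp duplication

print(shifts_reading_frame(5))   # a 5-bp change is out of frame
print(codons(toy))
print(codons(mutant))            # codons downstream of the duplication differ
```

Codons upstream of the duplication are unchanged, while every codon downstream is read in a different frame, which is why the predicted product carries unrelated amino acids after the mutation site.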

16.
A Kudo  F Melchers 《The EMBO journal》1987,6(8):2267-2272
The murine gene lambda 5 is selectively expressed in pre-B lymphocytes. Of the three exons encoding lambda 5, exons II and III show strong homologies to immunoglobulin lambda light (L) chain gene segments, i.e. to J lambda intron and exon, and C lambda exon sequences respectively. We have now found, 4.6 kb upstream of lambda 5, another gene composed of two exons which is selectively expressed in pre-B cell lines as a 0.85 kb mRNA potentially coding for a protein of 142 amino acids including a 19 amino acid-long signal peptide. The 5' sequences of this gene show homologies to sequences encoding the variable regions of kappa and lambda L chains and of heavy (H) chains. The deduced amino acid sequence contains the consensus cysteine residues as well as other consensus amino acids at positions which characterize immunoglobulin (Ig) domains. We call the second gene VpreB. The 3' end of VpreB encoding the 26 carboxyl terminal amino acids shows no homology to any known nucleotide sequence. The putative protein encoded by VpreB is a potential candidate for association with the putative protein encoded by lambda 5, and thereby a candidate for association with H chains in pre-B cells. Southern blot analysis of DNA from liver (germ line) and 70Z/3 pre-B cell lines reveals two genes which hybridize to the VpreB gene. We call VpreB1 the gene which is found 5' of lambda 5. The other gene, called VpreB2, which has not yet been located within the genome, shows 97% nucleotide sequence homology to VpreB1 in an area of 1 kb which covers the coding region of the gene. (ABSTRACT TRUNCATED AT 250 WORDS)

17.
The nucleotide sequence of the Escherichia coli dnaC gene and the primary structure of the dnaC protein were determined. The NH2-terminal amino acid sequence of the dnaC protein matched that predicted from the nucleotide sequence of the 735-base pair coding region. The dnaC gene lacks characteristic promoter structures; neither the "Pribnow box" nor the "-35 sequence" was detected within 222 base pairs upstream from the initiator ATG codon. There is, however, a typical Shine-Dalgarno sequence 7-10 base pairs before the ATG codon. An upstream open reading frame, separated by just 2 base pairs from the coding region of dnaC, encodes the COOH-terminal half of the dnaT product (protein i; Masai, H., Bond, M. W., and Arai, K. (1986) Proc. Natl. Acad. Sci. U. S. A. 83, 1256-1260). The dnaC protein contains 245 amino acids with a calculated molecular weight of 27,894 consistent with the observed value (29,000). Similar to dnaG and dnaT, dnaC uses several minor codons; the significance of these minor codons to the low level expression of the protein product in E. coli cells remains to be determined. The in vitro site-directed mutagenesis method was employed to determine the functional region involved in interaction with dnaB protein. The first cysteine residue located in the NH2-terminal region of the dnaC protein (Cys69) was shown to be important for this activity. Overall sequence homology between dnaC protein and lambda P protein, functionally analogous to the dnaC protein in the lambda phage DNA replication, is not extensive. There are, however, several short stretches of homologous regions including the NH2-terminal eight amino acids and the Cys78 region of dnaC protein.

18.
Summary: The genes coding for rRNAs from mustard chloroplasts were mapped within the inverted repeat regions of intact ctDNA and on ctDNA fragments cloned in pBR322. R-loop analysis and restriction endonuclease mapping show that the genes for 16S rRNA map at distances of 17 kb from the junctions of the repeat regions with the large unique region. The genes for 23S rRNA are located at distances of 2.8 kb from the junctions with the small unique region. Genes for 4.5S and 5S rRNA are located in close proximity to the 23S rRNA genes towards the small unique region. DNA sequencing of portions of the 5' terminal third from the mustard 16S rRNA gene shows 96–99% homology with the corresponding regions of the maize, tobacco and spinach chloroplast genes. Sequencing of the region proximal to the 16S rRNA gene reveals the presence of a tRNAVal gene in nearly the same position and with identical sequence as in maize, tobacco and spinach. Somewhat less but still strong homology is also observed for the tDNA Val/16S rDNA intercistronic regions and for the regions upstream of the tRNAVal gene. However, due to many small and also a few larger deletions and insertions in the leader region, common reading frames coding for homologous peptides larger than 44 amino acids cannot be detected; it is therefore unlikely that this region contains a protein coding gene.

19.
20.
Structure of a gene for rat calmodulin
The structural organization of the entire rat calmodulin gene was determined by cloning and sequencing overlapping genomic and cDNA clones from rat genomic and brain cDNA libraries. The intron/exon organization was determined by direct comparison of these sequences. The rat calmodulin gene is 9000 bases long and consists of six exons interrupted by introns of variable sizes. The first intron separates the initiation codon (ATG) from the coding region of the protein. Three out of four intron/exon junctions in the coding region reside in the middle of calcium binding subdomains and do not correlate with the quarterly divided intramolecular homology of the protein. Their positions exactly coincide with those of the corrected version of the chicken calmodulin gene. The rat calmodulin gene harbors a stretch of sequences homologous to a rat middle repetitive "identifier sequence" in the middle of the third intron. Analysis of the immediate 5' upstream region detected a TATA box (TATATATAT) and three C-G boxes (CCGCCC) but not a CAT box (CCAAT). A conserved sequence (GCGCCGCGYCYYGGGGGC) was found at -125 for rat and at -204 for chicken calmodulin genes.
