首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
Full-length coding sequences of two novel human cadherin cDNAs were obtained by sequence analysis of several EST clones and 5' and 3' rapid amplification of cDNA ends (RACE) products. Exons for a third cDNA sequence were identified in a public-domain human genomic sequence, and the coding sequence was completed by 3' RACE. One of the sequences (CDH7L1, HGMW-approved gene symbol CDH7) is so similar to chicken cadherin-7 gene that we consider it to be the human orthologue. In contrast, the published partial sequence of human cadherin-7 is identical to our second cadherin sequence (CDH7L2), for which we propose CDH19 as the new name. The third sequence (CDH7L3, HGMW-approved gene symbol CDH20) is almost identical to the mouse "cadherin-7" cDNA. According to phylogenetic analysis, this mouse cadherin-7 and its here presented human homologue are most likely the orthologues of Xenopus F-cadherin. These novel human genes, CDH7, CDH19, and CDH20, are localized on chromosome 18q22-q23, distal of both the gene CDH2 (18q11) encoding N-cadherin and the locus of the six desmosomal cadherin genes (18q12). Based on genetic linkage maps, this genomic region is close to the region to which Paget's disease was linked. Interestingly, the expression patterns of these three closely related cadherins are strikingly different.  相似文献   

3.
We have determined the nucleotide sequence of 4508 base pairs of human genomic DNA which contain the human serine esterase gene from cytotoxic T lymphocytes (SECT) (equivalent to the 1-3E cDNA clone) and include 879 bp of 5' flanking DNA and 393 bp of 3' flanking DNA. The gene consists of five exons of 88, 148, 136, 261, and 257 nucleotides separated by four introns of 1043, 455, 205, and 643 nucleotides. The location of introns with respect to protein coding sequences in the SECT gene is identical to that of the human cathepsin G and murine granzyme B genes. Comparison of SECT gene exonic sequences to murine granzyme B-F cDNA sequences indicates similarities of 75 and 72% for granzymes B and C and 61, 59, and 61% for granzymes D, E, and F, respectively. The 5' flanking sequence of the SECT gene showed similarity only to the 5' flanking sequence of the murine granzyme B gene, indicating that these genes are homologous. Comparison of the SECT gene sequence to the human cathepsin G sequence indicated no similarity in the 5' flanking DNA although the exonic sequences show 64% sequence similarity overall and 45% sequence similarity in the respective 3' untranslated regions. These similarities suggest that the SECT and cathepsin G genes are members of the same family of serine protease genes. Evidence from high and low stringency Southern transfer analysis of human genomic DNA indicates the presence of another gene of at least 85% sequence similarity to the SECT gene.  相似文献   

4.
5.
Little is known about the primary amino acid structure of human cartilage link protein (CRTL1). We screened a human genomic library with a cDNA encoding the 3' untranslated region and the adjoining B1 domain of chicken link protein. One clone was isolated and characterized. A 3.5-kb EcoRI-KpnI fragment from this genomic clone that contains the human B1 exon was used to map the gene to chromosome 5q13----q14.1. The same fragment was used to screen a cDNA library prepared from mRNA of Caco-2, a human colon tumor cell line. Two overlapping clones were isolated and shown to encode all of CRTL1. The deduced amino acid sequence is 354 residues long. The amino acid sequence shows a striking degree of identity to the porcine (96%), rat (96%), and chicken (85%) link protein sequences. Furthermore, there is greater than 86% homology between the 3' untranslated region of the genes encoding human and porcine link proteins. These results indicate that there has been strong evolutionary pressure against changes in the coding and 3' untranslated regions of the gene encoding cartilage link protein.  相似文献   

6.
7.
Extragenic suppressors of +1 frameshift mutations in proline codons map in genes encoding two major proline tRNA isoacceptors. We have shown previously that one isoacceptor encoded by the SUF2 gene (chromosome 3) contains no intervening sequence. SUF2 suppressor mutations result from the base insertion of a G within a 3'-GGA-5' anticodon, allowing the tRNA to read a 4-base code word. In this communication we describe suppressor mutations in genes encoding a second proline tRNA isoacceptor (wild-type anticodon 3'-GGU-5') that result in a novel mechanism for translation of a 4-base genetic code word. The genes that encode this isoacceptor include SUF7 (chromosome 13), SUF8 (chromosome 8), trn1 (chromosome 1), and at least two additional unmapped genes, all of which contain an intervening sequence. We show that suppressor mutations in the SUF7 and SUF8 genes result in G-to-U base substitutions at position 39 that disrupted the normal G . C base pairing in the last base pair of the anticodon stem adjacent to the anticodon loop. These anticodon stem mutations might alter the size of the anticodon loop and permit the use of a 3'-GGGU-5' sequence within the loop to read 4-base proline codons. Uncertainty regarding the exact structure of the mature suppressor tRNAs results from the possibility that anticodon stem mutations might affect sites of intervening sequence removal. The possible role of the intervening sequence in the generation of mature suppressor tRNA is discussed. Besides an analysis of suppressor tRNA genes, we have extended previous observations of the apparent relationship between tRNA genes and repetitive delta sequences found as solo elements or in association with the transposable element TY1. Hybridization studies and a computer analysis of the DNA sequence surrounding the SUF7 gene revealed two incomplete, inverted delta sequences that form a stem and loop structure located 165 base pairs from the 5' end of the tRNA gene. In addition, sequences beginning 164 base pairs from the 5' end of the trn1 gene also exhibit partial homology to delta. These observations provide further evidence for a nonrandom association between tRNA genes and delta sequences.  相似文献   

8.
9.
10.
11.
12.
Eggshell protein genes of Schistosoma mansoni that encode a 14 kDa protein have been shown to be highly conserved and expressed in a sex-, tissue-, and temporal-specific manner. To initiate studies on the eggshell protein genes of S. haematobium, a cDNA probe, pSMf 61-46, representing a S. mansoni eggshell protein mRNA was used to screen a S. haematobium genomic library. Of the seven independent recombinant clones isolated, two (lambda SH 2-1 and lambda SH 6-1) were analyzed and compared to those of S. mansoni. lambda SH 2-1 and lambda SH 6-1 each contain a different genomic copy of the gene encoding a 19.8 and 17.6 kDa protein, respectively. This is due to an additional 78 bp present in the coding region of lambda SH 2-1 relative to lambda SH 6-1. The rest of the coding sequences are identical, and the 5' and 3' untranslated regions are nearly identical. The deduced amino acid sequences of S. haematobium eggshell proteins are very rich in glycine (47 and 50%) when compared to 43.5% glycine in the protein encoded by S. mansoni. Long stretches of glycines, as many as 15 in a row, occur in the S. haematobium sequence. DNA comparison of the eggshell protein genes of the two schistosome species yielded an overall homology of 83.1%. The homology is much higher in the 5' and 3' untranslated regions than in the protein-coding regions. Genomic clones of both species contained second open reading frames, which appeared to be kept open as a consequence of the amino acid composition of the other. There are no introns in S. haematobium or S. mansoni eggshell protein genes, and the genomic Southern data indicated a similar arrangement of these genes in the genome of both species. Primer extension experiments and dideoxynucleotide sequencing of the RNA determined the mRNA cap site sequence as ATCAT and ATCAC in lambda SH 2-1 and lambda SH 6-1, respectively. Northern blot analysis determined the size of the mRNA to be about 1.0 kp. Expression of the RNA from these genes appears to be regulated in a manner similar to the corresponding genes in S. mansoni. mRNA is found only in mature females and first appears at 70 days after infection of hamsters. DNA sequence comparisons of the 5' flanking regions of S. haematobium and S. mansoni eggshell protein genes to each other and to those of silkmoth and Drosophila revealed several short sequence elements that are shared.(ABSTRACT TRUNCATED AT 400 WORDS)  相似文献   

13.
The gene encoding the human cellular retinol-binding protein (CRBP) has been isolated from genomic libraries and its structure determined. Only one copy of the gene is present in the human genome. We have located the CRBP gene to segment 3p11-3qter on human chromosome 3 using hybridizations to mouse-human, rat-human and hamster-human cell hybrids. The gene harbors four exons encoding 24, 59, 33, and 16 amino acid residues respectively. The second intervening sequence alone occupies 19 kb of the 21 kb of the CRBP gene. The nucleotide sequence of the gene has been determined with the exception of the second intron. The positions of the introns agree with those in the rat CRBPII, the rat liver fatty-acid-binding protein and the mouse adipose P2 protein genes encoding molecules belonging to the same protein family as CRBP. In contrast to the other sequenced members of this family the promoter of the CRBP gene resembles those found in the 'housekeeping' genes in that it is (G + C)-rich, contains multiple copies of the CCGCCC sequence and lacks TATA box. A 9-bp homology containing the core sequence of the simian virus 40 enhancer repeat was found in the 5' upstream region. A genomic Southern blot probed with CRBP cDNA revealed hybridizing bands in restricted chicken and frog DNA.  相似文献   

14.
We describe the isolation and characterization of the gene encoding the mouse high affinity Fc receptor Fc gamma RI. Using a mouse cDNA Fc gamma RI probe four unique overlapping genomic clones were isolated and were found to encode the entire 9 kb of the mouse Fc gamma RI gene. Sequence analysis of the gene showed that six exons account for the entire Fc gamma RI cDNA sequences including the 5'- and 3'-untranslated sequences. The first and second exons encode the signal peptide; exons 3, 4, and 5 encode the extracellular Ig binding domains; and exon 6 encodes the transmembrane domain, the cytoplasmic region, and the entire 3'-untranslated sequence. This exon pattern is similar to Fc gamma RIII and Fc epsilon RI but differs from the related Fc gamma RII gene which contains 10 exons and encodes the b1 and b2 Fc gamma RII. Southern blot analysis had shown that the mouse Fc gamma RI gene is a single copy gene with no RFLP in inbred strains of mice, but analysis of an intersubspecies backcross of mice showed that unlike other mouse FcR genes which are on mouse chromosome 1 the locus encoding Fc gamma RI, termed Fcg1, is located on chromosome 3. Interestingly, the Fcg1 locus is located near the end of a region with known linkage homology to human chromosome 1. Analysis of human x rodent somatic cell hybrid cell lines indicates that the human FCG1 locus encoding the human Fc gamma RI maps to chromosome I and therefore possibly linked to other FcR genes on this chromosome. These results suggest that the linkage relationships among these genes in the human genome are not preserved in the mouse.  相似文献   

15.
16.
The nucleotide sequence of the protective antigen (PA) gene from Bacillus anthracis and the 5' and 3' flanking sequences were determined. PA is one of three proteins comprising anthrax toxin; and its nucleotide sequence is the first to be reported from B. anthracis. The open reading frame (ORF) is 2319 bp long, of which 2205 bp encode the 735 amino acids of the secreted protein. This region is preceded by 29 codons, which appear to encode a signal peptide having characteristics in common with those of other secreted proteins. A consensus TATAAT sequence was located at the putative -10 promoter site. A Shine-Dalgarno site similar to that found in genes of other Bacillus sp. was located 7 bp upstream from the ATG start codon. The codon usage for the PA gene reflected its high A + T (69%) base composition and differed from those of genes for bacterial proteins from most other sequences examined. The TAA translation stop codon was followed by an inverted repeat forming a potential termination signal. In addition, a 192-codon ORF of unknown significance, theoretically encoding a 21.6-kDa protein, preceded the 5' end of the PA gene.  相似文献   

17.
We have isolated and structurally characterized genomic DNA and cDNA sequences encoding ribulose-1,5-bisphosphate carboxylase/oxygenase (Rbu-P2 carboxylase) activase from barley (Hordeum vulgare L.). Three Rbu-P2 carboxylase activase (Rca) polypeptides are encoded in the barley genome by two closely linked, tandemly oriented nuclear genes (RcaA and RcaB); cDNAs encoding each of the three Rbu-P2 carboxylase activase polypeptides were isolated from cDNA libraries of barley leaf mRNA. RcaA produces two mRNAs, which encode polypeptides of 42 and 46 kDa, by an alternative splicing mechanism identical to that previously reported for spinach and Arabidopsis Rca genes (Werneke, J.M., Chatfield, J.M., and Ogren, W. L. (1989) Plant Cell 1, 815-825). RcaB is transcribed to produce a single mRNA, which encodes a mature peptide of 42 kDa. Genomic Southern blots indicate that RcaA and RcaB represent the entire Rbu-P2 carboxylase activase gene family in barley. The genes share 80% nucleotide sequence identity, and the 42-kDa polypeptides encoded by RcaA and RcaB share 87% amino acid sequence identity. Coding regions of the two barley Rca genes are separated by 1 kilobase pair of flanking DNA. DNA sequence motifs similar to those thought to control light-regulated gene expression in other nuclear-encoded plastid polypeptide genes are found at the 5' end of both barley Rca genes. Probes specific to three mRNAs were used to determine the relative contribution each species makes to the total Rca mRNA pool.  相似文献   

18.
19.
Isolation and expression of cDNA encoding the murine homologues of CD1.   总被引:5,自引:0,他引:5  
The cDNA encoding the murine CD1.1 and CD1.2 gene products were isolated and their complete nucleotide sequence was determined. The nucleotide sequence and genomic organization of these molecules were similar to human CD1. The sequences in the alpha 1- alpha 3 domains were almost identical to previously reported genomic clones from a different strain, indicating limited polymorphism among these molecules. The predicted amino acid sequence in the transmembrane region and in the cytoplasmic tail was identical for CD1.1 and CD1.2. The two cDNA were also homologous in the 5' untranslated region but diverged in the 3' untranslated region. In contrast to human CD1, which is expressed at high levels in thymus, the expression of CD1 message in murine thymus was not detected in either thymus leukemia Ag positive or negative strains. Cell expressing murine CD1.1 were generated after transfer of the CD1.1 cDNA into murine cell lines. Immunoprecipitation with a rat anti-mouse CD1.1 mAb showed that the transfected CD1 was expressed on the cell surface as a beta 2-microglobulin-linked heterodimer. These results demonstrate that the murine and human CD1 genes, although encoding homologous transmembrane glycoproteins, are expressed in distinct tissues and may serve different functions.  相似文献   

20.
Ten genomic DNA clones encoding the human leukocyte common Ag (LCA, CD45) gene were isolated by screening human genomic DNA libraries with LCA cDNA probes. One genomic DNA clone contains the promoter region and the first two exons, as determined by primer extension analyses and S1 nuclease protection studies as well as nucleotide sequence determination. The first exon does not encode a peptide, while the second exon contains the initiation ATG codon and encodes the signal peptide. The other nine genomic DNA clones, which are separated from the first genomic clone by an unknown distance, are connected and span a total of 73 kb. The nine connected genomic clones encode a total of 31 exons. The 33 exons encoded by these 10 genomic clones account for the entire cDNA sequences including the 5' and 3' untranslated sequences. Exon 3 and exons 7 through 15 encode the extracellular domain sequences that are common to all LCA isoforms. Differential usage of exons 4, 5, and 6, generates at least five distinct LCA isoforms. Exon 16 encodes the transmembrane peptide. The cytoplasmic region of the leukocyte common antigens is composed of two homologous domains. Exons 17 through 24 encode the first domain, and exons 25 through 32 encode the second domain. The comparison of these exons indicated that the homologous domains were generated by duplication of several exons. The most 3' exon (exon 33) encodes the carboxy terminus of the LCA molecules and includes the entire 3' untranslated sequence.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号