Similar literature
 Found 20 similar articles (search time: 46 ms)
1.
Complementary and genomic DNA clones corresponding to the human serum amyloid P component (SAP) mRNA have been isolated and analyzed. The nucleotide sequences of the cDNA and the corresponding regions of the genomic SAP DNA reported here were identical, and revealed that after coding for a signal peptide of 19 amino acids and the first two amino acids of the mature SAP protein, there is one small intron of 115 base pairs (bp), followed by a nucleotide sequence coding for the remaining 202 amino acid residues. The SAP gene has an ATATAAA sequence 29 bp upstream from the cap site, but there is no CAAT box-like sequence. A possible polyadenylation signal sequence, ATTAAA, was found to be located 28 bp upstream from the polyadenylation site. A comparison of the genomic SAP DNA sequence with that of human C-reactive protein (CRP) revealed a striking overall homology which was not uniform: several highly conserved regions were bounded by non-homologous regions. This comparison provides further support for the hypothesis that SAP and CRP are products of a gene duplication event.

2.
Genomic DNA sequence for human C-reactive protein   Total citations: 12, self-citations: 0, citations by others: 12
The gene for the prototype acute phase reactant, C-reactive protein, has been isolated from two lambda phage libraries containing inserted human DNA fragments using synthetic oligonucleotide probes. Nucleotide sequence analysis indicates that after coding for a signal peptide of 18 amino acids and the first two amino acids of the mature protein, there is an intron of 278 base pairs followed by the nucleotide sequence for the remaining 204 amino acids. The intron is unusual in that it contains on the positive strand a poly(A) stretch 16 nucleotides long and a poly(GT) region 30 nucleotides long which could adopt the Z-form of DNA. The nucleotide sequence reported here confirms the amino acid sequence of mature C-reactive protein as originally reported except that it codes for an additional 19 amino acids beginning at position 62. Thus DNA sequence analysis predicts that the mature protein consists of 206 amino acids rather than 187 as originally reported. The mRNA cap site is located 104 nucleotides from the start of the signal peptide and there is a 3' noncoding region 1.2 kilobase pairs in length. The gene has a typical promoter containing the sequences TATAAAT and CAAT 29 and 81 base pairs upstream, respectively, of the cap site.
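The residue counts quoted in this abstract can be cross-checked with a few lines of arithmetic. The following is only a minimal sketch: the constant and function names are hypothetical, and only the numbers are taken from the abstract.

```python
# Residue counts quoted in the abstract; all names here are hypothetical.
SIGNAL_PEPTIDE_AA = 18        # signal peptide
MATURE_AA_BEFORE_INTRON = 2   # mature residues encoded before the intron
MATURE_AA_AFTER_INTRON = 204  # mature residues encoded after the intron
ORIGINALLY_REPORTED_AA = 187  # mature length from the earlier protein sequence
ADDITIONAL_AA = 19            # extra residues found beginning at position 62


def crp_residue_counts():
    """Return (mature length from gene, mature length from revision, precursor length)."""
    mature_from_gene = MATURE_AA_BEFORE_INTRON + MATURE_AA_AFTER_INTRON
    mature_from_revision = ORIGINALLY_REPORTED_AA + ADDITIONAL_AA
    precursor = SIGNAL_PEPTIDE_AA + mature_from_gene
    return mature_from_gene, mature_from_revision, precursor


if __name__ == "__main__":
    gene, revised, precursor = crp_residue_counts()
    print("mature protein (2 + 204):", gene)
    print("mature protein (187 + 19):", revised)
    print("precursor with signal peptide:", precursor)
```

Both routes to the mature length agree with the 206 amino acids stated in the abstract; the precursor length is a derived figure, not one the abstract reports.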

3.
The gene responsible for cystic fibrosis, the most common severe autosomal recessive disorder, is located on the long arm of human chromosome 7, region q31-q32. The gene has recently been identified and shown to be approximately 250 kb in size. To understand the structure and to provide the basis for a systematic analysis of the disease-causing mutations in the gene, genomic DNA clones spanning different regions of the previously reported cDNA were isolated and used to determine the coding regions and sequences of intron/exon boundaries. A total of 22,708 bp of sequence, accounting for approximately 10% of the entire gene, was obtained. Alignment of the genomic DNA sequence with the cDNA sequence showed perfect colinearity between the two and a total of 27 exons, each flanked by consensus splice signals. A number of repetitive elements, including the Alu and Kpn families and simple repeats, such as (GT)17, (GATT)7, and (TA)14, were detected in close vicinity of some of the intron/exon boundaries. At least three of the simple repeats were found to be polymorphic in the population. Although an internal amino acid sequence homology could be detected between the two halves of the predicted polypeptide, especially in the regions of the two putative nucleotide-binding folds (NBF1 and NBF2), the lack of alignment of the nucleotide sequence as well as the different positions of the exon/intron boundaries does not seem to support the hypothesis of a recent gene duplication event. To facilitate detection of mutations by direct sequence analysis of genomic DNA, 28 sets of oligonucleotide primers were designed and tested for their ability to amplify individual exons and the immediately flanking sequences in the introns.

4.
5.
6.
7.
8.
9.
Structure and DNA sequence of the mouse MnSOD gene   Total citations: 4, self-citations: 0, citations by others: 4

10.
Structure and sequence of the human homeobox gene HOX7.   Total citations: 13, self-citations: 0, citations by others: 13
A cosmid containing the human sequence HOX7, homologous to the murine Hox-7 gene, was isolated from a genomic library, and the positions of the coding sequences were determined by hybridization. DNA sequence analysis demonstrated two exons that code for a homeodomain-containing protein of 297 amino acids. The open reading frame is interrupted by a single intron of approximately 1.6 kb, the splice donor and acceptor sites of which conform to known consensus sequences. The human HOX7 coding sequence has a very high degree of identity with the murine Hox-7 cDNA. Within the homeobox, the two sequences share 94% identity at the DNA level, all substitutions being silent. This high level of sequence similarity is not confined to the homeodomain; overall the human and murine HOX7 gene products show 80% identity at the amino acid level. Both the 5' and 3' untranslated regions also show significant similarity to the murine gene, with 79 and 70% sequence identity, respectively. The sequence upstream of the coding sequence of exon 1 contains a GC-rich putative promoter region. There is no TATA box, but a CCAAT and numerous GC boxes are present. The region encompassing the promoter region, exon 1, and the 5' region of exon 2 has a higher than expected frequency of CpG dinucleotides; numerous sites for rare-cutter restriction enzymes are present, a characteristic of HTF islands.

11.
The mitochondrial genomes of cytoplasmic "petite" (rho-) mutants of Saccharomyces cerevisiae have been used to sequence the cytochrome b gene. A continuous sequence of 6.2 kilobase pairs has been obtained from 71.4 to 80.2 units of the wild type map. This region contains all the cytochrome b mutations previously assigned to the cob1 and cob2 genetic loci. Analysis of the DNA sequence has revealed that in the strain D273-10B, the cytochrome b gene is composed of three exons. The longest exon (b1) codes for the first 252 to 253 amino acids from the NH2-terminal end of the protein. The next two exons (b2 and b3) code for 16 to 18 and 115 to 116 amino acids, respectively. The complete cytochrome b polypeptide chain consists of 385 amino acids. Based on the amino acid composition, the yeast protein has a molecular weight of 44,000. The three exon regions of the cytochrome b gene are separated by two introns. The intron between b1 and b2 is 1414 nucleotides long and contains a reading frame that is continuous with the reading frame of exon b1. This intron sequence is potentially capable of coding for another protein of 384 amino acid residues. The second intron is 733 nucleotides long. This sequence is rich in A + T and includes a G + C cluster that may be involved in processing of the cytochrome b messenger. The organization of the cytochrome b region in S. cerevisiae D273-10B is somewhat less complex than has been reported for other yeast strains in which exon b1 appears to be further fragmented into three smaller exons.

12.
Gene structure and nucleotide sequence for rat cytochrome P-450c   Total citations: 2, self-citations: 0, citations by others: 2
Two clones from rat genomic libraries that contain the entire gene for rat cytochrome P-450c have been isolated. lambda MC4, the first clone isolated from an EcoR1 library, contained a 14-kb insert. A single 5.5-kb EcoR1 fragment from lambda MC4, the EcoR1 A fragment, hybridized to a partial cDNA clone for the 3' end of the cytochrome P-450c mRNA. This fragment was sequenced using the dideoxynucleotide chain termination methodology with recombinant M13 bacteriophage templates. Comparison of this sequence with the complete cDNA sequence of cytochrome P-450MC [Yabusaki et al. (1984) Nucleic Acids Res. 12, 2929-2938] revealed that the EcoR1 A fragment contained the entire cytochrome P-450c gene with the exception of a 90-bp leader sequence. The gene sequence is in perfect agreement with the cDNA sequence except for two bases in exon 2. A second genomic clone, lambda MC10, which was isolated from a HaeIII library, contains the missing leader sequence as well as 5' regulatory sequences. The entire gene is about 6.1 kb in length with seven exons separated by six introns, all of the intron/exon junctions being defined by GT/AG. Amino- and carboxy-terminal information are contained in exons 2 and 7, respectively. These exons contain the highly conserved DNA sequences that have been observed in other cytochrome P-450 species. Potential regulatory sequences have been located both 5' to the gene and within intron I. A comparison of the coding information for cytochrome P-450c with the sequence of murine cytochrome P3-450 and rat cytochrome P-450d revealed a 70% homology in both the DNA and amino acid sequence, suggesting a common ancestral gene. Genomic blot analyses of rat DNA indicated that the 3-methylcholanthrene-inducible family of cytochrome P-450 isozymes is more limited in number compared to the phenobarbital-inducible isozymes. Cross-hybridization studies with human DNA suggest a high degree of conservation between rat cytochrome P-450c and its human homolog although gross structural differences do exist between the two genes.

13.
14.
The gene for human C-reactive protein (CRP) is mapped within a 34-kilobase pair genomic DNA segment identified by chromosome walking through overlapping DNA fragments cloned into a lambda phage library. Within 16 kilobase pairs upstream and downstream of the locus for the authentic CRP gene, only one other sequence homologous to that for CRP could be found. Sequencing analysis indicates this sequence to be a pseudogene with 50-80% region-specific homology. Comparison of the authentic CRP gene cloned from genomic DNA libraries independently prepared from three patients indicates no difference in the 5' and 3' flanking region, promoter region, or coding sequence. Only a polymorphism in the length of the poly(GT) stretch located in the intron is observed. There appears to be only one gene locus and copy per haploid chromosome for the authentic CRP gene and its pseudogene.

15.
S S Fojo, S W Law, H B Brewer. FEBS Letters, 1987, 213(1): 221-226
The complete nucleic acid sequence of human preproapolipoprotein (apo) C-II has been determined from 2 apoC-II clones isolated from 2 different human genomic DNA libraries. The cloned fragments were approx. 14 and 18 kb long, and sequence analysis established that the apoC-II gene consists of 3338 nucleotides containing 3 intervening sequences of 2391, 167, and 298 bases. The first intron is located within the 5'-untranslated region of apoC-II and contains 4 Alu type sequences. The second intron interrupts the codon specifying amino acid -11 of the apoC-II signal peptide. The last intron, which contains a 38 bp sequence that is repeated 6 times, interrupts the codon specifying amino acid +44 of the mature apolipoprotein.

16.
Plasmid clones containing cDNA coding for the B-chain of human C1q were isolated from a liver cDNA library. The longest cDNA insert isolated contained all the coding sequence for amino acid residues B1 to B226 plus a 3' non-translated region of 264 nucleotides that extended into the poly(A) tail, thus accounting for 950 nucleotides of the mRNA. The B-chain mRNA was estimated by Northern-blot analysis to be 1.46 kb (kilobases) long, which indicated that approx. 500 bases were not accounted for in the cDNA clone. A cosmid clone containing the C1q B-chain gene was isolated from a human genomic DNA library. The precise 5' limit of the gene was not established, but from the data available it appears that the gene is approx. 2.6 kb long. The coding sequence for residues B1 to B226 in the gene is interrupted by one intron, of 1.1 kb, which is located within the codon coding for glycine at position B36. This glycine residue is located in the middle of the triple-helical regions found in C1q at exactly the position where there is an unusual structural feature, i.e. a bend in each of the helical regions brought about by the interruption of the Gly-Xaa-Yaa repeating triplet sequences in the A- and C-chains and the presence of an 'extra' triplet in the B-chain. Nucleotide sequencing of the 5' end of the gene indicates the presence of a predominantly hydrophobic stretch of 29 amino acids, immediately before residue B1, which could serve as a signal peptide.

17.
Structure of the mouse C-reactive protein gene   Total citations: 3, self-citations: 0, citations by others: 3
A genomic DNA clone corresponding to the mouse C-reactive protein (CRP) has been isolated and characterized. The mouse CRP gene is 1.9 kilobase pairs in length and contains a single intron of 213 base pairs which interrupts the codon for the 2nd amino acid residue of the mature CRP protein. We compared nucleotide sequences of the mouse and human CRP genes and discussed structures of possible regulatory sequences. With this characterization, the isolation and sequence analyses of a set of mouse and human pentraxin genes, i.e. CRP and serum amyloid P component genes, is now complete.

18.
19.
The nucleotide sequences coding for murine complement component C3 have been determined from a cloned genomic DNA fragment and several overlapping cloned complementary DNA fragments. The amino acid sequence of the protein was deduced. The mature beta and alpha subunits contain 642 and 993 amino acids respectively. Including a 24 amino acid signal peptide and four arginines in the beta-alpha transition region, which are probably not contained in the mature protein, the unglycosylated single chain precursor protein preproC3 would have a molecular mass of 186 484 Da and consist of 1663 amino acid residues. The C3 messenger RNA would be composed of a 56 +/- 2 nucleotide long 5' non-translated region, 4992 nucleotides of coding sequence, and a 3' non-translated region of 39 nucleotides, excluding the poly A tail. The beta chain contains only three cysteine residues, the alpha chain 24, ten of which are clustered in the carboxy terminal stretch of 175 amino acids. Two potential carbohydrate attachment sites are predicted for the alpha chain, none for the beta chain. From a comparison with human C3 cDNA sequence (of which over 80% has been determined) an extensive overall sequence homology was observed. Human and murine preproC3 would be of very similar length and share several noteworthy properties: the same order of the subunits in the precursor, the same basic residue multiplet in the beta-alpha transition region, and a glutamine residue in the thioester region. The equivalent position of the known factor I cleavage sites in human C3 alpha could be located in the murine C3 alpha chain and the size and sequence of the resulting peptide were deduced. A comparison of the amino acid sequences of murine C3 and human alpha 2-macroglobulin is given. Several areas of strong sequence homology are observed, and we conclude that the two genes must have evolved from a common ancestor.
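The precursor and mRNA lengths quoted in this abstract fit together arithmetically, which can be sketched as follows. The names are hypothetical, and treating the 4992-nucleotide coding sequence as including one stop codon is an assumption that the abstract does not state; only the numbers themselves come from the abstract.

```python
# Figures quoted in the abstract; all names here are hypothetical.
SIGNAL_PEPTIDE_AA = 24
BETA_CHAIN_AA = 642
LINKER_ARGININES_AA = 4   # arginines in the beta-alpha transition region
ALPHA_CHAIN_AA = 993
FIVE_PRIME_UTR_NT = 56    # nominal value; the abstract gives 56 +/- 2
CODING_NT = 4992
THREE_PRIME_UTR_NT = 39   # excludes the poly A tail


def prepro_c3_counts():
    """Recompute the preproC3 length and the nominal mRNA length from their parts."""
    residues = (SIGNAL_PEPTIDE_AA + BETA_CHAIN_AA
                + LINKER_ARGININES_AA + ALPHA_CHAIN_AA)
    # Assumption: the quoted coding length includes a 3-nucleotide stop codon.
    coding_nt_with_stop = residues * 3 + 3
    mrna_nt_nominal = FIVE_PRIME_UTR_NT + CODING_NT + THREE_PRIME_UTR_NT
    return {
        "residues": residues,
        "coding_nt_with_stop": coding_nt_with_stop,
        "mrna_nt_nominal": mrna_nt_nominal,
    }


if __name__ == "__main__":
    for name, value in prepro_c3_counts().items():
        print(name, value)
```

The residue total matches the 1663 amino acids stated for preproC3, and under the stop-codon assumption the coding length matches the quoted 4992 nucleotides. The nominal mRNA length is a derived figure and carries the same +/- 2 nucleotide uncertainty as the 5' non-translated region.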

20.
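Cloning and sequence analysis of the pea lectin gene   Total citations: 11, self-citations: 0, citations by others: 11
Genomic DNA was isolated from young pea leaves, specific primers were designed, and the pea lectin gene was amplified by the polymerase chain reaction and cloned into the EcoRV site of the E. coli plasmid pBluescript SK(+). The fragment was then subcloned into pUC19. Sequence analysis showed that the cloned fragment is 832 bp long and contains the complete coding sequence of the pea lectin gene. The gene has no introns. Compared with the previously reported sequence, its nucleotide sequence and deduced amino acid sequence show 99.6% and 98.9% homology, respectively.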
