Similar articles
1.
We have isolated cDNAs for carcinoembryonic antigen (CEA) and for a normal cross-reacting antigen (NCA) and report here their nucleotide and derived amino acid sequences. Our data show that both the CEA and NCA polypeptides are organized into extracellular domains, some with cysteine-linked loops, that share extensive sequence homology (approximately 78% overall) with each other and appear similar to immunoglobulin superfamily members. A major difference between the two apoproteins is the presence of a single loop-domain in NCA compared to three tandemly repeated loop-domains in CEA. Sequence comparisons between the extracellular domains of CEA and NCA show that the N-terminal and adjacent loop domains of each apoprotein have high homology (85-90%) to each other, while comparison of loop-domain regions reveals a possible nonrandom distribution of base changes and altered amino acids near certain cysteine residues that are inferred to be involved in forming disulfide loops. Both apoproteins show high identity in their hydrophobic C-termini that are reminiscent of the type of transmembrane tails seen in proteins that potentiate signal transduction. These findings, coupled with distinct expression profiles of CEA and NCA mRNAs, suggest that these apoproteins may function as unique cell-surface molecules mediating cell-specific interactions in normal and neoplastic cells.

2.
The amino acid sequence of the bovine mitochondrial nicotinamide nucleotide transhydrogenase was recently deduced from isolated cDNAs and reported [Yamaguchi, M., Hatefi, Y., Trach, K., and Hoch, J.A. (1988) J. Biol. Chem. 263, 2761-2767]. The cDNAs lacked the N-terminal coding region, however, and the 8 N-terminal residues were determined by protein sequencing. In the present study, the nucleotide sequence of the 5' upstream region was determined by dideoxynucleotide sequencing of the transhydrogenase messenger RNA, and amino acid sequences of the N-terminal region and the signal peptide of the enzyme were deduced from the nucleotide sequence. The N-terminal sequence of the enzyme as deduced from the mRNA sequence is the same as that determined by protein sequencing, with one difference. Protein sequencing showed Ser as the N-terminal residue. The mRNA sequence indicated that Ser is the second N-terminal residue, and the first is Cys. The above reference already pointed out that preparations of the enzyme are mixtures of two polypeptides, one being one residue shorter at the N terminus than the other. The signal peptide consists of 43 residues, is rich in basic (4 Lys, 2 Arg) and hydroxylated (4 Thr, 3 Ser) amino acids, and lacks acidic residues.

3.
4.
5.
Acid-soluble collagens were prepared from connective tissues in the foot and adductor muscles of the abalone Haliotis discus with limited proteolysis using pepsin. The collagen preparation solubilized with 1% pepsin contained two types of alpha-chains that differed in their N-terminal amino acid sequences. Accordingly, two types of full-length cDNAs coding for collagen proalpha-chains were isolated from the foot muscle of the same animal and these proteins were named Hdcols (Haliotis discus collagens) 1alpha and 2alpha. The two N-terminal amino acid sequences of the abalone pepsin-solubilized collagen preparation corresponded to either of the two sequences deduced from the cDNA clones. In addition, several tryptic peptides prepared from the pepsin-solubilized collagen and fractionated by HPLC showed N-terminal amino acid sequences identical to those deduced from the two cDNA clones. Hdcols 1alpha and 2alpha consisted of 1378 and 1439 amino acids, respectively, showing a primary structure typical of fibril-forming collagens. The N-terminal propeptides of the two collagen proalpha-chains contained cysteine-rich globular domains. It is of note that Hdcol 1alpha completely lacked a short Gly-X-Y triplet repeat sequence in its propeptide. Such an unusual structure has never before been reported for any fibril-forming collagen. The main triple-helical domains of both chains consisted of 1014 amino acids, where the expected glycine residue in the triplet at the 598th position from the N-terminus was replaced by alanine in Hdcol 1alpha and by serine in Hdcol 2alpha. Both proalpha-chains of abalone collagens contained six cysteine residues in the carboxyl-terminal propeptide, lacking two cysteine residues usually found in vertebrate collagens. Northern blot analysis demonstrated that the mRNA levels of Hdcols 1alpha and 2alpha in various tissues including muscles were similar to each other.

6.
A cDNA containing the entire coding region for a member of the carcinoembryonic antigen (CEA) gene family has been cloned from a cDNA library of HLC-1 cells by immunochemical screening with an antibody specific to nonspecific crossreacting antigen (NCA). The cDNA encodes a precursor form of a polypeptide consisting of a 34-residue signal sequence, a 108-residue N-terminal (N-) domain, a 178-residue domain (NCA-I domain) and a 24-residue domain rich in hydrophobic amino acids (M-domain). Each domain has an amino acid sequence that is distinct from, but homologous to, that of the corresponding domain of CEA. Unlike the coding sequences, the 3'-untranslated sequences differ markedly between the NCA and CEA cDNAs, facilitating the preparation of probes that will discriminate between nucleotide sequences for CEA and NCA.

7.
Amino acid sequences of human collagen alpha 1(VI) and alpha 2(VI) chains were completed by cDNA sequencing and Edman degradation, demonstrating that the mature polypeptides contain 1009 and 998 amino acid residues, respectively. In addition, they contain small signal peptide sequences. Both chains show 31% identity in the N-terminal (approximately 235 residues) and C-terminal (approximately 430 residues) globular domains, which are connected by a triple helical segment (335-336 residues). Internal alignment of the globular sequences indicates a repetitive 200-residue structure (15-23% identity) occurring three times (N1, C1, C2) in each chain. These repeating subdomains are connected to each other and to the triple helix by short (15-30 residues) cysteine-rich segments. The globular domains possess several N-glycosylation sites but no cell-binding RGD sequences, which are exclusively found in the triple helical segment. Sequencing of alpha 2(VI) cDNA clones revealed two variant chains with a distinct C2 subdomain and 3' non-coding region. The repetitive segments C1, C2 and, to a lesser extent, N1 show significant identity (15-18%) to the collagen-binding A domains of von Willebrand factor (vWF) and they are also similar to some integrin receptors, complement components and a cartilage matrix protein. Since the globular domains of collagen VI come into close contact with triple helical segments during the formation of tissue microfibrils, this suggests that the globular domains bind to collagenous structures in a manner similar to the binding of vWF to collagen I.

8.
Fifty-eight tryptic and Staphylococcus aureus V8 protease-generated peptides from bovine dopamine beta-hydroxylase were isolated by reverse-phase high pressure liquid chromatography and sequenced. These peptide sequences were compared with the deduced amino acid sequences of bovine and human dopamine beta-hydroxylase obtained from the cloned cDNAs. Bovine peptide sequences had five differences from the sequence derived from the bovine cDNA, and four of the changes could be accounted for by a single base change in the DNA. N-terminal sequence analysis of the bovine enzyme indicated that it contained two N termini, one of which is 3 amino acids longer than the other and begins with the sequence Ser-Ala-Pro. The amino acid sequences deduced from the bovine and human cDNAs are 19 and 25 amino acids longer, respectively, and these additional amino acids represent leader peptide sequences. Two bovine peptide sequences contained glycosylation sites and gave positive tests for carbohydrate residues, and two others contained the consensus sequence for a glycosylation site but were negative in the carbohydrate test. The bovine enzyme contains 6 Trp, as compared with 7 in the bovine cDNA and 8 in the human cDNA. The protein and bovine cDNA contain 24 Tyr each, as compared with 26 in the human cDNA. These numbers indicate that the true extinction coefficient ε(1%, 280 nm) = 8.95, and, therefore, that it is 28% lower than the previously determined value. The data also identify 5 His-containing regions that may be involved in Cu2+ coordination at the active site.

9.
10.
X L Li, H Chen, L G Ljungdahl. Applied microbiology, 1997, 63(12): 4721-4728
Two cDNAs encoding two cellulases, CelA and CelC, were isolated from a cDNA library of the polycentric anaerobic fungus Orpinomyces sp. strain PC-2 constructed in Escherichia coli. Nucleotide sequencing revealed that the celA cDNA (1,558 bp) and celC cDNA (1,628 bp) had open reading frames encoding polypeptides of 459 (CelA) and 449 (CelC) amino acids, respectively. The two cDNAs were 76.9 and 67.7% identical at the nucleotide and amino acid levels, respectively. Analysis of the deduced amino acid sequences showed that starting from the N termini, both CelA and CelC had signal peptides, which were followed by noncatalytic repeated peptide domains (NCRPD) containing two repeated sequences of 33 to 40 amino acid residues functioning as docking domains. The NCRPDs and the catalytic domains were separated by linker sequences. The NCRPDs were homologous to those found in several hydrolases of anaerobic fungi, whereas the catalytic domains were homologous to the catalytic domains of fungal cellobiohydrolases and bacterial endoglucanases. The linker sequence of CelA contained predominantly glutamine and proline residues, while that of CelC contained mainly threonine residues. CelA and CelC did not have a typical cellulose binding domain (CBD). CelA and CelC expressed in E. coli rapidly decreased the viscosity of carboxymethyl cellulose (CMC), indicating that there was endoglucanase activity. In addition, they produced cellobiose from CMC, acid-swollen cellulose, and cellotetraose, suggesting that they had cellobiohydrolase activity. The optimal activity conditions with CMC as the substrate were pH 4.3 to 6.8 and 50 °C for CelA and pH 4.6 to 7.0 and 40 °C for CelC. Despite the lack of a CBD, CelC displayed a high affinity for microcrystalline cellulose, whereas CelA did not.

11.
The amino acid sequences of human carcinoembryonic antigen deduced from the cDNA sequences have been analysed. This antigen contains seven extracellular domains (the three previously recognized highly repetitive domains are each further divided into A and B subdomains) which are strikingly homologous to each other and to immunoglobulin variable regions, poly-Ig receptor and Thy 1.1. The N-terminal domain lacks an immunoglobulin-like fold but the other six domains have one, suggesting that CEA belongs to the immunoglobulin superfamily.

12.
13.
14.
S J Kim, K N Uhm, Y K Kang, O J Yoo. DNA sequence, 1991, 1(3): 181-187
cDNAs encoding bovine and feline preprogastrins have been cloned from antral mucosa mRNA and their complete nucleotide sequences determined. The gastrin mRNA of each animal encodes a preprogastrin of 104 amino acids consisting of a signal peptide, a prosegment of 37 amino acids, and a gastrin 34 sequence, followed by a glycine (the amide donor). The cleavage following a pair of lysine residues yields gastrin 17. We found that the pairs of basic residues flanking gastrin 34, the typical processing site sequence of all other preprogastrins and many peptide hormones, were arginine pairs in the bovine preprogastrin, but in the feline preprogastrin the first basic amino acid pair had changed from Arg-Arg to Arg-Trp (residues 57-58). Comparison of these amino acid and nucleotide sequences with published mammalian sequences showed extensive homology in the coding (63 to 73% amino acid identity) and in the untranslated regions (67 to 89% identity). The prosequence, the most variable region, differs more between bovine and human preprogastrin (54% identity) and between bovine and rat preprogastrin (54% identity) than between other species (62 to 82% identity).

15.
Human collagen alpha 3(VI) chain mRNA (approximately 10 kb) was cloned and shown by sequence analysis to encode a 25 residue signal peptide, a large N-terminal globule (1804 residues), a central triple helical segment (336 residues) and a C-terminal globule (803 residues). Some of the sequence was confirmed by Edman degradation of peptides. The N-terminal globular segment consists of nine consecutive 200 residue repeats (N1 to N9) showing internal homology and also significant identity (17-25%) to the A domains of von Willebrand Factor and similar domains present in some other proteins. Deletions were found in the N3 and N9 domains of several cDNA clones suggesting variation of these structures by alternative splicing. The C-terminal globule starts immediately after the triple helical segment with two domains C1 (184 residues) and C2 (248 residues) being similar to the N domains. They are followed by a proline rich, repetitive segment C3 of 122 residues, with similarity to some salivary proteins, and domain C4 (89 residues), which is similar to the type III repeats present in fibronectin and tenascin. The most C-terminal domain C5 (70 residues) shows 40-50% identity to a variety of serine protease inhibitors of the Kunitz type. The whole sequence contains 29 cysteines which are mainly clustered in short segments connecting domains N1, C1, C2 and the triple helix, and in the inhibitor domain. Five putative Arg-Gly-Asp cell-binding sequences are exclusively localized in the triple helical segment. (ABSTRACT TRUNCATED AT 250 WORDS)

16.
Synthetic peptides from the N-domains of CEACAMs activate neutrophils.   Cited by: 4 (self-citations: 0, citations by others: 4)
Four members of the carcinoembryonic antigen family, CEACAM1, CEACAM8, CEACAM6 and CEACAM3, recognized by CD66a, CD66b, CD66c and CD66d monoclonal antibodies (mAb), respectively, are expressed on human neutrophils. CD66a, CD66b, CD66c and CD66d mAb binding to neutrophils triggers an activation signal that regulates the adhesive activity of CD11/CD18, resulting in an increase in neutrophil adhesion to human umbilical vein endothelial cells. Molecular modeling of CEACAM1 using IgG and CD4 as models has been performed, and three peptides from the N-terminal domain were found to increase neutrophil adhesion to human umbilical vein endothelial cell monolayers. The peptides were 14 amino acids in length and were predicted to be present at loops and turns between beta-sheets. To better understand the amino acid sequences critical for this biological activity, in the present study we examined the other neutrophil CEACAMs and the highly homologous CEACAM, CEA. Molecular modeling of the N-terminal domains of human CEACAM8, -6, -3 and CEA was performed. Twenty peptides, each 14 amino acids in length, that were homologous to the previously reported peptides from the N-domains of CEACAM1, were synthesized and tested for their ability to alter neutrophil adhesion. Only one new peptide, from the N-domain of CEA, was found to increase neutrophil adhesion, and this peptide differed from the corresponding CEACAM1 peptide by only a single conservative amino acid substitution. Importantly, minor amino acid differences between active and inactive homologous peptides suggest regions of these peptides that are critical for biological activity. The data suggest that the regions SMPF of peptide CD66a-1, QLFG of peptide CD66a-2 and NRQIV of peptide CD66a-3 are critical for the activities of these peptides, and for the native CEACAMs.

17.
We have recently reported a characterization of cDNA clones that encode an apparently novel human collagen that undergoes alternative splicing. These cDNAs covered one-third of the corresponding 2.5-2.8-kilobase mRNAs. We have now determined the complete primary structure of the protein encoded by several overlapping cDNAs isolated from a human endothelial cell library. Since the deduced translation product of the cDNAs is different in structure from all other collagen types, we have given the collagen chain encoded by the cDNAs the designation alpha 1 (XIII). The deduced polypeptide consists of three collagenous domains and four noncollagenous domains, two of them separating the collagenous domains and two located at the N-terminal and C-terminal ends of the polypeptide. Cysteine residues are found in three of the noncollagenous domains and also in the extreme N-terminal collagenous domain. Surprisingly, comparison of the nucleotide sequences of the overlapping cDNA clones demonstrates that there are several alpha 1 (XIII) collagen mRNAs in HT-1080 human fibrosarcoma cells and human endothelial cells which differ in coding potential. Nuclease S1 mapping experiments suggest that these different mRNAs arise through alternative splicing of the precursor RNA at five locations within the coding region. This property makes type XIII collagen unique among all the collagen types studied so far. Its polypeptide length, therefore, may vary between 526 and 614 amino acids, depending on what internal splicing has taken place.

18.
Vipera lebetina venom contains a specific coagulant Factor X activator (VLFXA) that cleaves the Arg52-Ile53 bond in the heavy chain of human factor X. VLFXA is a glycoprotein that is composed of a heavy chain (HC) and two light chains (LC) linked by disulfide bonds. The complete amino acid sequences of the three chains of the factor X activator from V. lebetina snake venom are deduced from the nucleotide sequences of cDNAs encoding these chains. The full-length cDNA (2347 bp) sequence of the HC encodes an open reading frame (ORF) of 612 amino acids that includes a signal peptide, a propeptide and the mature metalloproteinase with disintegrin-like and cysteine-rich domains. The light chain LC1 contains 123 and LC2 135 amino acid residues. Both light chains belong to the class of C-type lectin-like proteins. The N-termini of the VLFXA chains and the internal sequences of peptide fragments of the protein detected by liquid chromatography-electrospray ionization tandem mass spectrometry (LC MS/MS) are 100% identical to the sequences deduced from the cDNA. The molecular masses of tryptic fragments of VLFXA chains analyzed by matrix-assisted laser desorption ionization time of flight mass spectrometry (MALDI-TOF MS) also confirm the protein sequences deduced from the cDNAs. These are the first cloned factor X activator heavy and light chains. We demonstrate that the heavy and light chains are synthesized from different genes.

19.
To study the usefulness of low-molecular-weight glutenin subunits (LMW-GS) of Agropyron elongatum (Host) Nevski for wheat (Triticum aestivum L.) quality improvement, we characterized LMW-GS genes of A. elongatum. Nine LMW-GS genes of A. elongatum, which were named AeL1 to AeL9, were cloned by genomic PCR. After sequencing, we obtained complete open reading frames from AeL2 to AeL8 and partial genes of AeL1 and AeL9. All nine sequences are homoeologous to those of wheat and related grasses. Comparison of the deduced amino acid sequences with those of published LMW-GS suggests that the basic structures of all the subunits are very similar. However, except for AeL4 and AeL5, which contain an N-terminal sequence identical to that of LMW-m, the other LMW-GS sequences isolated from A. elongatum cannot be classified according to previous criteria for the three types, LMW-m (methionine), LMW-s (serine), and LMW-i (isoleucine), and then 12 groups. In addition, the LMW-GS sequences of A. elongatum have some distinctive features: AeL2, AeL3, and AeL6 each contain a Cys residue in the signal peptide, which is absent in most LMW-GS; AeL3, AeL6, AeL8, and AeL9 each have their first Cys residue in the N-terminal repetitive domain; both AeL2 and AeL5 have nine Cys residues, with an extra Cys residue in the N-terminal repetitive domain and the repetitive and glutamine-rich domain; AeL2, AeL3, AeL6, and AeL9 comprise long repetitive domains. Phylogenetic analysis indicates that there is a relatively weak sequence identity between the LMW-GS genes from A. elongatum cloned in this study and those reported from other plants. Three LMW-GS sequences, AeL2, AeL3, and AeL6, cluster more closely with Glu-A3 from wheat than with those from other plants. The possible use of these genes in relation to the high quality of hybrid wheat is discussed.

20.
cDNAs encoding the entire coding regions of the precursors (p) of rat long chain acyl-CoA dehydrogenase (LCAD), short chain acyl-CoA dehydrogenase (SCAD) and isovaleryl-CoA dehydrogenase (IVD) have been cloned and sequenced. Three cDNAs for rat liver LCAD together cover a 1440-base pair region. These cDNAs encode the entire 430-amino acid sequence of pLCAD, including the 30-amino acid leader peptide and the 400-amino acid mature LCAD. A single 1773-base pair cDNA for rat SCAD covers the entire coding region (414 amino acids), including the 26-amino acid leader peptide and the 388-amino acid mature peptide. Four identified IVD cDNAs, when combined, encompass a 2104-base region, and encode 424 amino acids including a 30-amino acid leader peptide and the 394-amino acid mature peptide. The identities of all cDNA clones have been confirmed by matching the amino acid sequences predicted from the respective cDNAs to the amino-terminal and tryptic peptide sequences derived from the corresponding purified rat enzyme. Comparison of the sequences of four rat acyl-CoA dehydrogenases, including LCAD, MCAD, SCAD, and IVD, and two of their human counterparts (MCAD and SCAD) reveals a high degree of homology (57 invariant and 92 near invariant residues: 30.6-35.4% of identical residues in pairwise comparisons), suggesting that these enzymes belong to a gene family and have evolved from a common ancestral gene.
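The lengths quoted in the abstract above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original study: it takes only the numbers stated in the abstract, and the names `LENGTHS`, `check_precursor_lengths` and `min_coding_bases` are hypothetical helpers introduced for illustration. It assumes the standard three bases per codon plus one stop codon when estimating the smallest coding region that could hold each precursor.

```python
# Lengths as stated in the abstract: (precursor aa, leader aa, mature aa, cDNA bases).
# The dictionary and function names are illustrative, not from the source.
LENGTHS = {
    "LCAD": (430, 30, 400, 1440),
    "SCAD": (414, 26, 388, 1773),
    "IVD": (424, 30, 394, 2104),
}


def min_coding_bases(precursor_residues: int) -> int:
    """Smallest coding region for a precursor: 3 bases per residue plus a stop codon."""
    return 3 * (precursor_residues + 1)


def check_precursor_lengths(lengths: dict) -> dict:
    """For each enzyme, report whether leader + mature equals the precursor length
    and whether the reported cDNA region is long enough to encode the precursor."""
    report = {}
    for name, (precursor, leader, mature, cdna_bases) in lengths.items():
        report[name] = {
            "parts_sum_to_precursor": leader + mature == precursor,
            "cdna_long_enough": cdna_bases >= min_coding_bases(precursor),
        }
    return report


if __name__ == "__main__":
    for enzyme, result in check_precursor_lengths(LENGTHS).items():
        print(enzyme, result)
```

Under these assumptions, all three enzymes pass both checks: the leader and mature lengths add up to the stated precursor length, and each reported cDNA region exceeds the minimum coding length, which leaves room for untranslated sequence.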
