Similar articles
 20 similar articles found (search time: 15 ms)
1.
T cell receptor (TCR) and B cell receptor (BCR) junctions, also known as the CDR3, are where the V, D, and J gene segments converge, coding for a loop structure important for contacting ligands. J segments contribute to the formation of the CDR3 loop through their 5′ ends, which vary in length and show high sequence variability. The 5′ ends of J segments of TCRα genes show nucleotide sequence similarities to TCRDδ segments as high as 89% and show a preponderance of murine TCRDδ2 or human TCRDδ3 amino acid sequence similarities. Surprisingly, most of the 5′ ends of TCRJγ segments show nucleotide and amino acid sequence similarities with TCRDβ segments. All murine and human BCRJH segments and most TCRJδ segments contain amino acid sequences at their 5′ ends that resemble their own D segments, a finding that is not seen with TCRJβ segments. TCRα and TCRγ genes thus make up for their lack of separate D segments with distinct D-like segments that are built into the 5′ ends of their J segments. In some cases, TCR and BCR genes that utilize separate D segments also receive D-like contributions through the 5′ ends of their J segments, adding further diversity to their CDR3 loops.

2.
The human TCR beta-chain gene complex includes at least 57 variable (V) gene segments, a number estimated using a combination of Southern blots of conventional and pulsed field gels, sequence analysis of cDNA clones, and analysis of genomic cosmid and phage clones. This number includes six TCR beta-chain V genes in four new families identified here by sequence analysis of clones derived from a human TCR beta-chain specific cDNA library. Comparison of the sequences of the new V beta genes with previously reported V beta sequences reveals predicted similarities but less than 75% nucleic acid identity, which establishes them as new V beta families. One of the new V beta gene families includes three genes and the other three are single member families. Identification of these six new V beta genes falling into four V beta families brings the total number of transcribed human V beta families to 24 and makes it possible to refine the estimate of the total number of human TCR V beta genes to 57.

3.
Cloning and sequencing of human LH/hCG receptor cDNA   Cited by: 7 (self-citations: 0; by others: 7)
We have isolated and sequenced a cDNA encoding the human luteinizing hormone-choriogonadotropin (LH/hCG) receptor. The deduced amino acid sequence (699 residues), containing seven putative transmembrane segments, displays sequence similarity to G protein-coupled receptors. The receptor includes a 335-residue extracellular domain that contains six N-linked glycosylation sites. While the protein is 85 and 87% identical overall with the previously cloned rat and porcine LH/hCG receptors, respectively, the most highly conserved regions are the putative transmembrane segments (91 and 94% similarity, respectively).

4.
Messenger (m)RNA sequences complementary to the mRNA sequences for the receptors to epidermal growth factor (EGF), interleukin-2 (IL-2) and transferrin (TF) were written out and compared for homologies with their ligands (EGF, IL-2 and TF, respectively). Highly significant amino acid and nucleotide homologies between the ligands and their appropriate receptor complements were detected in each case. For example, EGF and its receptor complement contained two homologous segments, each being six amino acids in length. When these segments were screened for matches against a protein sequence bank (3060 proteins and 616,748 test segments), only EGF contained either sequence. Similar results were obtained with IL-2 and TF. In each case, the homologous segments corresponded to complementary regions in the ligand binding portion of the receptor.
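The comparison described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' procedure: the abstract does not say how the complementary strand was read or translated, so the sketch only shows two generic steps, writing out a reverse-complement RNA strand and listing six-residue segments shared by two protein sequences. The function names and the toy sequences are hypothetical.

```python
# Illustrative sketch only; names and example sequences are hypothetical.
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement_rna(rna):
    """Return the complementary RNA strand, written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(rna.upper()))


def shared_segments(protein_a, protein_b, k=6):
    """Return the sorted k-residue segments that occur in both sequences."""
    def kmers(seq):
        return {seq[i:i + k] for i in range(len(seq) - k + 1)}

    return sorted(kmers(protein_a) & kmers(protein_b))


if __name__ == "__main__":
    print(reverse_complement_rna("AUGGCC"))
    print(shared_segments("MKTAYIAKQR", "GGKTAYIAKLL"))
```

Screening a segment against a sequence bank, as the abstract reports, would amount to repeating the membership check over every protein in the bank.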

5.
Amino acid sequences for 11 acetohydroxy acid synthase (EC 4.1.3.18; AHS) polypeptides with experimentally established activity were chosen for computational comparisons to detect conserved local information associated with reaction specificity for each sequence. Windowed analysis by Pearson product moment cross-correlation of six amino acid sidechain properties revealed locally conserved segments common to all proteins with AHS activity. Seven information segments were detected in the same arrangement in sequences for the large subunit polypeptides of prokaryotes, and in the sequences for single polypeptides of eukaryotic AHS. The information segments were numbered 1-7 according to sequential position, and sequence features such as cofactor binding sites were defined for specific segments. Extension of the information segment analysis to seven other proteins of the pyruvate decarboxylase superfamily permitted use of the content and organization of information segments to recognize four classes of enzyme reaction specificity. Estimates of information entropy, based upon a state space defined by reaction specificity, directly reflected the known reaction complexity for all but one enzyme examined. Our data suggest that development of information-segment models for enzyme superfamilies may improve the accuracy of inferring protein activity from sequence.
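A windowed Pearson correlation of the kind this abstract mentions might look like the sketch below. It rests on several assumptions: the abstract does not name its six sidechain properties, so a single property (the Kyte-Doolittle hydropathy scale) stands in for them; the two sequences are taken to be already aligned and of equal length; and the window size of 5 and the function names are arbitrary illustrative choices.

```python
from math import sqrt

# Kyte-Doolittle hydropathy values, used here as a stand-in for the
# (unspecified) sidechain properties of the original study.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def pearson(x, y):
    """Pearson product-moment correlation; None if either input is constant."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return None
    return sxy / sqrt(sxx * syy)


def windowed_correlation(seq_a, seq_b, window=5, scale=KYTE_DOOLITTLE):
    """Correlate the property profiles of two aligned sequences per window."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    profile_a = [scale[residue] for residue in seq_a]
    profile_b = [scale[residue] for residue in seq_b]
    return [
        pearson(profile_a[i:i + window], profile_b[i:i + window])
        for i in range(len(profile_a) - window + 1)
    ]


if __name__ == "__main__":
    for r in windowed_correlation("MKVLAAGIW", "MKILSAGLW"):
        print(r)
```

Runs of windows with consistently high correlation would be candidates for locally conserved segments; how the original work thresholded or combined the six properties is not stated in the abstract.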

6.
Pig plasma gelsolin (Mr = 81595; 739 residues) contains 704 identical residues out of a maximum of 730 when compared to the cytoplasmic form of human gelsolin. The cDNA sequence also codes for a peptide of 33 residues N-terminal to the nine-residue plasma extension sequence previously reported: these 33 residues are highly homologous to the human signal peptide and plasma extension. Comparison of the gelsolin sequences with chicken brush border villin, severin from Dictyostelium discoideum and fragmin from Physarum polycephalum shows a strong evolutionary relationship between all these proteins. There are six large repeating segments in gelsolin and villin, and three similar segments in severin and fragmin. Although these multiple repeats cannot be related to any known function of these actin-severing proteins, this superfamily of proteins appears to have evolved from an ancestral sequence of 120 to 130 amino acid residues.

7.
Initial human genome sequence analysis has revealed large segments of nearly identical sequence in particular chromosomal regions. The recent origin of these segments and their abundance (approximately 5%) has challenged investigators to elucidate their underlying mechanism and role in primate genome evolution. Although the precise fraction is unknown, some of these duplicated segments have recently been shown to be associated with rapid gene innovation and chromosomal rearrangement in the genomes of man and the great apes.

8.
EEGs recorded monopolarly at points F3, F4, O1 and O2 in 20 healthy subjects in six states (quiet wakefulness with open or closed eyes, spontaneous button presses at arbitrary moments, listening to clicks, and reaction to clicks by pressing the button at random or at equal intervals between stimuli) were processed by a computer program that transforms the raw EEG tracings into a sequence of stationary segments. The accumulated segments were divided into classes of "similar" ones by a two-stage cluster analysis procedure. In each lead, six types of segments were identified, forming populations of structural units of an individual human EEG. Four types were recorded all over the brain, and their spectra were closely similar across brain areas. The EEG of each individual was characterized by a certain combination of segment types, whose character remained practically unchanged across the subject's different states.

9.
Cloning and sequence analysis of a DNA complementary to the mRNA expressed in undifferentiated mouse F9 teratocarcinoma stem cells but disappearing rapidly after treatment with a tumor-promoting phorbol ester revealed it to be a 1.9-kilobase-pair cDNA encoding a protein of 323 amino acid residues. Computer-assisted analyses of the deduced amino acid sequence indicated that this protein contains a typical hydrophobic signal peptide consisting of 33 amino acid residues and six putative membrane-spanning segments. The deduced amino acid sequence, as a whole, bears no significant sequence homology to any previously described protein.

10.
Factor XIII is a plasma protein that participates in the final stages of blood coagulation. The complete amino acid sequence of the b subunit of human factor XIII was determined by a combination of cDNA cloning and amino acid sequence analysis. A lambda gt11 cDNA library prepared from human liver mRNA was screened with an affinity-purified antibody against the b subunit of human factor XIII. Nine positive clones were isolated from 2 × 10^6 phage and plaque-purified. The largest cDNA insert was sequenced and shown to contain 2180 base pairs coding for a portion of the leader sequence (19 amino acids), the mature protein (641 amino acids), a stop codon (TGA), a 3' noncoding region (187 nucleotides), and a poly(A) tail. When the b subunit of human factor XIII was digested with cyanogen bromide, nine peptides were isolated by gel filtration and reverse-phase high-performance liquid chromatography. Amino acid sequence analyses of these peptides were performed with an automated sequenator, and 299 amino acid residues were identified. These amino acid sequences were in complete agreement with the amino acid sequence predicted from the cDNA. The b subunit of factor XIII contained 10 repetitive homologous segments, each composed of about 60 amino acids and 4 half-cystine residues. Each of these repeated segments is a member of a family of repeats present in human beta 2-glycoprotein I, complement factor B, and haptoglobin alpha 1 chain. Three potential Asn-linked carbohydrate attachment sites were also identified in the b subunit of factor XIII.

11.
Structural repertoire of the human VH segments.   Cited by: 16 (self-citations: 0; by others: 16)
The VH gene segments produce the part of the VH domains of antibodies that contains the first two hypervariable regions. The sequences of 83 human VH segments with open reading frames, from several individuals, are currently known. It has been shown that these sequences are likely to form a high proportion of the total human repertoire and that an individual's gene repertoire produces about 50 VH segments with different protein sequences. In this paper we present a structural analysis of the amino acid sequences produced by the 83 segments. Particular residue patterns in the sequences of V domains imply particular main-chain conformations, canonical structures, for the hypervariable regions. We show that, in almost all cases, the residue patterns in the VH segments imply that the first hypervariable regions have one of three different canonical structures and that the second hypervariable regions have one of five different canonical structures. The different observed combinations of the canonical structures in the first and second regions mean that almost all sequences have one of seven main-chain folds. We describe, in outline, structures of the antigen binding site loops produced by nearly all the VH segments. The exact specificity of the loops is produced by (1) sequence differences in their surface residues, particularly at sites near the centre of the combining site, and (2) sequence differences in the hypervariable and framework regions that modulate the relative positions of the loops.

12.
Shestopalov BV. Tsitologiia, 2003, 45(7): 702-706
The calculation of protein three-dimensional structure from the amino acid sequence is a fundamental problem to be solved. This paper presents principles of the code theory of protein secondary structure, and their consequence, the amino acid code of protein secondary structure. The doublet code model of protein secondary structure, developed earlier by the author (Shestopalov, 1990), is part of this theory. The bases of the theory are: 1) the name secondary structure is assigned to the conformation stabilized only by the nearest (intraresidual) and middle-range (at a distance no more than that between residues i and i + 5) interactions; 2) the secondary structure consists of regular (alpha-helical and beta-structural) and irregular (coil) segments; 3) the alpha-helices, beta-strands and coil segments are encoded, respectively, by residue pairs (i, i + 4), (i, i + 2), (i, i + 1), according to the numbers of residues per period, 3.6, 2, 1; 4) all such pairs in the amino acid sequence are codons for elementary structural elements, or structurons; 5) the codons are divided into 21 types depending on their strength, i.e. their encoding capability; 6) overlaps of structurons of one and the same structure generate the longer segments of this structure; 7) overlapping of structurons of different structures is forbidden, and therefore selection of codons is required; the codon selection is hierarchic; 8) the code theory of protein secondary structure generates six variants of the amino acid code of protein secondary structure. There are two possible kinds of model construction based on the theory: the physical one, using physical properties of amino acid residues, and the statistical one, using results of statistical analysis of a large body of structural data.
Some evident consequences of the theory are: a) the theory can be used for calculating the secondary structure from the amino acid sequence as a partial solution of the problem of calculating protein three-dimensional structure from the amino acid sequence, and the calculated secondary structure and codon strength distribution can be used for simulating the next step of protein folding; b) one can propose that the same secondary structures can be folded into different tertiary structures and, vice versa, different secondary structures can be folded into the same tertiary structures, provided codon distributions are also considered; c) codons can be considered as the first elements of a language of protein three-dimensional structure.
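The residue-pair encoding in point 3 can be illustrated with a short sketch that lists, for a given sequence, every candidate pair for each structure type. It reads the coil pair as (i, i + 1), in line with the stated period of 1. The classification of codons into 21 strength types and the hierarchic selection step are not specified in the abstract and are deliberately left out; the names `OFFSETS` and `candidate_codons` are hypothetical.

```python
# Pair offsets taken from the abstract: helix (i, i + 4), strand (i, i + 2),
# coil (i, i + 1). Codon strength and hierarchic selection are NOT modelled.
OFFSETS = {"helix": 4, "strand": 2, "coil": 1}


def candidate_codons(sequence):
    """List (position, residue_i, residue_j) pairs for each structure type."""
    return {
        structure: [
            (i, sequence[i], sequence[i + offset])
            for i in range(len(sequence) - offset)
        ]
        for structure, offset in OFFSETS.items()
    }


if __name__ == "__main__":
    for structure, pairs in candidate_codons("ACDEFG").items():
        print(structure, pairs)
```

A full model in the sense of the abstract would then have to score these pairs and resolve conflicts between overlapping pairs of different structure types.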

13.
Transforming growth factor-alpha (TGFA) has been proposed as a candidate gene in the etiology of nonsyndromic cleft lip with or without cleft palate (NS-CL/P) and of nonsyndromic cleft palate only (NS-CPO). Biologic support for a role of TGFA arises from its presence at high levels in the epithelial tissue of the medial edge of the palatal shelves at the time of shelf fusion in mice. Genetic support for the role of TGFA in clefting comes from the reported association of TGFA alleles with human NS-CPO and NS-CL/P. In this study we report the sequence and structure of human genomic TGFA and the search for causal TGFA mutations in 250 individuals with NS-CL/P or NS-CPO by conformational analysis of the coding sequence, splice junctions, and a portion of the 3' untranslated region strongly homologous between human and mouse. We confirm that human TGFA is composed of six exons and here report several new sequence substitutions and their frequencies. Five variants in conserved segments may represent rare causes for clefting in humans and provide support for the role of TGFA in facial morphogenesis.

14.
E Dassa, M Hofnung. The EMBO Journal, 1985, 4(9): 2287-2293
The MalG protein is needed for the transport of maltose in Escherichia coli K12. We present the sequence of gene malG. The deduced amino acid sequence corresponds to a protein of 296 amino acid residues (mol. wt. = 32 188 daltons). This protein is largely hydrophobic (hydrophobic index = 0.83) and is thus presumably an integral inner membrane protein which could span the membrane through six hydrophobic segments. We provide direct evidence from fusion proteins for the translation frame and we also identified the MalG protein made in vitro. We have found a sequence which is highly conserved between MalG and MalF, the other integral inner membrane protein of the maltose transport system. This conserved sequence is also present in all known integral membrane proteins of binding protein-dependent transport systems, always at the same distance (approximately 90 residues) from their COOH terminus. We briefly discuss this finding.

15.
16.
An average-sized human Y chromosome contains about 3,000 copies of the repeating DNA family DYZ1. A major repeating unit of the family, pHY10, has been cloned and its entire 3,564-bp sequence has already been determined by Nakahori et al. (1986). In the present study, pHY10 was divided into six consecutive segments, A to F, which were independently amplified by the PCR technique to see if they were strictly male-specific. pHY10 appears to consist of segments of varying male specificity. The B segment was apparently male-specific; however, the use of additional techniques (Southern-blot analysis or second PCR amplification in combination with the standard PCR) revealed homologous sequences in some females. None of the six segments of pHY10 may be male-specific in a strict sense. Different segments appear to be conserved during evolution to different extents. The 323-bp E segment appears to be the least conserved and to be responsible for the generation of most variations within the DYZ1 family.

17.
Composite human VK genes and a model of their evolution.   Cited by: 17 (self-citations: 9; by others: 8)
A phage library and two cosmid libraries were screened for human VK genes. Two recombinant phage and four cosmid clones were analysed in detail by restriction mapping and sequencing. Each one contained a single VKI sequence. Two of these six sequences are potentially functional VK genes and four are pseudogenes. Two pseudogenes derived from different genomic DNAs are highly homologous and are therefore either allelic variants or the products of a recent duplication event. Comparisons of our sequences with all fully determined human VKI amino acid and DNA sequences reveal identical segments which at first sight appear like minigenes. But these segments do not coincide with the subregions, and some of the segments include both framework and complementarity-determining regions (FR, CDR; ref. 2). The findings may be explained by an evolutionary model generating composite genes by gene conversion and selection.

18.
From the spectrin gene to the assembly of the membrane skeleton   Cited by: 1 (self-citations: 0; by others: 1)
The complete nucleotide sequence coding for the chicken brain alpha-spectrin was determined. It comprises the entire coding frame and the 5'- and 3'-untranslated sequences, terminating in a poly(A) tail. The deduced amino acid sequence shows that the alpha-chain contains 22 segments, 20 of which correspond to the typical 106 residue repeat of the human erythrocyte spectrin. Some segments non-homologous to the repeat structure reside in the middle and COOH-terminal regions. Sequence comparisons with other proteins show that these segments evidently harbour some structural and functional features, such as homology to alpha-actinin and dystrophin, two typical EF-hand structures (calcium-binding) and a putative calmodulin-binding site in the COOH-terminus, and a sequence homologous to various src-tyrosine kinases and to phospholipase C in the middle of the molecule. Comparison of our sequence with other partial alpha-spectrin sequences shows that alpha-spectrin is well conserved in different species and that the human erythrocyte alpha-spectrin is divergent.

19.
The sequence relationships of a range of cultivable and noncultivable human and animal rotaviruses were investigated by hybridization of rotavirus cDNA probes to genomic RNAs immobilized on diazobenzyloxymethyl paper. Under conditions of low stringency (34% base mismatch tolerated) most genome segments exhibited partial homology except for genes 4 and 5. In contrast, under more stringent conditions of hybridization in which no more than 8% base mismatch was tolerated, few segments exhibited homology. Generally the human and animal rotaviruses were found to possess distinct nucleic acid sequences that exhibit only a low order of sequence relatedness. These results are consistent with the notion that both cumulative changes in nucleic acid sequences and the interchange of segments may be involved in the evolution of distinct rotavirus strains.

20.
DNA sequence analysis reveals the gene encoding human neutrophil elastase to be contained on a 6-kb EcoRI fragment. The gene contains five exons and closely resembles rat mast cell proteinase II and mouse adipsin in its exon structure and intron splice phase. Non-coding regions are very rich in repetitive DNA, containing seven Alu-like segments, three distinct clustered direct repeats with monomer lengths of 53 (six repeats), 23 (three repeats) and 41 (ten repeats) nucleotides, and a 200-nucleotide AT-rich region. Protein sequence analysis, inferred from the coding regions of the gene, indicates that neutrophil elastase may contain an unusual activation peptide similar to that found in the other major neutrophil serine proteinase, cathepsin G.
