Similar articles
 Found 20 similar articles; search took 15 ms
1.
The proteins of the X-tox family have imperfectly conserved tandem repeats of several defensin-like motifs known as cysteine-stabilized αβ (CS-αβ) motifs. These immune-related proteins are inducible and expressed principally in hemocytes, but they have lost the antimicrobial properties of the ancestral defensins from which they evolved. We compared x-tox gene structure and expression in three lepidopteran species (Spodoptera frugiperda, Helicoverpa armigera and Bombyx mori). Synteny and phylogenetic analyses showed that the x-tox exons encoding CS-αβ motifs were phylogenetically closely related to defensin genes mapping to chromosomal positions close to the x-tox genes. We were able to define two groups of paralogous x-tox exons (three in Noctuids) that each followed the expected species tree. These results suggest that the ancestor of the three species already possessed an x-tox gene with at least two proto-domains, and that an additional duplication/fusion occurred in the ancestor of the two noctuid species. An expansion of the number of exons subsequently occurred in each lineage. Alternatively, the proto x-tox gene possessed more copies, and each group of x-tox domains might have undergone concerted evolution through gene conversion. Accelerated protein evolution was detected in x-tox domains when compared to related defensins, concomitant with the multiplication of exons and/or the possible activation of concerted evolution. The x-tox genes of the three species have similar structural organizations, with repeat motifs composed of CS-αβ-encoding exons flanked by introns in phase 1. Diverse mechanisms underlie this organization: (i) the acquisition of new repeat motifs, (ii) the duplication of preexisting repeat motifs and (iii) the duplication of modules. A comparison of gDNA and cDNA structures showed that alternative splicing results in the production of multiple X-tox protein isoforms from the x-tox genes. Differences in the number and sequence of CS-αβ motifs in these isoforms were found between species, but also between individuals of the same species. Thus, our analysis of the genetic organization and expression of x-tox genes in three lepidopteran species suggests a rapid evolution of the organization of these genes.

2.
Complete structure of the chicken alpha 2(VI) collagen gene   Total citations: 4, self-citations: 0, citations by others: 4
Type VI collagen is a hybrid molecule consisting of a short triple helix flanked by two large globular domains. These globular domains are composed of several homologous repeats which show a striking similarity to the collagen-binding motifs found in von Willebrand factor. The alpha 2(VI) subunit contains three of these homologous repeats termed D1, D2 and D3. We have isolated and characterized the entire gene for chicken alpha 2(VI) collagen. This gene, which is present as a single copy in the chicken genome, is 26 kbp long and comprises 28 exons. All exons can be classified in three groups. (a) The triple-helical domain is encoded by 19 short exons (27-90 bp) separated by introns of phase class 0. These exons are multiples of 9 bp and encode an integral number of collagenous Gly-Xaa-Yaa triplets. (b) The homologous repeats D1-D3 are encoded by one or two very long exons each (153-1578 bp). These exons are separated by introns of phase class 1. (c) The homologous repeats and the collagen sequence are linked to each other by three short adapter segments which are each encoded by a single exon (21-46 bp). The modular nature of the polypeptide is thus clearly reflected by the mosaic structure of its gene. The size of the exons and the phase class of the introns suggest that the alpha 2(VI) gene evolved by duplication and shuffling of two different primordial exons, one of 9 bp encoding a collagen Gly-Xaa-Yaa triplet and one of 600 bp encoding the precursor of the homologous repeats.
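The link between exon length and intron phase class described in this abstract can be illustrated with a small sketch. It assumes that the reading frame starts at the first base of the first exon listed; the function names and the example exon lengths are hypothetical and are not taken from the paper.

```python
def intron_phases(coding_exon_lengths):
    """Return the phase of the intron that follows each exon except the last.

    Phase is the number of bases of the interrupted codon that lie upstream
    of the intron: 0 (between codons), 1 or 2.
    """
    phases = []
    coding_bases_so_far = 0
    for length in coding_exon_lengths[:-1]:
        coding_bases_so_far += length
        phases.append(coding_bases_so_far % 3)
    return phases


def encodes_whole_triplets(exon_length, unit=9):
    """True if the exon is a whole number of 9 bp Gly-Xaa-Yaa units."""
    return exon_length % unit == 0


# Hypothetical triple-helix exons within the 27-90 bp range quoted above.
helix_exons = [27, 45, 54, 63, 90]
print(all(encodes_whole_triplets(n) for n in helix_exons))
print(intron_phases(helix_exons))
```

Because every exon in this toy list is a multiple of 9 bp (and therefore of 3 bp), each intron falls between codons, which is what phase class 0 means.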

3.
The genes for alpha-fetoprotein and albumin arose by duplication of an ancestral gene that contained three genetic domains. These domains were generated by the triplication of a primordial genetic domain composed of five exons or subdomains. That the primordial domain itself arose by amplification of a simpler sequence is suggested by nucleotide sequence homologies among the subdomains of the mouse alpha-fetoprotein gene. A detailed analysis of these homologies reveals that each of the five subdomain families contains remnants of a 27-base-long repeat from which the entire alpha-fetoprotein coding sequence has been assembled. A consensus sequence for the 27 nucleotide repeat is derived, and the positions of the repeats within each subdomain are described. A model is proposed for the evolution of the primordial domain by the amplification and divergence of the 27 base-pair sequence, along with the condensation of the repeats into subdomains separated by intervening sequences. It is postulated that the role of intervening sequences may be to limit sequence amplification in genes such as alpha-fetoprotein and albumin whose protein products cannot tolerate size variation.

4.
We report the complete sequence of the gene encoding mouse glial fibrillary acidic protein (GFAP), the intermediate filament (IF) protein specific to astrocytes. The 9.8 kb gene includes nine exons separated by introns ranging in size from 0.2 to 2.5 kb. A comparison of the organization of the GFAP gene with that of genes encoding other IF proteins reveals that the structure of IF genes is highly conserved in spite of considerable divergence at the amino acid level. Thus, most of the evolutionary events leading to the placement of introns in IF genes must have occurred prior to the duplication and subsequent divergence of IF genes from a presumptive common ancestral sequence. The conserved gene organization is unrelated to structural features of IF proteins. A curious feature of the GFAP gene is the large number of repeated sequences found in the introns. Six tracts of reiterated di- or trinucleotides are present, plus tandem repeats of two different novel sequences. One repeat is unique to the GFAP gene; the other occurs elsewhere in the mouse genome, although at relatively low frequency.

5.
The structural organization of the two closely related vitellogenin genes A1 and A2 has been determined and compared by electron microscopy. In both genes the mRNA-coding sequence of 6 kb is interrupted 33 times, leading to a total gene length of 21 kb for gene A1 and 16 kb for gene A2. Thus both genes have a mean exon length of 0.175 kb, while the mean intron length is 0.45 kb in gene A1 and 0.31 kb in gene A2. Because the introns interrupt the structural sequence at homologous positions in genes A1 and A2, we suggest that these two genes are the products of a duplication of an ancestral gene which had an intron-exon arrangement similar to that of the extant genes. Since the duplication event, the sequence and length of the analogous introns have changed rapidly, whereas homologous exons have diverged to an extent of only 5% of their sequences. The results suggest different mechanisms of evolution for exons and introns. While the exons evolved primarily by point mutations, such mutations, as well as deletion, insertion and duplication events, were important in the evolution of the introns.
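The mean exon and intron lengths quoted in this abstract can be roughly re-derived from the rounded figures it gives. The sketch below assumes that 33 interruptions imply 34 exons and that the whole 6 kb of mRNA-coding sequence is exonic; the names are illustrative. The results agree with the published means only to within rounding, since the gene lengths are stated to the nearest kilobase.

```python
# Figures taken from the abstract; names are illustrative only.
MRNA_CODING_KB = 6.0            # summed exon length per gene
N_INTRONS = 33                  # "interrupted 33 times"
N_EXONS = N_INTRONS + 1         # n introns split a sequence into n + 1 exons
GENE_LENGTH_KB = {"A1": 21.0, "A2": 16.0}


def mean_lengths(gene_length_kb):
    """Return (mean exon length, mean intron length) in kb for one gene."""
    mean_exon = MRNA_CODING_KB / N_EXONS
    mean_intron = (gene_length_kb - MRNA_CODING_KB) / N_INTRONS
    return mean_exon, mean_intron


for name, length_kb in GENE_LENGTH_KB.items():
    exon_kb, intron_kb = mean_lengths(length_kb)
    print(f"{name}: mean exon {exon_kb:.3f} kb, mean intron {intron_kb:.2f} kb")
```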

6.
The gene for bovine interphotoreceptor retinoid-binding protein (IRBP) has been cloned, and its nucleotide sequence has been determined. The IRBP gene is about 11.6 kilobase pairs (kb) and contains four exons and three introns. It is transcribed into a large mRNA of approximately 6.4 kb and translated into a large protein of 145,000 daltons. To prove the identity of the genomic clone, we determined the protein sequence of several tryptic and cyanogen bromide fragments of purified bovine IRBP protein and localized them in the protein predicted from its nucleotide sequence. There is a 4-fold repeat structure in the protein sequence with 30-40% sequence identity and many conservative substitutions between any two of the four protein repeats. The third and fourth repeats are the most similar pair. All three of the introns in the IRBP gene fall in the fourth protein repeat. Two of the exons, the first and the fourth, are large, 3173 and 2447 bases, respectively. The introns are each about 1.5-2.2 kb long. The human IRBP gene has a sequence that is similar to one of the introns from the bovine gene. The unexpected gene structure and protein repeat structure in the bovine gene lead us to propose a model for the evolution of the IRBP gene.

7.
Concerted and divergent evolution within the rat gamma-crystallin gene family   Total citations: 11, self-citations: 0, citations by others: 11
The nucleotide sequences of six rat gamma-crystallin genes have been determined. All genes have the same mosaic structure: the first exons contain a relatively short (25 to 44 base-pair) 5' non-coding region and the first nine base-pairs of the coding sequence, the second exons encode protein motifs I and II, while protein motifs III and IV are encoded by the third exons. The third exons also contain a 60 to 67-base-pair long 3' non-coding region. In the gamma 1-2 gene, the splice acceptor site of the third exon has been shifted three base-pairs upstream. Hence, the protein product of this gene is one amino acid residue longer. The first introns, though varying in length from 85 to 100 base-pairs, are conserved in sequence. The second introns vary considerably in length (0.9 × 10³ to 1.9 × 10³ base-pairs) and sequence. The second exons of the genes show concerted evolution and have undergone multiple gene conversions. In contrast, the third exons show divergent evolution. From the sequences of the third exons, an evolutionary tree of the gene family was constructed. This tree suggests that three of the present genes derive directly from the genes that originated from a tandem duplication of a two-gene cluster. Two duplications of the last gene of the four-gene cluster then yielded the other three genes. Region a' of the third exon, encoding protein motif III, is variable, while the region encoding protein motif IV (b') is constant. We postulate that this variability in region a' is due to a period of radiation after each gene duplication. A comparison of the rat sequences with those of orthologous sequences from other species shows that the variation in region a' is now preserved. Hence, it might specify the specific functional property of each gamma-crystallin protein within the lens.

8.
The MDR1 gene, responsible for multidrug resistance in human cells, encodes a broad specificity efflux pump (P-glycoprotein). P-glycoprotein consists of two similar halves, each half including a hydrophobic transmembrane region and a nucleotide-binding domain. On the basis of sequence homology between the N-terminal and C-terminal halves of P-glycoprotein, we have previously suggested that this gene arose by duplication of a primordial gene. We have now determined the complete intron/exon structure of the MDR1 gene by direct sequencing of cosmid clones and enzymatic amplification of genomic DNA segments. The MDR1 gene includes 28 introns, 26 of which interrupt the protein-coding sequence. Although both halves of the protein-coding sequence are composed of approximately the same number of exons, only two intron pairs, both within the nucleotide-binding domains, are located at conserved positions in the two halves of the protein. The other introns occur at different locations in the two halves of the protein and in most cases interrupt the coding sequence at different positions relative to the open reading frame. These results suggest that the P-glycoprotein arose by fusion of genes for two related but independently evolved proteins rather than by internal duplication.

9.
Exogastrula-inducing peptides (EGIPs) were identified in embryos of the sea urchin Anthocidaris crassispina as polypeptides with structural similarity to epidermal growth factor (EGF) that severely affect gastrulation of sea urchin embryos to induce exogastrulation. Here we have obtained genomic clones for the EGIP precursor gene (EGIP) and determined its genomic organization. The EGIP gene spans 9 kb in the genome and is composed of seven exons and six introns. Each of the four EGF motifs in the precursor protein is encoded by a single exon, and all the exon boundaries are in phase 1, suggesting that EGIP has been generated during evolution by duplication of an exon encoding a single ancient EGIP sequence. The 5'-flanking sequence of EGIP from -4372 to +194 revealed the presence of multiple repeat sequences including direct and inverted repeats as well as two clusters of GGGG/CCCC elements. The function of the upstream flanking region of EGIP was examined by introducing into embryos gene constructs in which different regions of the flanking DNA were placed upstream of the GFP reporter gene. Systematic deletion of the upstream DNA revealed the presence of potent enhancer activity between -372 and -210.

10.
D Jenne, K K Stanley. Biochemistry 1987, 26(21): 6735-6742
The S-protein/vitronectin gene was isolated from a human genomic DNA library, and its sequence of about 5.3 kilobases including the adjacent 5' and 3' flanking regions was established. Alignment of the genomic DNA nucleotide sequence and the cDNA sequence indicated that the gene consisted of eight exons and seven introns. The intron positions in the S-protein gene and their phase type were compared to those in the hemopexin gene which shares amino acid sequence homologies with transin and the S-protein. Three introns have been found at equivalent positions; two other introns are very close to these positions and are interpreted as cases of intron sliding. Introns 3-7 occur at a conserved glycine residue within repeating peptide segments, whereas introns 1 and 2 are at the boundaries of the Somatomedin B domain of S-protein. The analysis of the exon structure in relation to repeating peptide motifs within the S-protein strongly suggests that it contains only seven repeats, one less than the hemopexin molecule. A repeat pattern very similar to that in hemopexin is also shown to be present in two other related proteins, transin and interstitial collagenase. An evolutionary model for the generation of the repeat pattern in the S-protein and the other members of this novel "pexin" gene family is proposed, and the sequence modifications for some of the repeats during divergent evolution are discussed in relation to known unique functional properties of hemopexin and S-protein.

11.
Computer-assisted sequence analysis was applied to detect the most apparent nonrandom sequence motifs in eukaryotic introns. We describe in detail a method, which we call distance analysis, that we applied to the extensive study of 405 eukaryotic intron sequences. We observed very strong two-base periodicities for almost all tetranucleotides that are tandem repeats of nonhomopolymeric dinucleotides (the exceptions were GCGC and CGCG). We also observed, by using a fixed-point alignment method, that these periodic sequence motifs belong to large clusters of dinucleotides repeated tandemly as many as 15–35 times, which corresponds to cluster lengths of 30–70 bases. We did not observe two-base periodicity of tetranucleotides in the collections of either 262 spliced eukaryotic exons or 107 bacterial genes. Instead, these sequences displayed strong three-base periodicity of some other tetranucleotides. These findings suggest that introns and exons display distinct sequence properties that can be used for mapping purposes.
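The abstract does not spell out how distance analysis is computed. As a rough illustration of the general idea only, the sketch below tallies the distances between occurrences of one tetranucleotide and reports what fraction of those distances fall in each phase of a chosen period. All function names, the distance cut-off and the toy sequence are assumptions and should not be read as the authors' procedure.

```python
from collections import Counter


def occurrence_positions(seq, motif):
    """Start positions of every occurrence of motif, overlaps included."""
    positions = []
    start = seq.find(motif)
    while start != -1:
        positions.append(start)
        start = seq.find(motif, start + 1)
    return positions


def distance_histogram(seq, motif, max_distance=30):
    """Count distances (in bases) between pairs of motif occurrences."""
    positions = occurrence_positions(seq, motif)
    histogram = Counter()
    for i, first in enumerate(positions):
        for second in positions[i + 1:]:
            distance = second - first
            if distance > max_distance:
                break  # positions are sorted, so later ones are farther away
            histogram[distance] += 1
    return histogram


def phase_fractions(histogram, period):
    """Fraction of observed distances in each residue class modulo period."""
    total = sum(histogram.values())
    if total == 0:
        return [0.0] * period
    counts = [0] * period
    for distance, n in histogram.items():
        counts[distance % period] += n
    return [c / total for c in counts]


toy_intron = "TG" * 20  # hypothetical dinucleotide cluster
print(phase_fractions(distance_histogram(toy_intron, "TGTG"), 2))
```

In this toy case every distance between TGTG occurrences is even, so all of the weight falls in phase 0 of period 2; calling `phase_fractions` with `period=3` would be the analogous check for the three-base periodicity reported for exons.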

12.
Isolation and sequencing of three genes, MPAO1, MPAO2 and MPAO3, coding for polyamine oxidase (PAO) from maize (Zea mays) are reported here. Gene organization is extremely conserved among these copies, being composed of eight exons and seven introns. Furthermore, these genes encode proteins with almost identical amino acid sequences. These data suggest that the three MPAO copies have been derived from gene duplication of a common ancestor gene. Long inverted repeat sequences, also present in other maize genes, have been found within the second intron. Promoter sequences of MPAO1 and MPAO2 genes have been analysed for putative cis-acting elements. According to genomic Southern blot analysis, the MPAO gene family in maize and other monocots is represented by a small number of copies. Northern and western blot analyses have revealed a tissue-specific accumulation of both MPAO mRNA and protein.

13.
Structure and evolution of the bovine prothrombin gene   Total citations: 6, self-citations: 0, citations by others: 6
The cloned bovine prothrombin gene has been characterized by partial DNA sequence analysis, including the 5' and 3' flanking sequences and all the intron-exon junctions. The gene is approximately 15.4 × 10³ base-pairs in length and comprises 14 exons interrupted by 13 introns. The exons coding for the prepro-leader peptide and the gamma-carboxyglutamic acid-containing region are similar in organization to the corresponding exons in the factor IX and protein C genes. This region has probably evolved as a result of recent gene duplication and exon shuffling events. The exons coding for the kringles and the serine protease region of the prothrombin gene are different in organization from the homologous regions in other genes, suggesting that introns have been inserted into these regions after the initial gene duplication events.

14.
Nucleotide sequence of the gene for the b subunit of human factor XIII   Total citations: 9, self-citations: 0, citations by others: 9
R E Bottenus, A Ichinose, E W Davie. Biochemistry 1990, 29(51): 11195-11209
Factor XIII (Mr 320,000) is a blood coagulation factor that stabilizes and strengthens the fibrin clot. It circulates in blood as a tetramer composed of two a subunits (Mr 75,000 each) and two b subunits (Mr 80,000 each). The b subunit consists of 641 amino acids and includes 10 tandem repeats of 60 amino acids known as GP-I structures, short consensus repeats (SCR), or sushi domains. In the present study, the human gene for the b subunit has been isolated from three different genomic libraries prepared in lambda phage. Fifteen independent phage with inserts coding for the entire gene were isolated and characterized by restriction mapping, Southern blotting, and DNA sequencing. The gene was found to be 28 kilobases in length and consisted of 12 exons (I-XII) separated by 11 intervening sequences. The leader sequence was encoded by exon I, while the carboxyl-terminal region of the protein was encoded by exon XII. Exons II-XI each coded for a single sushi domain, suggesting that the gene evolved through exon shuffling and duplication. The 12 exons in the gene ranged in size from 64 to 222 base pairs, while the introns ranged in size from 87 to 9970 nucleotides and made up 92% of the gene. The introns contained four Alu repetitive sequences, one each in introns A, E, I, and J. A fifth Alu repeat was present in the flanking 3' end of the gene. Two partial KpnI repeats were also found in the introns, including one in intron I and one in intron J. The KpnI repeat in intron J was 89% homologous to a sequence of approximately 2200 nucleotides flanking the gene coding for human beta globin and approximately 3800 nucleotides from the L1 insertion present in the gene for human factor VIII. Intron H also contained an "O" family repeat, while two potential regions for Z-DNA were identified within introns G and J. One nucleotide change was found in the coding region of the gene when its sequence was compared to that of the cDNA. This difference, however, did not result in a change in the amino acid sequence of the protein.

15.
16.
Two modes of evolution of repeated domains in proteins have been described: (1) a conservative mode, whereby individual domains are conserved across gene duplication and speciation events, and (2) a concerted mode, whereby repeat domains become homogenized within a gene, presumably by intragenic partial duplication and/or gene conversion. The evolution of repeated EGF-like and fibronectin-type-III-like (Fn-III) domains in the vertebrate extracellular matrix proteins tenascin-X (TNX) and tenascin-C (TNC) was studied by comparisons between human and mouse orthologs and between the paralogous TNC and TNX genes. The EGF-like repeats have largely been homogenized within each gene by concerted evolution since the duplication of the two genes but have been conserved since the divergence of rodents and primates. The Fn-III domains of TNC have likewise mainly evolved in a conservative fashion since the divergence of rodents and primates. In contrast, the Fn-III repeats of TNX fall into three distinct categories with regard to mode of evolution: (1) The three C-terminal repeats have been conserved since before duplication of the TNX and TNC genes. (2) Certain other repeats have been homogenized within each gene since gene duplication but have been conserved since the divergence of rodents and primates. (3) Still other repeats have evolved in a concerted fashion in rodent and primate lineages since their divergence. Remarkably, certain introns adjacent to the exons encoding these concertedly evolving Fn-III repeats have themselves evolved in a concerted fashion. This is the first known example of concerted evolution of repeated introns within a protein-coding gene.

17.
We have determined the genetic stability of three independent intragenic human HPRT gene duplications and the structure of each duplication at the nucleotide sequence level. Two of the duplications were isolated as spontaneous mutations from the HL60 human myeloid leukemia cell line, while the third was originally identified in a Lesch-Nyhan patient. All three duplications are genetically unstable and have a reversion rate approximately 100-fold higher than the rate of duplication formation. The molecular structures of these duplications are similar, with direct duplication of HPRT exons 2 and 3 and of 6.8 kb (HL60 duplications) or 13.7 kb (Lesch-Nyhan duplication) of surrounding HPRT sequence. Nucleotide sequence analyses of duplication junctions revealed that the HL60-derived duplications were generated by unequal homologous recombination between clusters of Alu repeats contained in HPRT introns 1 and 3, while the Lesch-Nyhan duplication was generated by the nonhomologous insertion of duplicated HPRT DNA into HPRT intron 1. These results suggest that duplication substrates of different lengths can be generated from the human HPRT exon 2-3 region and can undergo either homologous or nonhomologous recombination with the HPRT locus to form gene duplications.

18.
19.
Transforming acidic coiled-coil proteins (TACC1, 2, and 3) are essential proteins associated with the assembly of spindle microtubules and maintenance of bipolarity. Dysregulation of TACCs is associated with tumorigenesis, but studies of microsatellite instability in TACC genes have not been extensive. Microsatellite or simple sequence repeat (SSR) instability is known to cause many types of cancer. The present in silico analysis of SSRs in human TACC gene sequences shows the presence of mono- to hexa-nucleotide repeats, with the highest densities found for mono- and di-nucleotide repeats. Density of repeats is higher in introns than in exons. Some of the repeats are present in regulatory regions and retained introns. Human TACC genes show conservation of many repeat classes. Microsatellites in TACC genes could be valuable markers for monitoring numerical chromosomal aberrations and/or cancer.
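An in silico scan for mono- to hexa-nucleotide repeats of the kind mentioned in this abstract could look roughly like the following. The abstract does not name the tool or the repeat-count thresholds used, so the regular expression, the `min_repeats` default and the toy sequence below are assumptions made purely for illustration.

```python
import re


def find_ssrs(seq, min_repeats=4):
    """Return (start, unit, copies) for tandem repeats of 1-6 bp units.

    The lazy {1,6}? makes the regex try the shortest unit first, so a run
    of A's is reported as a mono-nucleotide repeat rather than as "AA"
    copies. Overlapping repeats are not reported.
    """
    pattern = re.compile(r"([ACGT]{1,6}?)\1{%d,}" % (min_repeats - 1))
    hits = []
    for match in pattern.finditer(seq.upper()):
        unit = match.group(1)
        copies = len(match.group(0)) // len(unit)
        hits.append((match.start(), unit, copies))
    return hits


toy_sequence = "TTGACACACACACGGTAAAAAAC"  # hypothetical, not a TACC sequence
for start, unit, copies in find_ssrs(toy_sequence):
    print(start, unit, copies)
```

Repeat density for a region could then be estimated as the number of hits divided by the region length, computed separately for introns and exons to make the comparison described above.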

20.
We have determined the nucleotide sequence of the rat apolipoprotein (apo-) A-IV gene and analyzed its structural and evolutionary relationships to the human apolipoprotein A-I, E, and C-III genes. The rat A-IV gene is 2.4 kilobases in size and consists of three exons (142, 126, and 1157 base pairs) interrupted by two introns (277 and 673 base pairs). The 5'-nontranslated region and most of the signal peptide are encoded by the first exon. Thus, the apo-A-IV gene lacks an intron in the 5'-nontranslated region of its mRNA in contrast to all other known apolipoprotein genes. Sequences coding for amphipathic docosapeptides span both the second and third exons of the rat A-IV gene. We demonstrate that this is also true for the human apolipoprotein genes. This gene family seems to have evolved by the duplication of an ancestral minigene that resulted in the formation of two exons. Thereafter, evolution of these sequences was dominated by intraexonic amplification of repeating units coding for amphipathic peptides. Sequence divergence of these repeats resulted in the functional differentiation of the apolipoproteins. However, conservation of the fundamental amphipathic pattern allowed members of this protein family to retain their lipid-binding properties.
