首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
In this work, we examined the genetic diversity and evolution of the WAG-2 gene based on new WAG-2 alleles isolated from wheat and its relatives. Only single nucleotide polymorphisms (SNP) and no insertions and deletions (indels) were found in exon sequences of WAG-2 from different species. More SNPs and indels occurred in introns than in exons. For exons, exons+introns and introns, the nucleotide polymorphism π decreased from diploid and tetraploid genotypes to hexaploid genotypes. This finding indicated that the diversity of WAG-2 in diploids was greater than in hexaploids because of the strong selection pressure on the latter. All dn/ds ratios were < 1.0, indicating that WAG-2 belongs to a conserved gene affected by negative selection. Thirty-nine of the 57 particular SNPs and eight of the 10 indels were detected in diploid species. The degree of divergence in intron length among WAG-2 clones and phylogenetic tree topology suggested the existence of three homoeologs in the A, B or D genome of common wheat. Wheat AG-like genes were divided into WAG-1 and WAG-2 clades. The latter clade contained WAG-2, OsMADS3 and ZMM2 genes, indicating functional homoeology among them.  相似文献   

2.
3.
4.
A heterotrimeric GTP-binding protein (G protein) plays a number of important roles in the signal-transduction pathways of eukaryotic cells. The allotetraploid tobacco genome has two alpha-subunit genes, NtGA1 and NtGA2, of the heterotrimeric G protein. In this study, we determined the nucleotide sequences and the exon-intron structures of the NtGA loci in tobacco and its ancestral diploid species. The genomic sequences of the NtGA loci were interrupted by 13 introns. The sizes of most exons (12 of 14) were completely conserved among the NtGA genes and the Arabidopsis alpha-subunit gene (GPA1), but most introns (11 of 13) in the NtGA genes were longer than those in GPA1. In comparison with the genomic sequences of the NtGA orthologues of ancestral Nicotiana sylvestris and Nicotiana tomentosiformis, the tobacco NtGA1 and NtGA2 were concluded to be homoeologous and assigned to the S and T genomes, respectively. More than 300 mutations including insertions-deletions (indels) and nucleotide substitutions were found in the intron regions between the NtGA1 and NtGA2 loci, whereas the exon sequences were highly conserved among these and GPA1. The structural comparison revealed larger divergence at the NtGA2 locus than at NtGA1.  相似文献   

5.
Retrotransposons constitute the majority of pseudogenic protein coding regions of most eukaryotic genomes. Most genomes carry tens to thousands of retrotransposon copies derived from dozens of distinct families, but most if not all of these copies are non-functional and contain disabling mutations, including large numbers of indels. Until recently, most regions rich in these elements were virtually ignored in all but the most complete genome sequencing projects, and the full extent of their impact on the structure and function of the genomes of higher eukaryotes was under-appreciated. Even when new retrotransposons are encountered and annotated by automated gene finding programs and similarity searches, coding regions are treated as exons and invariably and not surprisingly mistranslated because of numerous frameshift mutations and large indels. Very few functional retrotransposons contain introns, as in silico annotations imply. While many repetitive DNA consensus sequences have been assembled from collections of largely full-length copies using full-length templates, we have shown that repetitive DNA consensus sequence contigs representing long, moderately high copy-number elements can also be generated ex novo in the absence of templates from very short overlapping sequences. We have devised an in silico strategy to recover and reconstruct consensus sequences of elements up to 20,000 bp by building dense contigs of hundreds of overlapping 400 to 900-bp records found in the Genbank Genome Survey Sequence database. The results are hypothetical ancestral sequences that encode elements that appear to be fully functional with intact open reading frames and other conserved features.  相似文献   

6.
7.
J J Bao  R N Sifers  V J Kidd  F D Ledley  S L Woo 《Biochemistry》1987,26(24):7755-7759
alpha 1-Antichymotrypsin belongs to a supergene family that includes alpha 1-antitrypsin, antithrombin III, ovalbumin, and angiotensinogen. The human chromosomal alpha 1-antichymotrypsin gene has been cloned and its molecular structure established. The gene is approximately 12 kb in length and contains five exons and four introns. The locations of the introns within the alpha 1-antichymotrypsin gene are identical with those of the human alpha 1-antitrypsin and angiotensinogen genes. Other members of this supergene family contain introns located at nonhomologous positions of the genes. The homologous organization of the alpha 1-antichymotrypsin and alpha 1-antitrypsin genes corresponds with the high degree of homology between their protein sequences and suggests that these loci arose by recent gene duplication. A model is presented for the evolution of both the genomic structure and the protein sequences of the serine protease inhibitor superfamily.  相似文献   

8.
A low level of genetic variation has limited the application of molecular markers for characterizing important traits in cultivated tomato. To detect polymorphisms in tomato conserved ortholog sets (COS), expressed sequence tags (ESTs) were searched against tomato and Arabidopsis genomic sequences to define the positions of introns. Introns were amplified from 12 different accessions of tomato by polymerase chain reaction and nucleotide sequences were determined by sequencing. Results indicated that there was a possibility of 71% to amplify introns from tomato genomic DNA through this approach. A total of 201 introns were sequenced from 86 COS unigenes. The intron positions and numbers were conserved between tomato and Arabidopsis, but average intron length was three times longer in tomato than in Arabidopsis. A total of 307 single nucleotide polymorphisms (SNPs) and 75 indels were detected in introns of 57 COS unigenes among 12 tomato lines. Within cultivated tomato germplasm 172 SNPs and 47 indels were detected in introns of 33 COS unigenes. In addition, 41 SNPs were identified in the exons of 27 COS unigenes. The frequency of SNPs was 2.4 times higher in introns than in exons in the 22 COS unigenes having both intronic and exonic polymorphisms. These results indicate that intronic regions may contain sufficient variation to develop sufficient marker resources for genome-wide analysis in cultivated tomato.  相似文献   

9.

Background

Members of the predatory gastropod genus Conus use a venom comprised of a cocktail of peptide neurotoxins, termed conotoxins or conopeptides, to paralyze prey and conotoxin gene family members diversify via strong positive selection. Because Conus venoms are used primarily to subdue prey, the evolution of venoms is likely affected by predator-prey interactions.

Methodology/Principal Findings

To identify the selective forces that drive the differentiation of venoms within species of Conus, we examined the distribution of alleles of a polymorphic O-superfamily conotoxin locus of Conus ebraeus at Okinawa, Guam and Hawaii. Previous analyses of mitochondrial cytochrome oxidase I gene sequences suggest that populations of C. ebraeus, a worm-eating Conus, are not structured genetically in the western and central Pacific. Nonetheless, because the sample size from Guam was relatively low, we obtained additional data from this location and reexamined patterns of genetic variation at the mitochondrial gene at Okinawa, Guam and Hawaii. We also utilized a DNA-based approach to identify prey items of individuals of C. ebraeus from Guam and compared this information to published data on diets at Okinawa and Hawaii. Our results show that conotoxin allelic frequencies differ significantly among all three locations, with strongest differentiation at Hawaii. We also confirm previous inferences that C. ebraeus exhibits no genetic differentiation between Okinawa, Guam and Hawaii at the mitochondrial locus. Finally, DNA-based analyses show that eunicid polychaetes comprise the majority of the prey items of C. ebraeus at Guam; while this results compares well with observed diet of this species at Okinawa, C. ebraeus preys predominantly on nereid polychaetes at Hawaii.

Conclusions/Significance

These results imply that strong selection pressures affect conotoxin allelic frequencies. Based on the dietary information, the selection may derive from geographic variation in dietary specialization and local coevolutionary arms races between Conus and their prey.  相似文献   

10.
A combination of three independent biological features, genomic organization, diagnostic amino acid sites, and rare indels, was used to elucidate the phylogeny of the vertebrate serpin (serine protease inhibitor) superfamily. A strong correlation between serpin gene families displaying (1) a conserved exon-intron pattern and (2) family-specific combinations of amino acid residues at specific sites suggests that present-day vertebrates encompass six serpin gene families which evolved from primordial genes by massive intron insertion before or during early vertebrate radiation. Introns placed at homologous positions in the gene sequences in combination with diagnostic sequence characters may also constitute a reliable kinship indicator for other protein superfamilies.  相似文献   

11.
Calcium vector protein (CaVP) is an EF-hand Ca(2+)-binding protein, which is unique to the protochordate, amphioxus. CaVP is supposed to act as a Ca(2+) signal transductor, but its exact function remains unknown. Not only its function but also its exact evolutionary relationship to other Ca(2+)-binding proteins is unclear. To investigate the evolution of CaVP, we have determined the complete sequences of CaVP cDNAs from two amphioxus species, Branchiostoma lanceolatum and B. floridae, whose open reading frame cDNA and amino acid sequences show 96.5 and 98.2% identity, respectively. We have also elucidated the structure of the gene of B. floridae CaVP, which is made up of seven exons and six introns. The positions of four of the six introns (introns 1, 2, 3, and 5) are identical with those of calmodulin, troponin C, and the Spec protein of the sea urchin. These latter proteins belong to the so-called troponin C superfamily (TnC superfamily) and thus CaVP likely also belongs to this family. Intron 6 is positioned in the 3' noncoding region and is unique to CaVP, so it may represent a landmark of the CaVP lineage only. The position of intron 4 is not conserved in the genes of the TnC superfamily or CaVP, and seems to result from either intron sliding or the addition of an intron (randomly inserted into or close to domain III) to the genes of the TnC superfamily during their evolution.  相似文献   

12.
Previous analyses of the alpha-esterase cluster of Drosophila melanogaster revealed 10 active genes and the DmalphaE4a-Psi pseudogene. Here, we reconstruct the evolution of the pseudogene from the sequences of 12 alleles from widely scattered D. melanogaster populations and single alleles from Drosophila simulans and Drosophila yakuba. All of the DmalphaE4a-Psi alleles contain numerous inactivating mutations, suggesting that pseudogene alleles are fixed in natural populations. Several lines of evidence also suggest that DmalphaE4a is now evolving without selective constraint in the D. melanogaster lineage. There are three polymorphic indels which result in frameshifts; a key nucleotide of the intron splice acceptor is polymorphic; the neutral mutation parameter is the same for replacement and silent sites; one of the nonsilent polymorphisms results in a stop codon; only 1 of the 13 replacement polymorphisms is biochemically conservative; residues that are conserved among active esterases have different states in DmalphaE4a-Psi; and there are about half as many transitional polymorphisms as transversional ones. In contrast, the D. simulans and D. yakuba orthologs DsalphaE4a and DyalphaE4a do not have the inactivating mutations of DmalphaE4a-Psi and appear to be evolving under the purifying selection typical of protein- encoding genes. For instance, there have been more substitutions in the introns than in the exons, and more in silent sites than in replacement sites. Furthermore, most of the amino acid substitutions that have occurred between DyalphaE4a and DsalphaE4a are located in sites that typically vary among active alpha-esterases rather than those that are usually conserved. We argue that the original alphaE4a gene had a function which it has lost since the divergence of the D. melanogaster and D. simulans lineages.  相似文献   

13.
14.
15.
Concerted and divergent evolution within the rat gamma-crystallin gene family   总被引:11,自引:0,他引:11  
The nucleotide sequences of six rat gamma-crystallin genes have been determined. All genes have the same mosaic structure: the first exons contain a relatively short (25 to 44 base-pair) 5' non-coding region and the first nine base-pairs of the coding sequence, the second exons encode protein motifs I and II, while protein motifs III and IV are encoded by the third exons. The third exons also contain a 60 to 67-base-pair long 3' non-coding region. In the gamma 1-2 gene, the splice acceptor site of the third exon has been shifted three base-pairs upstream. Hence, the protein product of this gene is one amino acid residue longer. The first introns, though varying in length from 85 to 100 base-pairs, are conserved in sequence. The second introns vary considerably in length (0.9 X 10(3) to 1.9 X 10(3) base-pairs) and sequence. The second exons of the genes show concerted evolution and have undergone multiple gene conversions. In contrast, the third exons show divergent evolution. From the sequences of the third exons, an evolutionary tree of the gene family was constructed. This tree suggests that three of the present genes derive directly from the genes that originated from a tandem duplication of a two-gene cluster. Two duplications of the last gene of the four-gene cluster then yielded the other three genes. Region a' of the third exon, encoding protein motif III, is variable, while the region encoding protein motif IV (b') is constant. We postulate that this variability in region a' is due to a period of radiation after each gene duplication. A comparison of the rat sequences with those of orthologous sequences from other species shows that the variation in region a' is now preserved. Hence, it might specify the specific functional property of each gamma-crystallin protein within the lens.  相似文献   

16.
Nuclear protein-coding genes of euglenids (Discoba, Euglenozoa, Euglenida) contain conventional (spliceosomal) and nonconventional introns. The latter have been found only in euglenozoans. A unique feature of nonconventional introns is the ability to form a stable and slightly conserved RNA secondary structure bringing together intron ends and placing adjacent exons in proximity. To date, little is known about the mechanism of their excision (e.g. whether it involves the spliceosome or not). The tubA gene of Euglena gracilis harbors three conventional and three nonconventional introns. While the conventional introns are excised as lariats, nonconventional introns are present in the cell solely as circular RNAs with full-length ends. Based on this discovery as well as on previous observations indicating that nonconventional introns are observed frequently at unique positions of genes, we suggest that this new type of intronic circRNA might play a role in intron mobility.  相似文献   

17.
The structural organization of the two closely related vitellogenin genes A1 and A2 has been determined and compared by electron microscopy. In both genes the mRNA-coding sequence of 6 kb is interrupted 33 times, leading to a total gene length of 21 kb for gene A1 and 16 kb for gene A2. Thus both genes have a mean exon length of 0.175 kb, while the mean intron length is 0.45 kb in gene A1 and 0.31 kb in gene A2. Because the introns interrupt the structural sequence at homologous positions in genes A1 and A2, we suggest that these two genes are the products of a duplication of an ancestral gene which had an intron-exon arrangement similar to that of the extant genes. Since the duplication event, the sequence and length of the analogous introns have changed rapidly, whereas homologous exons have diverged to an extent of only 5% of their sequences. The results suggest different mechanisms of evolution for exons and introns. While the exons evolved primarily by point mutations, such mutations, as well as deletion, insertion and duplication events, were important in the evolution of the introns.  相似文献   

18.
The conotoxin proteins are disulfide rich small peptides that target ion channels and G protein coupled receptors. And they provide promising application in treating some chronic pain, epilepsy, cardiovascular diseases, and so on. Conotoxins may be classified into 11 superfamilies: A, D, I1, I2, J, L, M, O, P, S, and T according to the disulfide connectivity, highly conserved N-terminal precursor sequence and similar mode of actions. Successful prediction mature conotoxin superfamily peptide has important signification for the biological and pharmacological functions of the toxins. In this study, a new algorithm of increment of diversity combined with modified Mahalanobis discriminant is presented to predict five superfamilies by using the pseudo amino acid composition. The results of jackknife cross-validation test show that the overall prediction sensitivity and specificity are 88% and 91%, respectively. The predictive algorithm is also used to predict three O-conotoxin families. The 72% sensitivity and 78% specificity are obtained. These results indicate that the conotoxin superfamily peptides correlate with their amino acid compositions.  相似文献   

19.
Kim JY  Mahé A  Guy S  Brangeon J  Roche O  Chourey PS  Prioul JL 《Gene》2000,245(1):89-102
Two maize putative cell-wall invertase genes (Incw3 and Incw4) have been isolated by screening a genomic DNA library (Zea mays L. W22) using the cDNA probes encoding the two maize cell-wall invertases Incw1 and Incw2. The Incw3 and Incw4 genes contain six exons/five introns and five exons/four introns, respectively. The protein sequences deduced from both genes revealed a beta-fructosidase motif and a cysteine catalytic site known to be conserved in invertase genes. A detailed analysis of the protein and nucleotide sequences provides evidence that the Incw3 and the Incw4 genes encode putative cell-wall invertases. Furthermore, the isoelectric point deduced from the INCW4 protein sequence suggested that the Incw4 gene may encode a unique type of cell-wall invertase unbound in the apoplast. Gene expression studies using RT-PCR and in-situ RT-PCR hybridization showed that the Incw3 expression is organ/tissue-specific and developmentally regulated. In contrast, the Incw4 gene is constitutively expressed in all vegetative and reproductive tissues tested.  相似文献   

20.

Background  

Alternative splicing (AS) is a key molecular process that endows biological functions with diversity and complexity. Generally, functional redundancy leads to the generation of new functions through relaxation of selective pressure in evolution, as exemplified by duplicated genes. It is also known that alternatively spliced exons (ASEs) are subject to relaxed selective pressure. Within consensus sequences at the splice junctions, the most conserved sites are dinucleotides at both ends of introns (splice dinucleotides). However, a small number of single nucleotide polymorphisms (SNPs) occur at splice dinucleotides. An intriguing question relating to the evolution of AS diversity is whether mutations at splice dinucleotides are maintained as polymorphisms and produce diversity in splice patterns within the human population. We therefore surveyed validated SNPs in the database dbSNP located at splice dinucleotides of all human genes that are defined by the H-Invitational Database.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号