首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 421 毫秒
1.
In this work, we examined the genetic diversity and evolution of the WAG-2 gene based on new WAG-2 alleles isolated from wheat and its relatives. Only single nucleotide polymorphisms (SNP) and no insertions and deletions (indels) were found in exon sequences of WAG-2 from different species. More SNPs and indels occurred in introns than in exons. For exons, exons+introns and introns, the nucleotide polymorphism π decreased from diploid and tetraploid genotypes to hexaploid genotypes. This finding indicated that the diversity of WAG-2 in diploids was greater than in hexaploids because of the strong selection pressure on the latter. All dn/ds ratios were < 1.0, indicating that WAG-2 belongs to a conserved gene affected by negative selection. Thirty-nine of the 57 particular SNPs and eight of the 10 indels were detected in diploid species. The degree of divergence in intron length among WAG-2 clones and phylogenetic tree topology suggested the existence of three homoeologs in the A, B or D genome of common wheat. Wheat AG-like genes were divided into WAG-1 and WAG-2 clades. The latter clade contained WAG-2, OsMADS3 and ZMM2 genes, indicating functional homoeology among them.  相似文献   

2.
In Arabidopsis thaliana, primary metabolic genes (PMGs) are more evolutionarily conserved and intron-rich than secondary metabolic genes. We observed that PMGs are more primitive and pan-taxonomically persistent as compared to secondary (SMGs) and non-metabolic genes (NMGs). This difference in primitiveness and persistence is primarily correlated with intron number and is independent of gene expression level. We propose a twofold explanation behind higher intron enrichment in PMGs. Firstly, introns might increase protein versatility amongst PMGs through alternative splicing, providing selective advantage of PMGs and making them more persistent across diverse plant taxa. Also, multifunctional PMGs may acquire functional domains by increasing the intronic burden. Additionally, single nucleotide polymorphisms (SNPs) accumulate at a higher rate in introns as compared to exons. Moreover, a strong negative correlation between cumulative exonic SNPs density and intron number indicates that introns may protect the exonic regions against the deleterious effect of these mutations, making them more conserved.  相似文献   

3.
A heterotrimeric GTP-binding protein (G protein) plays a number of important roles in the signal-transduction pathways of eukaryotic cells. The allotetraploid tobacco genome has two alpha-subunit genes, NtGA1 and NtGA2, of the heterotrimeric G protein. In this study, we determined the nucleotide sequences and the exon-intron structures of the NtGA loci in tobacco and its ancestral diploid species. The genomic sequences of the NtGA loci were interrupted by 13 introns. The sizes of most exons (12 of 14) were completely conserved among the NtGA genes and the Arabidopsis alpha-subunit gene (GPA1), but most introns (11 of 13) in the NtGA genes were longer than those in GPA1. In comparison with the genomic sequences of the NtGA orthologues of ancestral Nicotiana sylvestris and Nicotiana tomentosiformis, the tobacco NtGA1 and NtGA2 were concluded to be homoeologous and assigned to the S and T genomes, respectively. More than 300 mutations including insertions-deletions (indels) and nucleotide substitutions were found in the intron regions between the NtGA1 and NtGA2 loci, whereas the exon sequences were highly conserved among these and GPA1. The structural comparison revealed larger divergence at the NtGA2 locus than at NtGA1.  相似文献   

4.
SSCP-SNP in pearl millet—a new marker system for comparative genetics   总被引:6,自引:0,他引:6  
A considerable array of genomic resources are in place in pearl millet, and marker-aided selection is already in use in the public breeding programme at ICRISAT. This paper describes experiments to extend these publicly available resources to a single nucleotide polymorphism (SNP)-based marker system. A new marker system, single-strand conformational polymorphism (SSCP)-SNP, was developed using annotated rice genomic sequences to initially predict the intron-exon borders in millet expressed sequence tags (ESTs) and then to design primers that would amplify across the introns. An adequate supply of millet ESTs was available for us to identify 299 homologues of single-copy rice genes in which the intron positions could be precisely predicted. PCR primers were then designed to amplify approximately 500-bp genomic fragments containing introns. Analysis of these fragments on SSCP gels revealed considerable polymorphism. A detailed DNA sequence analysis of variation at four of the SSCP-SNP loci over a panel of eight inbred genotypes showed complex patterns of variation, with about one SNP or indel (insertion-deletion) every 59 bp in the introns, but considerably fewer in the exons. About two-thirds of the variation was derived from SNPs and one-third from indels. Most haplotypes were detected by SSCP. As a marker system, SSCP-SNP has lower development costs than simple sequence repeats (SSRs), because much of the work is in silico, and similar deployment costs and through-put potential. The rates of polymorphism were lower but useable, with a mean PIC of 0.49 relative to 0.72 for SSRs in our eight inbred genotype panel screen. The major advantage of the system is in comparative applications. Syntenic information can be used to target SSCP-SNP markers to specific chromosomal regions or, conversely, SSCP-SNP markers can be used to unravel detailed syntenic relationships in specific parts of the genome. Finally, a preliminary analysis showed that the millet SSCP-SNP primers amplified in other cereals with a success rate of about 50%. There is also considerable potential to promote SSCP-SNP to a COS (conserved orthologous set) marker system for application across species by more specifically designing primers to precisely match the model genome sequence.Electronic Supplementary Material Supplementary material is available for this article at  相似文献   

5.
The role of spliceosomal intronic structures played in evolution has only begun to be elucidated. Comparative genomic analyses of fungal snoRNA sequences, which are often contained within introns and/or exons, revealed that about one-third of snoRNA-associated introns in three major snoRNA gene clusters manifested polymorphisms, likely resulting from intron loss and gain events during fungi evolution. Genomic deletions can clearly be observed as one mechanism underlying intron and exon loss, as well as generation of complex introns where several introns lie in juxtaposition without intercalating exons. Strikingly, by tracking conserved snoRNAs in introns, we found that some introns had moved from one position to another by excision from donor sites and insertion into target sties elsewhere in the genome without needing transposon structures. This study revealed the origin of many newly gained introns. Moreover, our analyses suggested that intron-containing sequences were more prone to sustainable structural changes than DNA sequences without introns due to intron''s ability to jump within the genome via unknown mechanisms. We propose that splicing-related structural features of introns serve as an additional motor to propel evolution.  相似文献   

6.
SLC11A1 (also known as Natural Resistance Associated Macrophage Protein1, NRAMP1) plays a crucial role in resistance of inbred mice to infection with several intracellular pathogens such as Mycobacterium, Leishmania and Salmonella. In this study, PCR amplification and sequencing were performed to obtain the genomic organization and sequence of porcine SLC11A1 gene by comparative genomic analysis. Results showed that porcine SLC11A1 gene consists of 15 exons and 14 introns, which is consistent with that of mice and human. All introns were sequenced and their nucleotide sequences were submitted to GenBank. The exon/intron boundaries were determined by comparing cDNA sequence with amplified genomic DNA sequences. Mutational analysis was performed on exonic and neighboring intronic region by denaturing high-performance liquid chromatography (DHPLC) and sequencing confirmation. Forty polymorphisms were identified; six are located in exons and thirty-four in introns. Two exonic polymorphisms are nonsynonymous changes (D6H and V175I), three are synonymous changes (S23, G33 and I155), and one is in 3' UTR. The availability of the fine genomic organization and identification of the polymorphisms will facilitate the evaluation of porcine SLC11A1 functional role in diseases resistance or susceptibility.  相似文献   

7.
Many alternative splicing events are regulated by pentameric and hexameric intronic sequences that serve as binding sites for splicing regulatory factors. We hypothesized that intronic elements that regulate alternative splicing are under selective pressure for evolutionary conservation. Using a Wobble Aware Bulk Aligner genomic alignment of Caenorhabditis elegans and Caenorhabditis briggsae, we identified 147 alternatively spliced cassette exons that exhibit short regions of high nucleotide conservation in the introns flanking the alternative exon. In vivo experiments on the alternatively spliced let-2 gene confirm that these conserved regions can be important for alternative splicing regulation. Conserved intronic element sequences were collected into a dataset and the occurrence of each pentamer and hexamer motif was counted. We compared the frequency of pentamers and hexamers in the conserved intronic elements to a dataset of all C. elegans intron sequences in order to identify short intronic motifs that are more likely to be associated with alternative splicing. High-scoring motifs were examined for upstream or downstream preferences in introns surrounding alternative exons. Many of the high- scoring nematode pentamer and hexamer motifs correspond to known mammalian splicing regulatory sequences, such as (T)GCATG, indicating that the mechanism of alternative splicing regulation is well conserved in metazoans. A comparison of the analysis of the conserved intronic elements, and analysis of the entire introns flanking these same exons, reveals that focusing on intronic conservation can increase the sensitivity of detecting putative splicing regulatory motifs. This approach also identified novel sequences whose role in splicing is under investigation and has allowed us to take a step forward in defining a catalog of splicing regulatory elements for an organism. In vivo experiments confirm that one novel high-scoring sequence from our analysis, (T)CTATC, is important for alternative splicing regulation of the unc-52 gene.  相似文献   

8.
Few intraspecific genetic linkage maps have been reported for cultivated tomato, mainly because genetic diversity within Solanum lycopersicum is much less than that between tomato species. Single nucleotide polymorphisms (SNPs), the most abundant source of genomic variation, are the most promising source of polymorphisms for the construction of linkage maps for closely related intraspecific lines. In this study, we developed SNP markers based on expressed sequence tags for the construction of intraspecific linkage maps in tomato. Out of the 5607 SNP positions detected through in silico analysis, 1536 were selected for high-throughput genotyping of two mapping populations derived from crosses between ‘Micro-Tom’ and either ‘Ailsa Craig’ or ‘M82’. A total of 1137 markers, including 793 out of the 1338 successfully genotyped SNPs, along with 344 simple sequence repeat and intronic polymorphism markers, were mapped onto two linkage maps, which covered 1467.8 and 1422.7 cM, respectively. The SNP markers developed were then screened against cultivated tomato lines in order to estimate the transferability of these SNPs to other breeding materials. The molecular markers and linkage maps represent a milestone in the genomics and genetics, and are the first step toward molecular breeding of cultivated tomato. Information on the DNA markers, linkage maps, and SNP genotypes for these tomato lines is available at http://www.kazusa.or.jp/tomato/.  相似文献   

9.
Cone snails, which are predatory marine gastropods, produce a cocktail of venoms used for predation, defense and competition. The major venom component, conotoxin, has received significant attention because it is useful in neuroscience research, drug development and molecular diversity studies. In this study, we report the genomic characterization of nine conotoxin gene superfamilies from 18 Conus species and investigate the relationships among conotoxin gene structure, molecular evolution and diversity. The I1, I2, M, O2, O3, P, S, and T superfamily precursors all contain three exons and two introns, while A superfamily members contain two exons and one intron. The introns are conserved within a certain gene superfamily, and also conserved across different Conus species, but divergent among different superfamilies. The intronic sequences contain many simple repeat sequences and regulatory elements that may influence conotoxin gene expression. Furthermore, due to the unique gene structure of conotoxins, the base substitution rates and the number of positively selected sites vary greatly among exons. Many more point mutations and trinucleotide indels were observed in the mature peptide exon than in the other exons. In addition, the first example of alternative splicing in conotoxin genes was found. These results suggest that the diversity of conotoxin genes has been shaped by point mutations and indels, as well as rare gene recombination or alternative splicing events, and that the unique gene structures could have made a contribution to the evolution of conotoxin genes.  相似文献   

10.
Genome-wide analysis of NBS-LRR-encoding genes in Arabidopsis   总被引:34,自引:0,他引:34       下载免费PDF全文
The Arabidopsis genome contains approximately 200 genes that encode proteins with similarity to the nucleotide binding site and other domains characteristic of plant resistance proteins. Through a reiterative process of sequence analysis and reannotation, we identified 149 NBS-LRR-encoding genes in the Arabidopsis (ecotype Columbia) genomic sequence. Fifty-six of these genes were corrected from earlier annotations. At least 12 are predicted to be pseudogenes. As described previously, two distinct groups of sequences were identified: those that encoded an N-terminal domain with Toll/Interleukin-1 Receptor homology (TIR-NBS-LRR, or TNL), and those that encoded an N-terminal coiled-coil motif (CC-NBS-LRR, or CNL). The encoded proteins are distinct from the 58 predicted adapter proteins in the previously described TIR-X, TIR-NBS, and CC-NBS groups. Classification based on protein domains, intron positions, sequence conservation, and genome distribution defined four subgroups of CNL proteins, eight subgroups of TNL proteins, and a pair of divergent NL proteins that lack a defined N-terminal motif. CNL proteins generally were encoded in single exons, although two subclasses were identified that contained introns in unique positions. TNL proteins were encoded in modular exons, with conserved intron positions separating distinct protein domains. Conserved motifs were identified in the LRRs of both CNL and TNL proteins. In contrast to CNL proteins, TNL proteins contained large and variable C-terminal domains. The extant distribution and diversity of the NBS-LRR sequences has been generated by extensive duplication and ectopic rearrangements that involved segmental duplications as well as microscale events. The observed diversity of these NBS-LRR proteins indicates the variety of recognition molecules available in an individual genotype to detect diverse biotic challenges.  相似文献   

11.
We describe the complete genomic sequences for the tobacco and Arabidopsis homologues of tomato LAT59, a previously described member of a family of pectate lyase-like genes. Translation of the tobacco gene, Nt59, predicts a protein with 93.5% overall amino acid similarity to LAT59. Nt59 has two introns whose positions are exactly conserved with the two introns of LAT59. Both LAT59 and Nt59 are specifically expressed in pollen and their promoter and 5-UTR sequences are highly similar. Furthermore, two promoter elements shown to be important for pollen expression of LAT59 are conserved in the Nt59 promoter. The Arabidopsis homologue, At59, was found by examination of four candidates. At59 has 72.6% amino acid similarity to LAT59 and the position of one of its two introns is conserved with one of the LAT59 introns. At59 is also pollen-expressed and although its promoter sequence is quite different from the Nt59 and LAT59 promoters, the two promoter elements are somewhat conserved.  相似文献   

12.
Orthologous introns have identical positions relative to the coding sequence in orthologous genes of different species. By analyzing the complete genomes of five plants we generated a database of 40,512 orthologous intron groups of dicotyledonous plants, 28,519 orthologous intron groups of angiosperms, and 15,726 of land plants (moss and angiosperms). Multiple sequence alignments of each orthologous intron group were obtained using the Mafft algorithm. The number of conserved regions in plant introns appeared to be hundreds of times fewer than that in mammals or vertebrates. Approximately three quarters of conserved intronic regions among angiosperms and dicots, in particular, correspond to alternatively-spliced exonic sequences. We registered only a handful of conserved intronic ncRNAs of flowering plants. However, the most evolutionarily conserved intronic region, which is ubiquitous for all plants examined in this study, including moss, possessed multiple structural features of tRNAs, which caused us to classify it as a putative tRNA-like ncRNA. Intronic sequences encoding tRNA-like structures are not unique to plants. Bioinformatics examination of the presence of tRNA inside introns revealed an unusually long-term association of four glycine tRNAs inside the Vac14 gene of fish, amniotes, and mammals.  相似文献   

13.
14.

Background  

Tomato species are of significant agricultural and ecological interest, with cultivated tomato being among the most common vegetable crops grown. Wild tomato species are native to diverse habitats in South America and show great morphological and ecological diversity that has proven useful in breeding programs. However, relatively little is known about nucleotide diversity between tomato species. Until recently limited sequence information was available for tomato, preventing genome-wide evolutionary analyses. Now, an extensive collection of tomato expressed sequence tags (ESTs) is available at the SOL Genomics Network (SGN). This database holds sequences from several species, annotated with quality values, assembled into unigenes, and tested for homology against other genomes. Despite the importance of polymorphism detection for breeding and natural variation studies, such analyses in tomato have mostly been restricted to cultivated accessions. Importantly, previous polymorphisms surveys mostly ignored the linked meta-information, limiting functional and evolutionary analyses. The current data in SGN is thus an under-exploited resource. Here we describe a cross-species analysis taking full-advantage of available information.  相似文献   

15.
Most molecular phylogenetic studies of vertebrates have been based on DNA sequences of mitochondrial-encoded genes. MtDNA evolves rapidly and is thus particularly useful for resolving relationships among recently evolved groups. However, it has the disadvantage that all of the mitochondrial genes are inherited as a single linkage group so that only one independent gene tree can be inferred regardless of the number of genes sequenced. Introns of nuclear genes are attractive candidates for independent sources of rapidly evolving DNA: they are pervasive, most of their nucleotides appear to be unconstrained by selection, and PCR primers can be designed for sequences in adjacent exons where nucleotide sequences are conserved. We sequenced intron 7 of the beta-fibrinogen gene (beta-fibint7) for a diversity of woodpeckers and compared the phylogenetic signal and nucleotide substitution properties of this DNA sequence with that of mitochondrial-encoded cytochrome b (cyt b) from a previous study. A few indels (insertions and deletions) were found in the beta-fibint7 sequences, but alignment was not difficult, and the indels were phylogentically informative. The beta-fibint7 and cyt b gene trees were nearly identical to each other but differed in significant ways from the traditional woodpecker classification. Cyt b evolves 2.8 times as fast as beta-fibint7 (14. 0 times as fast at third codon positions). Despite its relatively slow substitution rate, the phylogenetic signal in beta-fibint7 is comparable to that in cyt b for woodpeckers, because beta-fibint7 has less base composition bias and more uniform nucleotide substitution probabilities. As a consequence, compared with cyt b, beta-fibint7 nucleotide sites are expected to enter more distinct character states over the course of evolution and have fewer multiple substitutions and lower levels of homoplasy. Moreover, in contrast to cyt b, in which nearly two thirds of nucleotide sites rarely vary among closely related taxa, virtually all beta-fibint7 nucleotide sites appear free of selective constraints, which increases informative sites per unit sequenced. However, the estimated gamma distribution used to model rate variation among sites suggests constraints on some beta-fibint7 sites. This study suggests that introns will be useful for phylogenetic studies of recently evolved groups.  相似文献   

16.

Background

One of the goals of genomics is to identify the genetic loci responsible for variation in phenotypic traits. The completion of the tomato genome sequence and recent advances in DNA sequencing technology allow for in-depth characterization of genetic variation present in the tomato genome. Like many self-pollinated crops, cultivated tomato accessions show a low molecular but high phenotypic diversity. Here we describe the whole-genome resequencing of eight accessions (four cherry-type and four large fruited lines) chosen to represent a large range of intra-specific variability and the identification and annotation of novel polymorphisms.

Results

The eight genomes were sequenced using the GAII Illumina platform. Comparison of the sequences with the reference genome yielded more than 4 million single nucleotide polymorphisms (SNPs). This number varied from 80,000 to 1.5 million according to the accessions. Almost 128,000 InDels were detected. The distribution of SNPs and InDels across and within chromosomes was highly heterogeneous revealing introgressions from wild species and the mosaic structure of the genomes of the cherry tomato accessions. In-depth annotation of the polymorphisms identified more than 16,000 unique non-synonymous SNPs. In addition 1,686 putative copy-number variations (CNVs) were identified.

Conclusions

This study represents the first whole genome resequencing experiment in cultivated tomato. Substantial genetic differences exist between the sequenced tomato accessions and the reference sequence. The heterogeneous distribution of the polymorphisms may be related to introgressions that occurred during domestication or breeding. The annotated SNPs, InDels and CNVs identified in this resequencing study will serve as useful genetic tools, and as candidate polymorphisms in the search for phenotype-altering DNA variations.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-14-791) contains supplementary material, which is available to authorized users.  相似文献   

17.
Because cultivated tomato (Solanum lycopersicum L.) is low in genetic diversity, public, verified single nucleotide polymorphism (SNP) markers within the species are in demand. To promote marker development we resequenced approximately 23 kb in a diverse set of 31 tomato lines including TA496. Three classes of markers were sampled: (1) 26 expressed-sequence tag (EST), all of which were predicted to be polymorphic based on TA496, (2) 14 conserved ortholog set II (COSII) or unigene, and (3) ten published sequences, composed of nine fruit quality genes and one anonymous RFLP marker. The latter two types contained mostly noncoding DNA. In total, 154 SNPs and 34 indels were observed. The distributions of nucleotide diversity estimates among marker types were not significantly different from each other. Ascertainment bias of SNPs was evaluated for the EST markers. Despite the fact that the EST markers were developed using SNP prediction within a sample consisting of only one TA496 allele and one additional allele, the majority of polymorphisms in the 26 EST markers were represented among the other 30 tomato lines. Fifteen EST markers with published SNPs were more closely examined for bias. Mean SNP diversity observations were not significantly different between the original discovery sample of two lines (53 SNPs) and the 31 line diversity panel (56 SNPs). Furthermore, TA496 shared its haplotype with at least one other line at 11 of the 15 markers. These data demonstrate that public EST databases and noncoding regions are a valuable source of unbiased SNP markers in tomato. Electronic supplementary material  The online version of this article (doi:) contains supplementary material, which is available to authorized users. The use of trade, firm, or corporation names in this publication is for the information and convenience of the reader. Such use does not constitute an official endorsement or approval by the United States Department of Agriculture or the Agricultural Research Service of any product or service to the exclusion of others that may be suitable.  相似文献   

18.
19.
Angiosperms (flowering plants), including both monocots and dicots, contain small catalase gene families. In the dicot, Arabidopsis thaliana, two catalase (CAT) genes, CAT1 and CAT3, are tightly linked on chromosome 1 and a third, CAT2, which is more similar to CAT1 than to CAT3, is unlinked on chromosome 4. Comparison of positions and numbers of introns among 13 angiosperm catalase genomic sequences indicates that intron positions are conserved, and suggests that an ancestral catalase gene common to monocots and dicots contained seven introns. Arabidopsis CAT2 has seven introns; both CAT1 and CAT3 have six introns in positions conserved with CAT2, but each has lost a different intron. We suggest the following sequence of events during the evolution of the Arabidopsis catalase gene family. An initial duplication of an ancestral catalase gene gave rise to CAT3 and CAT1. CAT1 then served as the template for a second duplication, yielding CAT2. Intron losses from CAT1 and CAT3 followed these duplications. One subclade of monocot catalases has lost all but the 5''-most and 3''-most introns, which is consistent with a mechanism of intron loss by replacement of an ancestral intron-containing gene with a reverse-transcribed DNA copy of a fully spliced mRNA. Following this event of concerted intron loss, the Oryza sativa (rice, a monocot) CAT1 lineage acquired an intron in a novel position, consistent with a mechanism of intron gain at proto-splice sites.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号