首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
DNA sequence variations of chalcone synthase (Chs) and Apetala3 gene promoters from 22 cruciferous plant species were analyzed to identify putative conserved regulatory elements. Our comparative approach confirmed the existence of numerous conserved sequences which may act as regulatory elements in both investigated promoters. To confirm the correct identification of a well-conserved UV-light-responsive promoter region, a subset of Chs promoter fragments were tested in Arabidopsis thaliana protoplasts. All promoters displayed similar light responsivenesses, indicating the general functional relevance of the conserved regulatory element. In addition to known regulatory elements, other highly conserved regions were detected which are likely to be of functional importance. Phylogenetic trees based on DNA sequences from both promoters (gene trees) were compared with the hypothesized phylogenetic relationships (species trees) of these taxa. The data derived from both promoter sequences were congruent with the phylogenies obtained from coding regions of other nuclear genes and from chloroplast DNA sequences. This indicates that promoter sequence evolution generally is reflective of species phylogeny. Our study also demonstrates the great value of comparative genomics and phylogenetics as a basis for functional analysis of promoter action and gene regulation.  相似文献   

2.
The identification of noncoding functional elements within vertebrate genomes, such as those that regulate gene expression, is a major challenge. Comparisons of orthologous sequences from multiple species are effective at detecting highly conserved regions and can reveal potential regulatory sequences. The GDF6 gene controls developmental patterning of skeletal joints and is associated with numerous, distant cis-acting regulatory elements. Using sequence data from 14 vertebrate species, we performed novel multispecies comparative analyses to detect highly conserved sequences flanking GDF6. The complementary tools WebMCS and ExactPlus identified a series of multispecies conserved sequences (MCSs). Of particular interest are MCSs within noncoding regions previously shown to contain GDF6 regulatory elements. A previously reported conserved sequence at -64 kb was also detected by both WebMCS and ExactPlus. Analysis of LacZ-reporter transgenic mice revealed that a 440-bp segment from this region contains an enhancer for Gdf6 expression in developing proximal limb joints. Several other MCSs represent candidate GDF6 regulatory elements; many of these are not conserved in fish or frog, but are strongly conserved in mammals.  相似文献   

3.
We analyze the secondary structure of two expansion segments (D2, D3) of the 28S ribosomal (rRNA)-encoding gene region from 527 chalcidoid wasp taxa (Hymenoptera: Chalcidoidea) representing 18 of the 19 extant families. The sequences are compared in a multiple sequence alignment, with secondary structure inferred primarily from the evidence of compensatory base changes in conserved helices of the rRNA molecules. This covariation analysis yielded 36 helices that are composed of base pairs exhibiting positional covariation. Several additional regions are also involved in hydrogen bonding, and they form highly variable base-pairing patterns across the alignment. These are identified as regions of expansion and contraction or regions of slipped-strand compensation. Additionally, 31 single-stranded locales are characterized as regions of ambiguous alignment based on the difficulty in assigning positional homology in the presence of multiple adjacent indels. Based on comparative analysis of these sequences, the largest genetic study on any hymenopteran group to date, we report an annotated secondary structural model for the D2, D3 expansion segments that will prove useful in assigning positional nucleotide homology for phylogeny reconstruction in these and closely related apocritan taxa.  相似文献   

4.
5.
Structure and expression of the murine L-myc gene.   总被引:25,自引:5,他引:20       下载免费PDF全文
  相似文献   

6.
7.
A challenge for mammalian genetics is the recognition of critical regulatory regions in primary gene sequence. One approach to this problem is to compare sequences from genes exhibiting highly conserved expression patterns in disparate organisms. Previous transgenic and transfection analyses defined conserved regulatory domains in the mouse and human adenosine deaminase (ADA) genes. We have thus attempted to identify regions with comparable similarity levels potentially indicative of critical ADA regulatory regions. On the basis of aligned regions of the mouse and human ADA gene, using a 24-bp window, we find that similarity overall (67.7%) and throughout the noncoding sequences (67.1%) is markedly lower than that of the coding regions (81%). This low overall similarity facilitated recognition of more highly conserved regions. In addition to the highly conserved exons, ten noncoding regions >100 bp in length displayed >70% sequence similarity. Most of these contained numerous 24-bp windows with much higher levels of similarity. A number of these regions, including the promoter and the thymic enhancer, were more similar than several exons. A third block, located near the thymic enhancer but just outside of a minimally defined locus control region, exhibited stronger similarity than the promoter or thymic enhancer. In contrast, only fragmentary similarity was exhibited in a region that harbors a strong duodenal enhancer in the human gene. These studies show that comparative sequence analysis can be a powerful tool for identifying conserved regulatory domains, but that some conserved sequences may not be detected by certain functional analyses as transgenic mice. Received: 27 March 1998 / Accepted: 22 September 1998  相似文献   

8.
Multiple sequence alignments have wide applicability in many areas of computational biology, including comparative genomics, functional annotation of proteins, gene finding, and modeling evolutionary processes. Because of the computational difficulty of multiple sequence alignment and the availability of numerous tools, it is critical to be able to assess the reliability of multiple alignments. We present a tool called StatSigMA to assess whether multiple alignments of nucleotide or amino acid sequences are contaminated with one or more unrelated sequences. There are numerous applications for which StatSigMA can be used. Two such applications are to distinguish homologous sequences from nonhomologous ones and to compare alignments produced by various multiple alignment tools. We present examples of both types of applications.  相似文献   

9.
10.
Exon discovery by genomic sequence alignment   总被引:5,自引:0,他引:5  
MOTIVATION: During evolution, functional regions in genomic sequences tend to be more highly conserved than randomly mutating 'junk DNA' so local sequence similarity often indicates biological functionality. This fact can be used to identify functional elements in large eukaryotic DNA sequences by cross-species sequence comparison. In recent years, several gene-prediction methods have been proposed that work by comparing anonymous genomic sequences, for example from human and mouse. The main advantage of these methods is that they are based on simple and generally applicable measures of (local) sequence similarity; unlike standard gene-finding approaches they do not depend on species-specific training data or on the presence of cognate genes in data bases. As all comparative sequence-analysis methods, the new comparative gene-finding approaches critically rely on the quality of the underlying sequence alignments. RESULTS: Herein, we describe a new implementation of the sequence-alignment program DIALIGN that has been developed for alignment of large genomic sequences. We compare our method to the alignment programs PipMaker, WABA and BLAST and we show that local similarities identified by these programs are highly correlated to protein-coding regions. In our test runs, PipMaker was the most sensitive method while DIALIGN was most specific. AVAILABILITY: The program is downloadable from the DIALIGN home page at http://bibiserv.techfak.uni-bielefeld.de/dialign/.  相似文献   

11.
12.
Abstract The puffer fish Takifugu rubripes (Fugu), with its compact genome, is an ideal model organism for comparative genomics. Sonic hedgehog (Shh) is a key protein in the patterning of differentiating cells during embryonic development. We have sequenced the Fugu Shh gene and compared it with the mammalian and zebrafish orthologs, identifying a number of novel conserved, non-coding sequences upstream of exon one and within the two introns. Additional conserved sequences serve to delineate activator regions and enhancers previously characterized through functional analysis. Control elements can thus be rapidly and effectively predicted by comparative methodology in its own right as well as complementing other, functional methods. This work demonstrates the value of using Fugu in comparative genomics, which has allowed identification of new putative regulatory elements, as well as corroborating enhancers identified by the more traditional deletion mapping method.  相似文献   

13.
We describe a multiple alignment program named MAP2 based on a generalized pairwise global alignment algorithm for handling long, different intergenic and intragenic regions in genomic sequences. The MAP2 program produces an ordered list of local multiple alignments of similar regions among sequences, where different regions between local alignments are indicated by reporting only similar regions. We propose two similarity measures for the evaluation of the performance of MAP2 and existing multiple alignment programs. Experimental results produced by MAP2 on four real sets of orthologous genomic sequences show that MAP2 rarely missed a block of transitively similar regions and that MAP2 never produced a block of regions that are not transitively similar. Experimental results by MAP2 on six simulated data sets show that MAP2 found the boundaries between similar and different regions precisely. This feature is useful for finding conserved functional elements in genomic sequences. The MAP2 program is freely available in source code form at http://bioinformatics.iastate.edu/aat/sas.html for academic use.  相似文献   

14.
15.
16.
We investigated the occurrence of gene conversions between paralogous sequences of Salmoninae derived from ancestral tetraploidization and their effect on the evolutionary history of DNA sequences. A microsatellite with long flanking regions (750 bp) including both coding and noncoding sequences was analyzed. Microsatellite size polymorphism was used to detect the alleles of both paralogous counterparts and infer linkage arrangement between loci. DNA sequencing of seven Salmoninae species revealed that paralogous sequences were highly differentiated within species, especially for noncoding regions. Ten gene conversion events between paralogous sequences were inferred. While these events appears to have homogenized regions of otherwise highly differential paralogous sequences, they amplified the differentiation among orthologous sequences. Their effects were larger on coding than on noncoding regions. As a consequence, noncoding sequences grouped by orthologous lineages in phylogenetic trees, whereas coding regions grouped by taxa. Based upon these results, we present a model showing how gene conversion events may also result in the PCR amplification of nonorthologous sequences in different taxa, with obvious complications for phylogenetic inferences, comparative mapping, and population genetic studies. Received: 11 October 2000 / Accepted: 18 September 2001  相似文献   

17.
Brachypodium distachyon (Brachypodium) has been recently recognized as an emerging model system for both comparative and functional genomics in grass species. In this study, 55,221 repeat masked Brachypodium BAC end sequences (BES) were used for comparative analysis against the 12 rice pseudomolecules. The analysis revealed that ~26.4% of BES have significant matches with the rice genome and 82.4% of the matches were homologous to known genes. Further analysis of paired-end BES and ~1.0 Mb sequences from nine selected BACs proved to be useful in revealing conserved regions and regions that have undergone considerable genomic changes. Differential gene amplification, insertions/deletions and inversions appeared to be the common evolutionary events that caused variations of microcolinearity at different orthologous genomic regions. It was found that ~17% of genes in the two genomes are not colinear in the orthologous regions. Analysis of BAC sequences also revealed higher gene density (~9 kb/gene) and lower repeat DNA content (~13.1%) in Brachypodium when compared to the orthologous rice regions, consistent with the smaller size of the Brachypodium genome. The 119 annotated Brachypodium genes were BLASTN compared against the wheat EST database and deletion bin mapped wheat ESTs. About 77% of the genes retrieved significant matches in the EST database, while 9.2% matched to the bin mapped ESTs. In some cases, genes in single Brachypodium BACs matched to multiple ESTs that were mapped to the same deletion bins, suggesting that the Brachypodium genome will be useful for ordering wheat ESTs within the deletion bins and developing specific markers at targeted regions in the wheat genome.  相似文献   

18.
Expression patterns of gene products provide important insights into gene function. Reporter constructs are frequently used to analyze gene expression in Caenorhabditis elegans, but the sequence context of a given gene is inevitably altered in such constructs. As a result, these transgenes may lack regulatory elements required for proper gene expression. We developed Gene Catchr, a novel method of generating reporter constructs that exploits yeast homologous recombination (YHR) to subclone and tag worm genes while preserving their local sequence context. YHR facilitates the cloning of large genomic regions, allowing the isolation of regulatory sequences in promoters, introns, untranslated regions and flanking DNA. The endogenous regulatory context of a given gene is thus preserved, producing expression patterns that are as accurate as possible. Gene Catchr is flexible: any tag can be inserted at any position without introducing extra sequence. Each step is simple and can be adapted to process multiple genes in parallel. We show that expression patterns derived from Gene Catchr transgenes are consistent with previous reports and also describe novel expression data. Mutant rescue assays demonstrate that Gene Catchr-generated transgenes are functional. Our results validate the use of Gene Catchr as a valuable tool to study spatiotemporal gene expression.  相似文献   

19.
Comparative genomics provides insight into the evolutionary dynamics that shape discrete sequences as well as whole genomes. To advance comparative genomics within the Brassicaceae, we have end sequenced 23,136 medium-sized insert clones from Boechera stricta, a wild relative of Arabidopsis (Arabidopsis thaliana). A significant proportion of these sequences, 18,797, are nonredundant and display highly significant similarity (BLASTn e-value < or = 10(-30)) to low copy number Arabidopsis genomic regions, including more than 9,000 annotated coding sequences. We have used this dataset to identify orthologous gene pairs in the two species and to perform a global comparison of DNA regions 5' to annotated coding regions. On average, the 500 nucleotides upstream to coding sequences display 71.4% identity between the two species. In a similar analysis, 61.4% identity was observed between 5' noncoding sequences of Brassica oleracea and Arabidopsis, indicating that regulatory regions are not as diverged among these lineages as previously anticipated. By mapping the B. stricta end sequences onto the Arabidopsis genome, we have identified nearly 2,000 conserved blocks of microsynteny (bracketing 26% of the Arabidopsis genome). A comparison of fully sequenced B. stricta inserts to their homologous Arabidopsis genomic regions indicates that indel polymorphisms >5 kb contribute substantially to the genome size difference observed between the two species. Further, we demonstrate that microsynteny inferred from end-sequence data can be applied to the rapid identification and cloning of genomic regions of interest from nonmodel species. These results suggest that among diploid relatives of Arabidopsis, small- to medium-scale shotgun sequencing approaches can provide rapid and cost-effective benefits to evolutionary and/or functional comparative genomic frameworks.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号