首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Behura SK  Severson DW 《Gene》2012,504(2):226-232
We present a detailed genome-scale comparative analysis of simple sequence repeats within protein coding regions among 25 insect genomes. The repetitive sequences in the coding regions primarily represented single codon repeats and codon pair repeats. The CAG triplet is highly repetitive in the coding regions of insect genomes. It is frequently paired with the synonymous codon CAA to code for polyglutamine repeats. The codon pairs that are least repetitive code for polyalanine repeats. The frequency of hexanucleotide and dinucleotide motifs of codon pair repeats is significantly (p<0.001) different in the Drosophila species compared to the non-Drosophila species. However, the frequency of synonymous and non-synonymous codon pair repeats varies in a correlated manner (r(2)=0.79) among all the species. Results further show that perfect and imperfect repeats have significant association with the trinucleotide and hexanucleotide coding repeats in most of these insects. However, only select species show significant association between the numbers of perfect/imperfect hexamers and repeat coding for single amino acid/amino acid pair runs. Our data further suggests that genes containing simple sequence coding repeats may be under negative selection as they tend to be poorly conserved across species. The sequences of coding repeats of orthologous genes vary according to the known phylogeny among the species. In conclusion, the study shows that simple sequence coding repeats are important features of genome diversity among insects.  相似文献   

2.
Heliconius butterflies represent a recent radiation of species, in which wing pattern divergence has been implicated in speciation. Several loci that control wing pattern phenotypes have been mapped and two were identified through sequencing. These same gene regions play a role in adaptation across the whole Heliconius radiation. Previous studies of population genetic patterns at these regions have sequenced small amplicons. Here, we use targeted next-generation sequence capture to survey patterns of divergence across these entire regions in divergent geographical races and species of Heliconius. This technique was successful both within and between species for obtaining high coverage of almost all coding regions and sufficient coverage of non-coding regions to perform population genetic analyses. We find major peaks of elevated population differentiation between races across hybrid zones, which indicate regions under strong divergent selection. These 'islands' of divergence appear to be more extensive between closely related species, but there is less clear evidence for such islands between more distantly related species at two further points along the 'speciation continuum'. We also sequence fosmid clones across these regions in different Heliconius melpomene races. We find no major structural rearrangements but many relatively large (greater than 1 kb) insertion/deletion events (including gain/loss of transposable elements) that are variable between races.  相似文献   

3.
We identified a 178 bp mobile DNA element in lettuce with characteristic CGAGC/GCTCG repeats in the subterminal regions. This element has terminal inverted repeats and 8-bp target site duplications typical of the hAT superfamily of class II mobile elements, but its small size and potential to form a single-stranded stable hairpin-like secondary structure suggest that it is related to MITE elements. In silico searches for related elements identified 252 plant sequences with 8-bp target site duplications and sequence similarity in their terminal and subterminal regions. Some of these sequences were predicted to encode transposases and may be autonomous elements; these constituted a separate clade within the phylogram of hAT transposases. We demonstrate that the CGAGC/GCTCG pentamer maximizes the hairpin stability compared to any other pentamer with the same C + G content, and the secondary structures of these elements are more stable than for most MITEs. We named these elements collectively as hATpin elements because of the hAT similarity and their hairpin structures. The nearly complete rice genome sequence and the highly advanced genome annotation allowed us to localize most rice elements and to deduce insertion preferences. hATpin elements are distributed on all chromosomes, but with significant bias for chromosomes 1 and 10 and in regions of moderate gene density. This family of class II mobile elements is found primarily in monocot species, but is also present in dicot species. Electronic supplementary material Electronic supplementary material is available for this article at and accessible for authorised users.  相似文献   

4.
5.
Morphological evolution is driven both by coding sequence variation and by changes in regulatory sequences. However, how cis-regulatory modules (CRMs) evolve to generate entirely novel expression domains is largely unknown. Here, we reconstruct the evolutionary history of a lens enhancer located within a CRM that not only predates the lens, a vertebrate innovation, but bilaterian animals in general. Alignments of orthologous sequences from different deuterostomes sub-divide the CRM into a deeply conserved core and a more divergent flanking region. We demonstrate that all deuterostome flanking regions, including invertebrate sequences, activate gene expression in the zebrafish lens through the same ancient cluster of activator sites. However, levels of gene expression vary between species due to the presence of repressor motifs in flanking region and core. These repressor motifs are responsible for the relatively weak enhancer activity of tetrapod flanking regions. Ray-finned fish, however, have gained two additional lineage-specific activator motifs which in combination with the ancient cluster of activators and the core constitute a potent lens enhancer. The exploitation and modification of existing regulatory potential in flanking regions but not in the highly conserved core might represent a more general model for the emergence of novel regulatory functions in complex CRMs.  相似文献   

6.
7.
Identification of closely related nematode species or races can be very difficult when diagnostic characters are plastic and overlapping. In this study we describe the use of polymerase chain reaction technology and direct DNA sequencing on 19 populations of Bursaphelenchus spp. to help understand their taxonomic relationships. The 5'' end of the heat shock 70A gene from Caenorhabditis elegans was used as the target DNA sequence because it contains both coding and non-coding regions. The results indicate that the 19 populations could be divided into five types within B. xylophilus and four types within B. mucronatus. On a larger scale, the data revealed three distinct groups, representing B. xylophilus from North America and Japan, B. mucronatus from Japan, and "B. mucronatus" from Europe. There is sufficient difference between the European and Japanese "B. mucronatus" groups to warrant their consideration as separate species.  相似文献   

8.

Background

Cochliobolus heterostrophus is a dothideomycete that causes Southern Corn Leaf Blight disease. There are two races, race O and race T that differ by the absence (race O) and presence (race T) of ~ 1.2-Mb of DNA encoding genes responsible for the production of T-toxin, which makes race T much more virulent than race O. The presence of repetitive elements in fungal genomes is considered to be an important source of genetic variability between different species.

Results

A detailed analysis of class I and II TEs identified in the near complete genome sequence of race O was performed. In total in race O, 12 new families of transposons were identified. In silico evidence of recent activity was found for many of the transposons and analyses of expressed sequence tags (ESTs) demonstrated that these elements were actively transcribed. Various potentially active TEs were found near coding regions and may modify the expression and structure of these genes by acting as ectopic recombination sites. Transposons were found on scaffolds carrying polyketide synthase encoding genes, responsible for production of T-toxin in race T. Strong evidence of ectopic recombination was found, demonstrating that TEs can play an important role in the modulation of genome architecture of this species. The Repeat Induced Point mutation (RIP) silencing mechanism was shown to have high specificity in C. heterostrophus, acting only on transposons near coding regions.

Conclusions

New families of transposons were identified. In C. heterostrophus, the RIP silencing mechanism is efficient and selective. The co-localization of effector genes and TEs, therefore, exposes those genes to high rates of point mutations. This may accelerate the rate of evolution of these genes, providing a potential advantage for the host. Additionally, it was shown that ectopic recombination promoted by TEs appears to be the major event in the genome reorganization of this species and that a large number of elements are still potentially active. So, this study provides information about the potential impact of TEs on the evolution of C. heterostrophus.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-536) contains supplementary material, which is available to authorized users.  相似文献   

9.
A frequently used approach for detecting potential coding regions is to search for stop codons. In the standard genetic code 3 out of 64 trinucleotides are stop codons. Hence, in random or non-coding DNA one can expect every 21st trinucleotide to have the same sequence as a stop codon. In contrast, the open reading frames (ORFs) of most protein-coding genes are considerably longer. Thus, the stop codon frequency in coding sequences deviates from the background frequency of the corresponding trinucleotides. This has been utilized for gene prediction, in particular, in detecting protein-coding ORFs. Traditional methods based on stop codon frequency are based on the assumption that the GC content is about 50%. However, many genomes show significant deviations from that value. With the presented method we can describe the effects of GC content on the selection of appropriate length thresholds of potentially coding ORFs. Conversely, for a given length threshold, we can calculate the probability of observing it in a random sequence. Thus, we can derive the maximum GC content for which ORF length is practicable as a feature for gene prediction methods and the resulting false positive rates. A rough estimate for an upper limit is a GC content of 80%. This estimate can be made more precise by including further parameters and by taking into account start codons as well. We demonstrate the feasibility of this method by applying it to the genomes of the bacteria Rickettsia prowazekii, Escherichia coli and Caulobacter crescentus, exemplifying the effect of GC content variations according to our predictions. We have adapted the method for predicting coding ORFs by stop codon frequency to the case of GC contents different from 50%. Usually, several methods for gene finding need to be combined. Thus, our results concern a specific part within a package of methods. Interestingly, for genomes with low GC content such as that of R. prowazekii, the presented method provides remarkably good results even when applied alone.  相似文献   

10.
The proliferation of retrotransposons within a genome can contribute to increased size and affect the function of eukaryotic genes. BEL/Pao-like long-terminal repeat (LTR) retrotransposons were annotated from the highly adaptable insect species Diabrotica virgifera virgifera, the Western corn rootworm, using survey sequences from bacterial artificial chromosome (BAC) inserts and contigs derived from a low coverage next-generation genome sequence assembly. Eleven unique D. v. virgifera BEL elements were identified that contained full-length gagpol coding sequences, whereas 88 different partial coding regions were characterized from partially assembled elements. Estimated genome copy number for full and partial BEL-like elements ranged from ~ 8 to 1582 among individual contigs using a normalized depth of coverage (DOC) among Illumina HiSeq reads (total genome copy number ~ 8821). BEL element copy number was correlated among different D. v. virgifera populations (R2 = 0.9846), but individual element numbers varied ≤ 1.68-fold and the total number varied by ~ 527 copies. These data indicate that BEL element proliferation likely contributed to a large genome size, and suggest that differences in copy number are a source of genetic variability among D. v. virgifera.  相似文献   

11.
12.
13.
14.
15.
16.
The high occurrence of nosocomial multidrug-resistant (MDR) microorganisms is considered a global health problem. Here, we report the draft genome sequence of a MDR Pseudomonas aeruginosa strain isolated in Brazil that belongs to the endemic clone ST277. The genome encodes important resistance determinant genes and consists of 6.7 Mb with a G+C content of 66.86% and 6,347 predicted coding regions including 60 RNAs.  相似文献   

17.
Lazarow K  Du ML  Weimer R  Kunze R 《Genetics》2012,191(3):747-756
Activator/Dissociation (Ac/Ds) transposable elements from maize are widely used as insertional mutagenesis and gene isolation tools in plants and more recently also in medaka and zebrafish. They are particularly valuable for plant species that are transformation-recalcitrant and have long generation cycles or large genomes with low gene densities. Ac/Ds transposition frequencies vary widely, however, and in some species they are too low for large-scale mutagenesis. We discovered a hyperactive Ac transposase derivative, AcTPase(4x), that catalyzes in the yeast Saccharomyces cerevisiae 100-fold more frequent Ds excisions than the wild-type transposase, whereas the reintegration frequency of excised Ds elements is unchanged (57%). Comparable to the wild-type transposase in plants, AcTPase(4x) catalyzes Ds insertion preferentially into coding regions and to genetically linked sites, but the mutant protein apparently has lost the weak bias of the wild-type protein for insertion sites with elevated guanine-cytosine content and nonrandom protein-DNA twist. AcTPase(4x) exhibits hyperactivity also in Arabidopsis thaliana where it effects a more than sixfold increase in Ds excision relative to wild-type AcTPase and thus may be useful to facilitate Ac/Ds-based insertion mutagenesis approaches.  相似文献   

18.
19.
RNA sequence elements involved in the regulation of pre-mRNA splicing have previously been identified in vertebrate genomes by computational methods. Here, we apply such approaches to predict splicing regulatory elements in Drosophila melanogaster and compare them with elements previously found in the human, mouse, and pufferfish genomes. We identified 99 putative exonic splicing enhancers (ESEs) and 231 putative intronic splicing enhancers (ISEs) enriched near weak 5' and 3' splice sites of constitutively spliced introns, distinguishing between those found near short and long introns. We found that a significant proportion (58%) of fly enhancer sequences were previously reported in at least one of the vertebrates. Furthermore, 20% of putative fly ESEs were previously identified as ESEs in human, mouse, and pufferfish; while only two fly ISEs, CTCTCT and TTATAA, were identified as ISEs in all three vertebrate species. Several putative enhancer sequences are similar to characterized binding-site motifs for Drosophila and mammalian splicing regulators. To provide additional evidence for the function of putative ISEs, we separately identified 298 intronic hexamers significantly enriched within sequences phylogenetically conserved among 15 insect species. We found that 73 putative ISEs were among those enriched in conserved regions of the D. melanogaster genome. The functions of nine enhancer sequences were verified in a heterologous splicing reporter, demonstrating that these sequences are sufficient to enhance splicing in vivo. Taken together, these data identify a set of predicted positive-acting splicing regulatory motifs in the Drosophila genome and reveal regulatory sequences that are present in distant metazoan genomes.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号