Similar literature
 Found 20 similar articles (search time: 31 ms)
1.

Transposable elements may acquire unrelated gene fragments into their sequences in a process called transduplication. Transduplication of protein-coding genes is common in plants, but is unknown in animals. Here, we report that the Turmoil-1 transposable element in C. elegans has incorporated two protein-coding sequences into its inverted terminal repeat (ITR) sequences. The ITRs of Turmoil-1 contain a conserved RNA recognition motif (RRM) that originated from the rsp-2 gene and a fragment from the protein-coding region of the cpg-3 gene. We further report that an open reading frame specific to C. elegans may have been created as a result of a Turmoil-1 insertion. Mutations at the 5' splice site of this open reading frame may have reactivated the transduplicated RRM motif.

2.

Background  

Although the protein-coding sequences in the Saccharomyces cerevisiae genome have been studied and annotated extensively, much less is known about the extent and characteristics of the untranslated regions of yeast mRNAs.

3.
4.

Background  

The mitochondrial genomes of plants generally encode 30-40 identified protein-coding genes and a large number of lineage-specific ORFs. The lack of wide conservation for most ORFs suggests they are unlikely to be functional. However, an ORF, termed orf-bryo1, was recently found to be conserved among bryophytes, suggesting that it might indeed encode a functional mitochondrial protein.

5.

Background  

Despite extensive efforts devoted to predicting protein-coding genes in genome sequences, many bona fide genes have not been found and many existing gene models are not accurate in all sequenced eukaryote genomes. This situation is partly explained by the fact that gene prediction programs have been developed based on our incomplete understanding of gene feature information such as splicing and promoter characteristics. Additionally, full-length cDNAs of many genes and their isoforms are hard to obtain due to their low level or rare expression. In order to obtain full-length sequences of all protein-coding genes, alternative approaches are required.

6.

Background  

In search of new antifungal targets of potential interest for pharmaceutical companies, we initiated a comparative genomics study to identify the most promising protein-coding genes in fungal genomes. One criterion was the protein sequence conservation between reference pathogenic genomes. A second criterion was that the corresponding gene in Saccharomyces cerevisiae should be essential. Since thiamine pyrophosphate is an essential product involved in a variety of metabolic pathways, proteins responsible for its production satisfied these two criteria.

7.

Background  

Statistical approaches for protein design are relevant in the field of molecular evolutionary studies. In recent years, new, so-called structurally constrained (SC) models of protein-coding sequence evolution have been proposed, which use statistical potentials to assess sequence-structure compatibility. In a previous work, we defined a statistical framework for optimizing knowledge-based potentials especially suited to SC models. Our method used the maximum likelihood principle and provided what we call the joint potentials. However, the method required numerical estimations by the use of computationally heavy Markov Chain Monte Carlo sampling algorithms.

8.

Background

IgE sensitization to storage proteins from nuts and seeds is often related to severe allergic symptoms. There is a risk of immunological IgE cross-reactivity between storage proteins from different species. The potential clinical implication of such cross-reactivity is that allergens other than the known sensitizer can cause allergic symptoms. Previous studies have suggested that kiwi seed storage proteins may constitute hidden food allergens causing cross-reactive IgE-binding with peanut and other tree nut homologs, thereby mediating a potential risk of causing allergy symptoms among peanut and tree nut allergic individuals. The objective of this study was to investigate the degree of sensitization towards kiwi fruit seed storage proteins in a cohort of peanut allergic individuals.

Methods

A cohort of 59 adolescents and adults with peanut allergy was studied, and self-reported allergies to a number of additional foods were collected. Quantitative IgE measurements to seed storage proteins from kiwi and peanut were performed.

Results

In the cohort, 23 of the 59 individuals (39%) reported kiwi fruit allergy. The frequency of IgE sensitization to kiwi fruit and to any kiwi seed storage protein was higher among peanut allergic individuals also reporting kiwi fruit allergy (P = 0.0001 and P = 0.01). A positive relationship was found between IgE levels to 11S globulin (r = 0.65) and 7S globulin (r = 0.48) allergens from kiwi and peanut, but IgE levels to 2S albumin homologs did not correlate. Patients reporting kiwi fruit allergy also reported allergy to hazelnut (P = 0.015), soy (P < 0.0001), pea (P = 0.0002) and almond (P = 0.016) to a higher extent than peanut allergic individuals without kiwi allergy.

Conclusions

Thirty-nine percent of the peanut allergic patients in this cohort also reported kiwi fruit allergy. They displayed a higher degree of sensitization to storage proteins from both kiwi and peanut, and they also reported a higher extent of allergy to other nuts and legumes. On the molecular level, there was a correlation between IgE levels to 11S and 7S storage proteins from kiwi and peanut. Taken together, reported symptoms and serological findings to kiwi in this cohort of patients with concurrent allergy to peanut and kiwi fruit could be explained by a combination of cross-reactivity between the 11S and 7S globulins and co-sensitization to the 2S albumin Act d 13.

9.

Background  

Welwitschia mirabilis is the only extant member of the family Welwitschiaceae, one of three lineages of gnetophytes, an enigmatic group of gymnosperms variously allied with flowering plants or conifers. Limited sequence data and rapid divergence rates have precluded consensus on the evolutionary placement of gnetophytes based on molecular characters. Here we report on the first complete gnetophyte chloroplast genome sequence, from Welwitschia mirabilis, as well as analyses of divergence rates of protein-coding genes, comparisons of gene content and order, and phylogenetic implications.

10.

Background  

Nuclear DNA sequences provide genetic information that complements studies using mitochondrial DNA. Some 'universal' primer sets have been developed that target introns within protein-coding loci, but many simultaneously amplify introns from paralogous loci. Refining existing primer sets to target a single locus could circumvent this problem.

11.

Background  

While the C. elegans genome is extensively annotated, relatively little information is available for other Caenorhabditis species. The nematode genome annotation assessment project (nGASP) was launched to objectively assess the accuracy of protein-coding gene prediction software in C. elegans, and to apply this knowledge to the annotation of the genomes of four additional Caenorhabditis species and other nematodes. Seventeen groups worldwide participated in nGASP, and submitted 47 prediction sets across 10 Mb of the C. elegans genome. Predictions were compared to reference gene sets consisting of confirmed or manually curated gene models from WormBase.

12.

Background  

Tandem repeat variation in protein-coding regions will alter protein length and may introduce frameshifts. Tandem repeat variants are associated with variation in pathogenicity in bacteria and with human disease. We characterized tandem repeat polymorphism in human proteins, using the UniGene database, and tested whether these were associated with host defense roles.
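The first sentence of this abstract comes down to simple arithmetic: a change in repeat copy number adds or removes a multiple of the repeat unit length, and the reading frame survives only if that change is divisible by three. A minimal sketch follows; the function name `repeat_variant_effect` and its return layout are hypothetical, chosen for this illustration and not taken from the study.

```python
from typing import Optional, Tuple


def repeat_variant_effect(unit_len: int, copy_change: int) -> Tuple[int, bool, Optional[int]]:
    """Describe the effect of a tandem repeat copy-number change in a coding region.

    unit_len    -- length of one repeat unit in nucleotides (hypothetical input)
    copy_change -- copies gained (positive) or lost (negative)

    Returns (nucleotide change, frame preserved?, amino-acid change or None).
    """
    delta_nt = unit_len * copy_change
    # The reading frame is preserved only when the length change is a multiple of 3.
    in_frame = delta_nt % 3 == 0
    # An in-frame change lengthens or shortens the protein by whole residues;
    # for a frameshift the downstream sequence changes, so no simple count applies.
    delta_aa = delta_nt // 3 if in_frame else None
    return delta_nt, in_frame, delta_aa


if __name__ == "__main__":
    print(repeat_variant_effect(3, 2))   # trinucleotide repeat, two extra copies
    print(repeat_variant_effect(2, 1))   # dinucleotide repeat, one extra copy
```

Under these assumptions a trinucleotide repeat always stays in frame, while a dinucleotide repeat stays in frame only when the copy number changes by a multiple of three.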

13.
Localizing triplet periodicity in DNA and cDNA sequences   (Total citations: 1; self-citations: 0; citations by others: 1)

Background  

The protein-coding regions (coding exons) of a DNA sequence exhibit a triplet periodicity (TP) due to the fact that coding exons contain a series of three-nucleotide codons that encode specific amino acid residues. Such periodicity is usually not observed in introns and intergenic regions. If a DNA sequence is divided into small segments and a Fourier Transform is applied to each segment, a strong peak at frequency 1/3 is typically observed in the Fourier spectrum of coding segments, but not in non-coding regions. This property has been used in identifying the locations of protein-coding genes in unannotated sequence. The method is fast and requires no training. However, the need to compute the Fourier Transform across a segment (window) of arbitrary size affects the accuracy with which one can localize TP boundaries. Here, we report a technique that provides higher-resolution identification of these boundaries, and use the technique to explore the biological correlates of TP regions in the genome of the model organism C. elegans.
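As a rough illustration of the windowed Fourier approach this abstract describes (not the higher-resolution technique the authors report), the sketch below computes the spectral power at frequency 1/3 for each window of a sequence. The function names `tp_power` and `sliding_tp`, the window and step sizes, and the choice of four 0/1 base-indicator sequences are all assumptions made for this example.

```python
import cmath
from typing import List, Tuple


def tp_power(segment: str) -> float:
    """Spectral power at frequency 1/3, summed over the A, C, G, T
    indicator sequences and divided by the segment length."""
    n = len(segment)
    if n == 0:
        return 0.0
    total = 0.0
    for base in "ACGT":
        # DFT coefficient at frequency 1/3 of the 0/1 indicator sequence
        # that marks where this base occurs in the segment.
        coeff = sum(
            cmath.exp(-2j * cmath.pi * k / 3)
            for k, ch in enumerate(segment)
            if ch == base
        )
        total += abs(coeff) ** 2
    return total / n


def sliding_tp(seq: str, window: int = 90, step: int = 30) -> List[Tuple[int, float]]:
    """Return (window start, power at frequency 1/3) for each window."""
    return [
        (start, tp_power(seq[start:start + window]))
        for start in range(0, len(seq) - window + 1, step)
    ]


if __name__ == "__main__":
    # Toy input: a homopolymer stretch followed by a perfectly periodic stretch.
    toy = "A" * 90 + "ATG" * 30
    for start, power in sliding_tp(toy):
        print(start, round(power, 3))
```

On the toy input the power is near zero over the homopolymer and rises as the window moves into the periodic stretch. Windows that straddle the boundary give intermediate values, which is the localization problem the abstract points to.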

14.

Background  

In addition to known protein-coding genes, large amounts of apparently non-coding sequence are conserved between the human and mouse genomes. It seems reasonable to assume that these conserved regions are more likely to contain functional elements than less-conserved portions of the genome.

15.

Background  

Many current gene prediction methods use only one model to represent protein-coding regions in a genome, and so are less likely to predict the location of genes that have an atypical sequence composition. It is likely that future improvements in gene finding will involve the development of methods that can adequately deal with intra-genomic compositional variation.

16.

Background  

Many newly detected point mutations are located in protein-coding regions of the human genome. Knowledge of their effects on the protein's 3D structure provides insight into the protein's mechanism, can aid the design of further experiments, and eventually can lead to the development of new medicines and diagnostic tools.

17.

Background  

Frameshift mutations in protein-coding DNA sequences produce a drastic change in the resulting protein sequence, which prevents classic protein alignment methods from revealing the proteins' common origin. Moreover, when a large number of substitutions are additionally involved in the divergence, homology detection becomes difficult even at the DNA level.
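To make the effect described above concrete, the sketch below translates a short sequence with the standard genetic code before and after a single-base deletion. The helper names `translate` and `delete_base` and the toy sequence are hypothetical, chosen for this illustration only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate every complete codon in frame 0; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def delete_base(dna: str, index: int) -> str:
    """Return the sequence with the base at `index` removed."""
    return dna[:index] + dna[index + 1:]


if __name__ == "__main__":
    original = "ATGGCTGAAAAACTGTTC"        # toy coding sequence
    shifted = delete_base(original, 3)     # single-base deletion after the start codon
    print(translate(original))
    print(translate(shifted))
```

Every residue downstream of the deletion differs between the two translations, even though the DNA sequences differ by one base. This is why an alignment at the protein level would miss their common origin.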

18.
19.

Background  

Non-coding DNA sequences comprise a very large proportion of the total genomic content of mammals, most other vertebrates, many invertebrates, and most plants. Unraveling the functional significance of non-coding DNA depends on how well we are able to align non-coding DNA sequences. However, the alignment of non-coding DNA sequences is more difficult than aligning protein-coding sequences.

20.

Background  

While most multiple sequence alignment programs expect that all or most of their input is known to be homologous, and penalise insertions and deletions, this is not a reasonable assumption for non-coding DNA, which is much less strongly conserved than protein-coding genes. Arguing that the goal of sequence alignment should be the detection of homology and not similarity, we incorporate an evolutionary model into a previously published multiple sequence alignment program for non-coding DNA, Sigma, as a sensitive likelihood-based way to assess the significance of alignments. Version 1 of Sigma was successful in eliminating spurious alignments but exhibited relatively poor sensitivity on synthetic data. Sigma 1 used a p-value (the probability under the "null hypothesis" of non-homology) to assess the significance of alignments, and, optionally, a background model that captured short-range genomic correlations. Sigma version 2, described here, retains these features, but calculates the p-value using a sophisticated evolutionary model, and also allows for a transition matrix for different substitution rates from and to different nucleotides. Our evolutionary model takes separate account of mutation and fixation, and can be extended to allow for locally differing functional constraints on sequence.
