首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Castle JC 《PloS one》2011,6(6):e20660
Rates of SNPs (single nucleotide polymorphisms) and cross-species genomic sequence conservation reflect intra- and inter-species variation, respectively. Here, I report SNP rates and genomic sequence conservation adjacent to mRNA processing regions and show that, as expected, more SNPs occur in less conserved regions and that functional regions have fewer SNPs. Results are confirmed using both mouse and human data. Regions include protein start codons, 3' splice sites, 5' splice sites, protein stop codons, predicted miRNA binding sites, and polyadenylation sites. Throughout, SNP rates are lower and conservation is higher at regulatory sites. Within coding regions, SNP rates are highest and conservation is lowest at codon position three and the fewest SNPs are found at codon position two, reflecting codon degeneracy for amino acid encoding. Exon splice sites show high conservation and very low SNP rates, reflecting both splicing signals and protein coding. Relaxed constraint on the codon third position is dramatically seen when separating exonic SNP rates based on intron phase. At polyadenylation sites, a peak of conservation and low SNP rate occurs from 30 to 17 nt preceding the site. This region is highly enriched for the sequence AAUAAA, reflecting the location of the conserved polyA signal. miRNA 3' UTR target sites are predicted incorporating interspecies genomic sequence conservation; SNP rates are low in these sites, again showing fewer SNPs in conserved regions. Together, these results confirm that SNPs, reflecting recent genetic variation, occur more frequently in regions with less evolutionarily conservation.  相似文献   

2.
3.
4.
We present a new microRNA target prediction algorithm called TargetBoost, and show that the algorithm is stable and identifies more true targets than do existing algorithms. TargetBoost uses machine learning on a set of validated microRNA targets in lower organisms to create weighted sequence motifs that capture the binding characteristics between microRNAs and their targets. Existing algorithms require candidates to have (1) near-perfect complementarity between microRNAs' 5' end and their targets; (2) relatively high thermodynamic duplex stability; (3) multiple target sites in the target's 3' UTR; and (4) evolutionary conservation of the target between species. Most algorithms use one of the two first requirements in a seeding step, and use the three others as filters to improve the method's specificity. The initial seeding step determines an algorithm's sensitivity and also influences its specificity. As all algorithms may add filters to increase the specificity, we propose that methods should be compared before such filtering. We show that TargetBoost's weighted sequence motif approach is favorable to using both the duplex stability and the sequence complementarity steps. (TargetBoost is available as a Web tool from http://www.interagon.com/demo/.).  相似文献   

5.
6.
7.
8.
Coding nucleotide sequences contain myriad functions independent of their encoded protein sequences. We present the COMIT algorithm to detect functional noncoding motifs in coding regions using sequence conservation, explicitly separating nucleotide from amino acid effects. COMIT concurs with diverse experimental datasets, including splicing enhancers, silencers, replication motifs, and microRNA targets, and predicts many novel functional motifs. Intriguingly, COMIT scores are well-correlated to scores uncalibrated for amino acids, suggesting that nucleotide motifs often override peptide-level constraints.  相似文献   

9.
10.
Nonsense-mediated decay (NMD) is a eukaryotic cellular RNA surveillance and quality-control mechanism that degrades mRNA containing premature stop codons (nonsense mutations) that otherwise may exert a deleterious effect by the production of dysfunctional truncated proteins. Collagen X (COL10A1) nonsense mutations in Schmid-type metaphyseal chondrodysplasia are localized in a region toward the 3' end of the last exon (exon 3) and result in mRNA decay, in contrast to most other genes in which terminal-exon nonsense mutations are resistant to NMD. We introduce nonsense mutations into the mouse Col10a1 gene and express these in a hypertrophic-chondrocyte cell line to explore the mechanism of last-exon mRNA decay of Col10a1 and demonstrate that mRNA decay is spatially restricted to mutations occurring in a 3' region of the exon 3 coding sequence; this region corresponds to where human mutations have been described. This localization of mRNA-decay competency suggested that a downstream region, such as the 3' UTR, may play a role in specifying decay of mutant Col10a1 mRNA containing nonsense mutations. We found that deleting any of the three conserved sequence regions within the 3' UTR (region I, 23 bp; region II, 170 bp; and region III, 76 bp) prevented mutant mRNA decay, but a smaller 13 bp deletion within region III was permissive for decay. These data suggest that the 3' UTR participates in collagen X last-exon mRNA decay and that overall 3' UTR configuration, rather than specific linear-sequence motifs, may be important in specifying decay of Col10a1 mRNA containing nonsense mutations.  相似文献   

11.
12.
13.
Animal microRNA (miRNA) target prediction is still a challenge, although many prediction programs have been exploited. MiRNAs exert their function through partially binding the messenger RNAs (mRNAs; likely at 3′ untranslated regions [3′UTRs]), which makes it possible to detect the miRNA-mRNA interactions in vitro by co-transfection of miRNA and a luciferase reporter gene containing the target mRNA fragment into mammalian cells under a dual-luciferase assay system. Here, we constructed a human miRNA expression library and used a dual-luciferase assay system to perform large-scale screens of interactions between miRNAs and the 3′UTRs of seven genes, which included more than 3,000 interactions with triplicate experiments for each interaction. The screening results showed that the 3′UTR of one gene can be targeted by multiple miRNAs. Among the prediction algorithms, a Bayesian phylogenetic miRNA target identification algorithm and a support vector machine (SVM) presented a relatively better performance (27% for EIMMo and 24.7% for miRDB) against the average precision (17.3%) of the nine prediction programs used here. Additionally, we noticed that a relatively high conservation level was shown at the miRNA 3′ end targeted regions, as well as the 5′ end (seed region) binding sites.  相似文献   

14.
MicroRNAs usually interact with 3' noncoding regions (3'NCRs) of target mRNAs leading to downregulation of mRNA expression. In contrast, liver-specific microRNA miR-122 interacts with the 5' end of the hepatitis C virus RNA genome, resulting in increased viral RNA abundance. We find that inserting the viral miR-122 binding site into the 3' noncoding region of a reporter mRNA leads to downregulation of mRNA expression, indicating that the location of the miR-122 binding site dictates its effect on gene regulation. Furthermore, we discovered an adjacent, second miR-122 binding site, separated from the first by a highly conserved 14-nucleotide sequence. Mutational analysis demonstrates that both miR-122 binding sites in a single viral genome are occupied by the microRNA and function cooperatively to regulate target gene expression. These findings set a paradigm for dual, position-dependent functions of tandem microRNA-binding sites. Targeting an oligomeric microRNA complex offers potential as an antiviral-intervention strategy.  相似文献   

15.
16.
17.
Comparative genomics of foot-and-mouth disease virus   总被引:28,自引:0,他引:28       下载免费PDF全文
Here we present complete genome sequences, including a comparative analysis, of 103 isolates of foot-and-mouth disease virus (FMDV) representing all seven serotypes and including the first complete sequences of the SAT1 and SAT3 genomes. The data reveal novel highly conserved genomic regions, indicating functional constraints for variability as well as novel viral genomic motifs with likely biological relevance. Previously undescribed invariant motifs were identified in the 5' and 3' untranslated regions (UTR), as was tolerance for insertions/deletions in the 5' UTR. Fifty-eight percent of the amino acids encoded by FMDV isolates are invariant, suggesting that these residues are critical for virus biology. Novel, conserved sequence motifs with likely functional significance were identified within proteins L(pro), 1B, 1D, and 3C. An analysis of the complete FMDV genomes indicated phylogenetic incongruities between different genomic regions which were suggestive of interserotypic recombination. Additionally, a novel SAT virus lineage containing nonstructural protein-encoding regions distinct from other SAT and Euroasiatic lineages was identified. Insights into viral RNA sequence conservation and variability and genetic diversity in nature will likely impact our understanding of FMDV infections, host range, and transmission.  相似文献   

18.
19.
20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号