Similar literature
 Found 20 similar articles (search time: 31 ms)
1.
Zhang L, Luo L. Nucleic Acids Research, 2003, 31(21): 6214-6220
Based on the conservation of nucleotides at splicing sites and the features of base composition and base correlation around these sites, we use the method of increment of diversity combined with quadratic discriminant analysis (IDQD) to study the dependence structure of splicing sites and predict the exons/introns and their boundaries for four model genomes: Caenorhabditis elegans, Arabidopsis thaliana, Drosophila melanogaster and human. The comparison of compositional features between two sequences and the comparison of base dependencies at adjacent or non-adjacent positions of two sequences can be integrated automatically in the increment of diversity (ID). Eight feature variables around a potential splice site are defined in terms of ID. They are integrated in a single formal framework given by IDQD. In our calculations, a 7 (8) base region around the donor (acceptor) sites has been considered in studying the conservation of nucleotides, and sequences of 48 bp on either side of splice sites have been used in studying the compositional and base-correlating features. The windows are enlarged to 16 (donor), 29 (acceptor) and 80 bp (either side) to improve the prediction for human splice sites. The prediction capability of the present method is comparable with that of the leading splice site detector, GeneSplicer.
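The abstract does not state the ID formula. As a rough illustration, the sketch below assumes the commonly used definition of the diversity measure, D = N ln N - Σ n_i ln n_i, and of the increment of diversity, ID(X, Y) = D(X + Y) - D(X) - D(Y), applied here to plain base counts. The function names are hypothetical, and the eight feature variables and the quadratic discriminant step of IDQD are not reproduced.

```python
import math
from collections import Counter


def diversity(counts):
    """Diversity measure D = N*ln(N) - sum(n_i*ln(n_i)) over nonzero counts (assumed definition)."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return total * math.log(total) - sum(
        n * math.log(n) for n in counts.values() if n > 0
    )


def increment_of_diversity(counts_x, counts_y):
    """ID(X, Y) = D(X + Y) - D(X) - D(Y); it is 0 when both sources have the same proportions."""
    merged = Counter(counts_x) + Counter(counts_y)
    return diversity(merged) - diversity(counts_x) - diversity(counts_y)


def base_counts(seq):
    """Count single bases; a real feature set might count k-mers or position-specific bases instead."""
    return Counter(seq.upper())


# Hypothetical usage: compare a candidate window against a reference composition.
candidate = base_counts("GTAAGTATTT")
reference = base_counts("GTGAGTACCC")
score = increment_of_diversity(candidate, reference)
```

A small ID means the candidate window resembles the reference source in composition; a large ID means it does not. In a scheme like the one described, several such ID values would serve as input features for a discriminant classifier.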

2.
3.
4.
5.
Activation of the insect innate immune system is dependent on a limited number of pattern recognition receptors (PRRs) capable of interacting with pathogen-associated molecular patterns. Here we report a novel role of an alternatively spliced hypervariable immunoglobulin domain-encoding gene, Dscam, in generating a broad range of PRRs implicated in immune defense in the malaria vector Anopheles gambiae. The mosquito Down syndrome cell adhesion molecule gene, AgDscam, has a complex genome organization with 101 exons that can produce over 31,000 potential alternative splice forms with different combinations of adhesive domains and interaction specificities. AgDscam responds to infection by producing pathogen challenge-specific splice form repertoires. Transient silencing of AgDscam compromises the mosquito's resistance to infections with bacteria and the malaria parasite Plasmodium. AgDscam mediates phagocytosis of bacteria with which it can associate and against which it can defend in a splice form–specific manner. AgDscam is a hypervariable PRR of the A. gambiae innate immune system.

6.
Identifying DNA splice sites is a main task of gene hunting. We introduce the hypernetwork architecture as a novel method for finding DNA splice sites. The hypernetwork architecture is a biologically inspired information processing system composed of networks of molecules forming cells, and a number of cells forming a tissue or organism. Its learning is based on molecular evolution. DNA examples taken from GenBank were translated into binary strings and fed into a hypernetwork for training. We performed experiments to explore the generalization performance of hypernetwork learning in this data set by two-fold cross validation. The hypernetwork generalization performance was comparable to that of well-known classification algorithms. With the best hypernetwork obtained, including local information and heuristic rules, we built a system (HyperExon) to obtain splice site candidates. The HyperExon system outperformed leading splice recognition systems in the list of sequences tested.
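The abstract says DNA examples were translated into binary strings but does not give the encoding. One plausible scheme, shown purely as an assumption, maps each base to two bits; the table `TWO_BIT` and the function `encode_dna` are hypothetical names, not part of HyperExon.

```python
# Assumed 2-bit code; the paper's actual mapping is not stated in the abstract.
TWO_BIT = {"A": "00", "C": "01", "G": "10", "T": "11"}


def encode_dna(seq):
    """Translate a DNA string into a binary string, two bits per base."""
    try:
        return "".join(TWO_BIT[base] for base in seq.upper())
    except KeyError as err:
        # Ambiguity codes such as N have no 2-bit representation in this sketch.
        raise ValueError(f"unsupported base: {err.args[0]!r}") from None
```

With this mapping a window of n bases becomes a fixed-length string of 2n bits, which is the kind of input a bit-string classifier expects.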

7.
A large number of RNA-sequencing studies set out to predict mutations, splice junctions or fusion RNAs. We propose a method, CRAC, that integrates genomic locations and local coverage to enable such predictions to be made directly from RNA-seq read analysis. A k-mer profiling approach detects candidate mutations, indels and splice or chimeric junctions in each single read. CRAC increases precision compared with existing tools, reaching 99.5% for splice junctions, without losing sensitivity. Importantly, CRAC predictions improve with read length. In cancer libraries, CRAC recovered 74% of validated fusion RNAs and predicted novel recurrent chimeric junctions. CRAC is available at http://crac.gforge.inria.fr.
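To make the idea of a per-read k-mer profile concrete, the sketch below counts how often each k-mer of a read occurs across a read collection. This is only an illustration of k-mer support profiling in general, not CRAC's algorithm, which also uses genomic locations; the function names are hypothetical.

```python
from collections import Counter


def kmers(seq, k):
    """Return all overlapping k-mers of seq, in order."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def support_profile(read, reads, k):
    """For each k-mer of `read`, count its occurrences across all `reads`."""
    counts = Counter(km for r in reads for km in kmers(r, k))
    return [counts[km] for km in kmers(read, k)]


# Hypothetical toy data: two identical reads and one read with a single substitution.
toy_reads = ["ACGTACGT", "ACGTACGT", "ACGTTCGT"]
profile = support_profile("ACGTTCGT", toy_reads, 4)
```

In this toy case, every k-mer overlapping the substituted base has lower support than the k-mers that do not overlap it. A sustained drop of that kind along a read is the sort of signal a k-mer profiling method can use to flag a candidate variant or junction.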

8.

Background

Homologous recombination in Escherichia coli creates patches (non-crossovers) or splices (half crossovers), each of which may have associated heteroduplex DNA. Heteroduplex patches have recombinant DNA in one strand of the duplex, with parental flanking markers. Which DNA strand is exchanged in heteroduplex patches reflects the molecular mechanism of recombination. Several models for the mechanism of E. coli RecBCD-mediated recombinational double-strand-end (DSE) repair specify that only the 3′-ending strand invades the homologous DNA, forming heteroduplex in that strand. There is, however, in vivo evidence that patches are found in both strands.

Methodology/Principal Findings

This paper re-examines heteroduplex-patch-strand polarity using phage λ and the λdv plasmid as DNA substrates recombined via the E. coli RecBCD system in vivo. These DNAs are mutant for λ recombination functions, including orf and rap, which were functional in previous studies. Heteroduplexes are isolated, separated on polyacrylamide gels, and quantified using Southern blots for heteroduplex analysis. This method reveals that heteroduplexes are still found in either 5′ or 3′ DNA strands in approximately equal amounts, even in the absence of orf and rap. Also observed are an independence of patch formation from the RuvC Holliday-junction endonuclease, and a slight but statistically significant alteration of patch polarity by recD mutation.

Conclusions/Significance

These results indicate that orf and rap did not contribute to the presence of patches, and imply that patches occurring in both DNA strands reflect the molecular mechanism of recombination in E. coli. Most importantly, the lack of a requirement for RuvC implies that endonucleolytic resolution of Holliday junctions is not necessary for heteroduplex-patch formation, contrary to predictions of all of the major previous models. This implies that patches are not an alternative resolution of the same intermediate that produces splices, and do not bear on models for splice formation. We consider two mechanisms that use DNA replication instead of endonucleolytic resolution for formation of heteroduplex patches in either DNA strand: synthesis-dependent-strand annealing and a strand-assimilation mechanism.

9.
10.
11.
Group II introns are large ribozymes that require the assistance of intron-encoded or free-standing maturases to splice from their pre-mRNAs in vivo. They mainly splice through the classical branching pathway, being released as RNA lariats. However, group II introns can also splice through secondary pathways like hydrolysis and circularization, leading to the release of linear and circular introns, respectively. Here, we assessed in vivo splicing of various constructs of the Ll.LtrB group II intron from the Gram-positive bacterium Lactococcus lactis. The study of excised intron junctions revealed, in addition to branched intron lariats, the presence of perfect end-to-end intron circles and alternatively circularized introns. Removal of the branch point A residue prevented Ll.LtrB excision through the branching pathway but did not hinder intron circle formation. Complete intron RNA circles were found associated with the intron-encoded protein LtrA, forming RNPs that were nevertheless inactive. Traces of double-stranded head-to-tail intron DNA junctions were also detected in L. lactis RNA and nucleic acid extracts. Some intron circles and alternatively circularized introns harbored a variable number of non-encoded nucleotides at their splice junction. The presence of mRNA fragments at the splice junction of some intron RNA circles provides insights into the group II intron circularization pathway in bacteria.

12.
Accurate mapping of spliced RNA-Seq reads to genomic DNA has been known as a challenging problem. Despite significant efforts invested in developing efficient algorithms, with the human genome as a primary focus, the best solution is still not known. A recently introduced tool, TrueSight, has demonstrated better performance compared with earlier developed algorithms such as TopHat and MapSplice. To improve detection of splice junctions, TrueSight uses information on statistical patterns of nucleotide ordering in intronic and exonic DNA. This line of research led to yet another new algorithm, UnSplicer, designed for eukaryotic species with compact genomes where functional alternative splicing is likely to be dominated by splicing noise. Genome-specific parameters of the new algorithm are generated by GeneMark-ES, an ab initio gene prediction algorithm based on unsupervised training. UnSplicer shares several components with TrueSight; the difference lies in the training strategy and the classification algorithm. We tested UnSplicer on RNA-Seq data sets of Arabidopsis thaliana, Caenorhabditis elegans, Cryptococcus neoformans and Drosophila melanogaster. We have shown that splice junctions inferred by UnSplicer are in better agreement with knowledge accumulated on these well-studied genomes than predictions made by earlier developed tools.

13.
Werner syndrome (WS) is an autosomal recessive segmental progeroid syndrome caused by null mutations at the WRN locus, which codes for a member of the RecQ family of DNA helicases. Since 1988, the International Registry of Werner Syndrome has enrolled 130 molecularly confirmed WS cases from among 110 worldwide pedigrees. We now report 18 new mutations, including two genomic rearrangements, a deep intronic mutation resulting in a novel exon, a splice consensus mutation leading to utilization of the nearby splice site, and two rare missense mutations. We also review evidence for founder mutations among various ethnic/geographic groups. Founder WRN mutations had been previously reported in Japan and Northern Sardinia. Our Registry now suggests characteristic mutations originated in Morocco, Turkey, The Netherlands and elsewhere.

14.
The µ opioid receptor gene, OPRM1, undergoes extensive alternative pre-mRNA splicing in rodents and humans, with dozens of alternatively spliced variants of the OPRM1 gene. The present studies establish a SYBR green quantitative PCR (qPCR) assay to more accurately quantify mouse OPRM1 splice variant mRNAs. Using these qPCR assays, we examined the expression of OPRM1 splice variant mRNAs in selected brain regions of four inbred mouse strains displaying differences in µ opioid-induced tolerance and physical dependence: C57BL/6J, 129P3/J, SJL/J and SWR/J. The complete mRNA expression profiles of the OPRM1 splice variants reveal marked differences in variant mRNA expression among the brain regions in each mouse strain, suggesting region-specific alternative splicing of the OPRM1 gene. The expression of many variants was also strain-specific, implying a genetic influence on OPRM1 alternative splicing. The expression levels of a number of the variant mRNAs in certain brain regions appear to correlate with strain sensitivities to morphine analgesia, tolerance and physical dependence in the four mouse strains.

15.
Pre-mRNA splicing involves two transesterification steps catalyzed by the spliceosome. How RNA substrates are positioned in each step, and the molecular rearrangements involved, remain obscure. Here, we show that mutations in PRP16, PRP8, SNU114 and the U5 snRNA that affect this process interact genetically with CWC21, which encodes the yeast orthologue of the human SR protein SRm300/SRRM2. Our microarray analysis shows changes in 3′ splice site selection at elevated temperature in a subset of introns in cwc21Δ cells. Considering all the available data, we propose a role for Cwc21p in positioning the 3′ splice site at the transition to the second step conformation of the spliceosome, mediated through its interactions with the U5 snRNP. This suggests a mechanism whereby SRm300/SRRM2 might influence splice site selection in human cells.

16.
GC-AG introns represent 0.7% of total human pre-mRNA introns. To study the function of GC-AG introns in splicing regulation, 196 cDNA-confirmed GC-AG introns were identified in Caenorhabditis elegans. These represent 0.6% of the cDNA-confirmed intron data set for this organism. Eleven of these GC-AG introns are involved in alternative splicing. In a comparison of the genomic sequences of homologous genes between C. elegans and Caenorhabditis briggsae for 26 GC-AG introns, the C at the +2 position is conserved in only five of these introns. A system to experimentally test the function of GC-AG introns in alternative splicing was developed. Results from these experiments indicate that the conserved C at the +2 position of the tenth intron of the let-2 gene is essential for developmentally regulated alternative splicing. This C allows the splice donor to function as a very weak splice site that works in balance with an alternative GT splice donor. A weak GT splice donor can functionally replace the GC splice donor and allow for splicing regulation. These results indicate that while the majority of GC-AG introns appear to be constitutively spliced and have no evolutionary constraints to prevent them from being GT-AG introns, a subset of GC-AG introns is involved in alternative splicing and the C at the +2 position of these introns can have an important role in splicing regulation.
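The GT-AG versus GC-AG distinction comes down to the first two and last two bases of an intron, with the +2 position being the second base of the donor dinucleotide. A minimal sketch of tallying intron classes is given below. It assumes introns are supplied as DNA strings in 5′ to 3′ sense orientation; the function names and the toy sequences are hypothetical and are not data from the study.

```python
from collections import Counter


def classify_intron(intron):
    """Label an intron by its donor and acceptor dinucleotides, e.g. 'GT-AG' or 'GC-AG'."""
    s = intron.upper()
    if len(s) < 4:
        raise ValueError("intron too short to have distinct donor and acceptor dinucleotides")
    return f"{s[:2]}-{s[-2:]}"


def splice_type_fractions(introns):
    """Return the fraction of introns in each donor-acceptor class."""
    counts = Counter(classify_intron(i) for i in introns)
    total = sum(counts.values())
    return {kind: n / total for kind, n in counts.items()}


# Hypothetical toy introns, for illustration only.
toy_introns = ["GTAAGTTTCAG", "GCAAGTTTCAG", "GTATGTTTTAG", "GTGAGTCCCAG"]
fractions = splice_type_fractions(toy_introns)
```

Run over a cDNA-confirmed intron set, a tally like this would give the kind of class percentages quoted in the abstract.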

17.
18.
Purine-rich enhancers are exon sequences that promote inclusion of alternative exons, usually via activation of weak upstream 3′ splice sites. A recently described purine-rich enhancer from the caldesmon gene has an additional activity by which it directs selection of competing 5′ splice sites within an alternative exon. In this study, we have compared the caldesmon enhancer with another purine-rich enhancer from the chicken cardiac troponin T (cTNT) gene for the ability to regulate flanking splice sites. Although similar in sequence and length, the two enhancers demonstrated strikingly different specificities towards 5′ splice site choice when placed between competing 5′ splice sites in an internal exon. The 32-nucleotide caldesmon enhancer caused effective usage of the exon-internal 5′ splice site, whereas the 30-nucleotide cTNT enhancer caused effective usage of the exon-terminal 5′ splice site. Both enhancer-mediated splicing pathways represented modulation of the default pathway in which both 5′ splice sites were utilized. Each enhancer is multipartite, consisting of two purine-rich sequences of a simple (GAR)n repeat interdigitated with two enhancer-specific sequences. The entire enhancer was necessary for maximal splice site selectivity; however, a 5- to 7-nucleotide region from the 3′ end of each enhancer dictated splice site selectivity. Mutations that interchanged this short region of the two enhancers switched specificity. The portion of the cTNT enhancer determinative for 5′ splice site selectivity was different than that shown to be maximally important for activation of a 3′ splice site, suggesting that enhancer environment can have a major impact on activity. These results are the first indication that individual purine-rich enhancers can differentiate between flanking splice sites. 
Furthermore, localization of the specificity of splice site choice to a short region within both enhancers indicates that subtle differences in enhancer sequence can have profound effects on the splicing pathway.

19.
Gene, 1998, 208(2): 279-283
In an effort to obtain a small genomic construct for the generation of a HIRA transgenic mouse, we have isolated and sequenced the Fugu TUPLE1/HIRA gene. We have compared the gene organization and the proteins encoded in pufferfish and human and also searched for conserved DNA sequences that might be important in gene regulation. The pufferfish gene spans approx. 9 kb, which is approx. 11 times smaller than the human gene, owing to the reduced size of the introns. Like its human counterpart, it is organized into 25 exons. The majority of the splice sites are in identical positions to those found in the human gene; however, for three internal exons the positions of the splice sites are not directly comparable. The coding regions are almost identical in size and show a high degree of similarity, especially at the amino and carboxy termini. Comparisons of 5′ and 3′ sequences failed to detect similarities or sequences involved in regulation.

20.