首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
3.
We present data on the frequencies of nucleotides and nucleotide substitutions in conservative DNA regions involved in the regulation of gene expression. Data on prokaryotes and eukaryotes are considered separately. In both cases DNA strands complementary to those which serve as templates for RNA-polymerase have low frequencies of cytosine. The most conservative positions also have an increased frequency of adenine. Various substitutions in the series of homologous regulatory DNA sequences, as compared to their consensuses, have different frequencies. In prokaryotes guanine in a consensus sequence is substituted for at the lowest and adenine at the highest frequency, whereas in eukaryotes cytosine is substituted for at the lowest and guanine at the highest frequency. In both cases the nucleotides substituted for are most frequently replaced with cytosine. Deviations from consensus sequences tend to cluster in adjacent positions. The more pronounced the consequences of a nucleotide substitution are the higher is the frequency of substitutions in adjacent positions. Possible explanations for these phenomena are discussed.  相似文献   

4.
Over 50% of donor splice sites in the human genome have a potential alternative donor site at a distance of three to six nucleotides. Conservation of these potential sites is determined by the consensus requirements and by its exonic or intronic location. Several hundred pairs of overlapping sites are confirmed to be alternatively spliced as both sites in a pair are supported by a protein, by a full-length mRNA, or by expressed sequence tags (ESTs) from at least two independent clone libraries. Overlapping sites may clash with consensus requirements. Pairs with a site shift of four nucleotides are the most abundant, despite the frameshift in the protein-coding region that they introduce. The site usage in pairs is usually uneven, and the major site is more frequently conserved in other mammalian genomes. Overlapping alternative donor sites and acceptor sites may have different functional roles: alternative splicing of overlapping acceptor sites leads mainly to microvariations in protein sequences; whereas alternative donor sites often lead to frameshifts and thus either yield major differences in the protein sequence and structure, or generate nonsense-mediated decay-inducing mRNA isoforms likely involved in regulated unproductive splicing pathways.  相似文献   

5.
A low level of genetic variation has limited the application of molecular markers for characterizing important traits in cultivated tomato. To detect polymorphisms in tomato conserved ortholog sets (COS), expressed sequence tags (ESTs) were searched against tomato and Arabidopsis genomic sequences to define the positions of introns. Introns were amplified from 12 different accessions of tomato by polymerase chain reaction and nucleotide sequences were determined by sequencing. Results indicated that there was a possibility of 71% to amplify introns from tomato genomic DNA through this approach. A total of 201 introns were sequenced from 86 COS unigenes. The intron positions and numbers were conserved between tomato and Arabidopsis, but average intron length was three times longer in tomato than in Arabidopsis. A total of 307 single nucleotide polymorphisms (SNPs) and 75 indels were detected in introns of 57 COS unigenes among 12 tomato lines. Within cultivated tomato germplasm 172 SNPs and 47 indels were detected in introns of 33 COS unigenes. In addition, 41 SNPs were identified in the exons of 27 COS unigenes. The frequency of SNPs was 2.4 times higher in introns than in exons in the 22 COS unigenes having both intronic and exonic polymorphisms. These results indicate that intronic regions may contain sufficient variation to develop sufficient marker resources for genome-wide analysis in cultivated tomato.  相似文献   

6.
Functional studies on the ATM intronic splicing processing element   总被引:1,自引:1,他引:0  
In disease-associated genes, the understanding of the functional significance of deep intronic nucleotide variants may represent a difficult challenge. We have previously reported a new disease-causing mechanism that involves an intronic splicing processing element (ISPE) in ATM, composed of adjacent consensus 5′ and 3′ splice sites. A GTAA deletion within ISPE maintains potential adjacent splice sites, disrupts a non-canonical U1 snRNP interaction and activates an aberrant exon. In this paper, we demonstrate that binding of U1 snRNA through complementarity within a ~40 nt window downstream of the ISPE prevents aberrant splicing. By selective mutagenesis at the adjacent consensus ISPE splice sites, we show that this effect is not due to a resplicing process occurring at the ISPE. Functional comparison of the ATM mouse counterpart and evaluation of the pre-mRNA splicing intermediates derived from affected cell lines and hybrid minigene assays indicate that U1 snRNP binding at the ISPE interferes with the cryptic acceptor site. Activation of this site results in a stringent 5′–3′ order of intron sequence removal around the cryptic exon. Artificial U1 snRNA loading by complementarity to heterologous exonic sequences represents a potential therapeutic method to prevent the usage of an aberrant CFTR cryptic exon. Our results suggest that ISPE-like intronic elements binding U1 snRNPs may regulate correct intron processing.  相似文献   

7.
The mitochondrial genome of 23 wild-type strains belonging to three different species of the filamentous fungus Podospora was examined. Among the 15 optional sequences identified are two intronic reading frames, nad1-i4-orf1 and cox1-i7-orf2. We show that the presence of these sequences was strictly correlated with tightly clustered nucleotide substitutions in the adjacent exon. This correlation applies to the presence or absence of closely related open reading frames (ORFs), found at the same genetic locations, in all the Pyrenomycete genera examined. The recent gain of these optional ORFs in the evolution of the genus Podospora probably account for such sequence differences. In the homoplasmic progeny from heteroplasmons constructed between Podospora strains differing by the presence of these optional ORFs, nad1-i4-orf1 and cox1-i7-orf2 appeared highly invasive. Sequence comparisons in the nad1-i4 intron of various strains of the Pyrenomycete family led us to propose a scenario of its evolution that includes several events of loss and gain of intronic ORFs. These results strongly reinforce the idea that group I intronic ORFs are mobile elements and that their transfer, and comcomitant modification of the adjacent exon, could participate in the modular evolution of mitochondrial genomes.  相似文献   

8.
Cystic fibrosis is a common human genetic disease caused by mutations in CFTR, a gene that codes for a chloride channel that is regulated by phosphorylation and cytosolic nucleotides. As part of a program to discover natural animal models for human genetic diseases, we have determined the genomic sequence of CFTR in the Rhesus monkey, Macaca mulatta. The coding region of rhesus CFTR is 98.3% identical to human CFTR at the nucleotide level and 98.2% identical and 99.7% similar at the amino acid level. Partial sequences of flanking introns (5582 base pair positions analyzed) revealed 91.1% identity with human introns. Relative to rhesus intronic sequence, the human sequences had 27 insertions and 22 deletions. Primer sequences for amplification of rhesus genomic CFTR sequences are provided. The accession number is AF013753 (all 27 exons and some flanking intronic sequence). Received: 27 August 1992 / Accepted: 5 December 1997  相似文献   

9.
The primary structure of the rat insulin-like growth factor II gene region   总被引:1,自引:0,他引:1  
Complete nucleotide sequences of the rat insulin-like growth factor II gene region including 5' 18 kilobases (kb) up to the insulin gene, all exonic and intronic, and 3' 6 kb sequences were determined. Among these sequences several repetitive stretches became evident besides integration of type II Alu and identifier sequences. They were: (1) twelve repetitions of about 100 base pair (bp) units; (2) duplication of 60 bp units; (3) triplication of a 24 bp unit; and (4) 41-fold expansion of 12-15 bp units.  相似文献   

10.
11.

Background  

Predicting and proper ranking of canonical splice sites (SSs) is a challenging problem in bioinformatics and machine learning communities. Any progress in SSs recognition will lead to better understanding of splicing mechanism. We introduce several new approaches of combining a priori knowledge for improved SS detection. First, we design our new Bayesian SS sensor based on oligonucleotide counting. To further enhance prediction quality, we applied our new de novo motif detection tool MHMMotif to intronic ends and exons. We combine elements found with sensor information using Naive Bayesian Network, as implemented in our new tool SpliceScan.  相似文献   

12.
Type VI secretion systems (T6SSs) are widespread, tightly regulated, protein delivery apparatuses used by Gram-negative bacteria to outcompete their neighbours. The pathogen, Vibrio parahaemolyticus, encodes two T6SSs. These T6SSs are differentially regulated by external conditions. T6SS1, an antibacterial system predominantly found in pathogenic isolates, requires warm marine-like conditions and surface sensing for activation. The regulatory network that governs this activation is not well understood. In this work, we devised a screening methodology that allows us to easily monitor the outcome of bacterial competitions and thus to identify mutants that are defective in T6SS1-mediated bacterial killing. The methodology, termed Ba cterial Co mpetition F luorescence (BaCoF), relies on detection of a fluorescent signal as an indicator of the survival and growth of a T6SS-sensitive, GFP-expressing prey that has been co-cultured with mutants derived from a T6SS+ attacker of interest. Using BaCoF, we screened a random transposon insertion mutant library and identified genes required for V. parahaemolyticus T6SS1 activation, among them TfoY and Tmk. We used epistasis experiments to determine the relationships between the newly identified components and other regulators that were previously described. Thus, we present here a detailed biological understanding of the T6SS1 regulatory network.  相似文献   

13.
The complete nucleotide sequence of an active class I HLA gene, HLA-A3, has been determined. This sequence, together with that obtained for the HLA-CW3 gene, represents the first complete nucleotide sequence to be determined for functional class I HLA genes. The gene organisation of HLA-A3 closely resembles that of class I H-2 genes in mouse: it shows a signal exon, three exons encoding the three extracellular domains, one exon encoding the transmembrane region and three exons encoding the cytoplasmic domain. The complete nucleotide sequences of the active HLA genes, HLA-A3 and HLA-CW3, now permit a meaningful comparison of the nucleotide sequences of class I HLA genes by alignment with the sequence established for a HLA-B7-specific cDNA clone and the sequences of two HLA class I pseudogenes HLA 12.4 and LN- 11A . The comparisons show that there is a non-random pattern of nucleotide differences in both exonic and intronic regions featuring segmental homologies over short regions, which is indicative of a gene conversion mechanism. In addition, analysis of the frequency of nucleotide substitution at the three base positions within the codons of the functional genes HLA-A3, HLA-B7 and HLA-CW3 shows that the pattern of nucleotide substitution in the exon coding for the 3rd extracellular domain is consistent with strong selection pressure to conserve the sequence. The distribution of nucleotide variation in the other exons specifying the mature protein is nearly random with respect to the frequencies of substitution at the three nucleotide positions of their codons. The evolutionary implications of these findings are discussed.  相似文献   

14.
Zhang L  Luo L 《Nucleic acids research》2003,31(21):6214-6220
Based on the conservation of nucleotides at splicing sites and the features of base composition and base correlation around these sites we use the method of increment of diversity combined with quadratic discriminant analysis (IDQD) to study the dependence structure of splicing sites and predict the exons/introns and their boundaries for four model genomes: Caenorhabditis elegans, Arabidopsis thaliana, Drosophila melanogaster and human. The comparison of compositional features between two sequences and the comparison of base dependencies at adjacent or non-adjacent positions of two sequences can be integrated automatically in the increment of diversity (ID). Eight feature variables around a potential splice site are defined in terms of ID. They are integrated in a single formal framework given by IDQD. In our calculations 7 (8) base region around the donor (acceptor) sites have been considered in studying the conservation of nucleotides and sequences of 48 bp on either side of splice sites have been used in studying the compositional and base-correlating features. The windows are enlarged to 16 (donor), 29 (acceptor) and 80 bp (either side) to improve the prediction for human splice sites. The prediction capability of the present method is comparable with the leading splice site detector—GeneSplicer.  相似文献   

15.
16.
Carmel I  Tal S  Vig I  Ast G 《RNA (New York, N.Y.)》2004,10(5):828-840
Human-mouse comparative genomics is an informative tool to assess sequence functionality as inferred from its conservation level. We used this approach to examine dependency among different positions of the 5' splice site. We compiled a data set of 50,493 homologous human-mouse internal exons and analyzed the frequency of changes among different positions of homologous human-mouse 5' splice-site pairs. We found mutual relationships between positions +4 and +5, +5 and +6, -2 and +5, and -1 and +5. We also demonstrated the association between the exonic and the intronic positions of the 5' splice site, in which a stronger interaction of U1 snRNA and the intronic portion of the 5' splice site compensates for weak interaction of U1 snRNA and the exonic portion of the 5' splice site, and vice versa. By using an ex vivo system that mimics the effect of mutation in the 5' splice site leading to familial dysautonomia, we demonstrated that U1 snRNA base-pairing with positions +6 and -1 is the only functional requirement for mRNA splicing of this 5' splice site. Our findings indicate the importance of U1 snRNA base-pairing to the exonic portion of the 5' splice site.  相似文献   

17.
Monte Carlo simulations are useful to verify the significance of data. Genomic regularities, such as the nucleotide correlations or the not uniform distribution of the motifs throughout genomic or mature mRNA sequences, exist and their significance can be checked by means of the Monte Carlo test. The test needs good quality random sequences in order to work, moreover they should have the same nucleotide distribution as the sequences in which the regularities have been found. Random DNA sequences are also useful to estimate the background score of an alignment, that is a threshold below which the resulting score is merely due to chance. We have developed RANDNA, a free software which allows to produce random DNA or RNA sequences setting both their length and the percentage of nucleotide composition. Sequences having the same nucleotide distribution of exonic, intronic or intergenic sequences can be generated. Its graphic interface makes it possible to easily set the parameters that characterize the sequences being produced and saved in a text format file. The pseudo-random number generator function of Borland Delphi 6 is used, since it guarantees a good randomness, a long cycle length and a high speed. We have checked the quality of sequences generated by the software, by means of well-known tests, both by themselves and versus genuine random sequences. We show the good quality of the generated sequences. The software, complete with examples and documentation, is freely available to users from: http://www.introni.it/en/software.  相似文献   

18.
Regulation of glutamate receptor B pre-mRNA splicing by RNA editing   总被引:1,自引:0,他引:1  
RNA-editing enzymes of the ADAR family convert adenosines to inosines in double-stranded RNA substrates. Frequently, editing sites are defined by base-pairing of the editing site with a complementary intronic region. The glutamate receptor subunit B (GluR-B) pre-mRNA harbors two such exonic editing sites termed Q/R and R/G. Data from ADAR knockout mice and in vitro editing assays suggest an intimate connection between editing and splicing of GluR-B pre-mRNA.

By comparing the events at the Q/R and R/G sites, we can show that editing can both stimulate and repress splicing efficiency. The edited nucleotide, but not ADAR binding itself, is sufficient to exert this effect. The presence of an edited nucleotide at the R/G site reduces splicing efficiency of the adjacent intron facilitating alternative splicing events occurring downstream of the R/G site.

Lack of editing inhibits splicing at the Q/R site. Editing of both the Q/R nucleotide and an intronic editing hotspot are required to allow efficient splicing. Inefficient intron removal may ensure that only properly edited mRNAs become spliced and exported to the cytoplasm.

  相似文献   

19.

Background  

Models of sequence evolution typically assume that different nucleotide positions evolve independently. This assumption is widely appreciated to be an over-simplification. The best known violations involve biases due to adjacent nucleotides. There have also been suggestions that biases exist at larger scales, however this possibility has not been systematically explored.  相似文献   

20.
In Pseudomonas aeruginosa three type VI secretion systems (T6SSs) coexist, called H1‐ to H3‐T6SSs. Several T6SS components are proposed to be part of a macromolecular complex resembling the bacteriophage tail. The T6SS protein HsiE1 (TagJ) is unique to the H1‐T6SS and absent from the H2‐ and H3‐T6SSs. We demonstrate that HsiE1 interacts with a predicted N‐terminal α‐helix in HsiB1 (TssB) thus forming a novel subcomplex of the T6SS. HsiB1 is homologous to the Vibrio cholerae VipA component, which contributes to the formation of a bacteriophage tail sheath‐like structure. We show that the interaction between HsiE1 and HsiB1 is specific and does not occur between HsiE1 and HsiB2. Proteins of the TssB family encoded in T6SS clusters lacking a gene encoding a TagJ‐like component are often devoid of the predicted N‐terminal helical region, which suggests co‐evolution. We observe that a synthetic peptide corresponding to the N‐terminal 20 amino acids of HsiB1 interacts with purified HsiE1 protein. This interaction is a common feature to other bacterial T6SSs that display a TagJ homologue as shown here with Serratia marcescens. We further show that hsiE1 is a non‐essential gene for the T6SS and suggest that HsiE1 may modulate incorporation of HsiB1 into the T6SS.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号