Similar articles
1.
In this article, we introduce three 3D graphical representations of DNA primary sequences, which we call the RY-curve, MK-curve and SW-curve, based on three classifications of the DNA bases. The advantages of our representations are that (i) these 3D curves are strictly non-degenerate, so no information is lost when a DNA sequence is converted to its mathematical representation, and (ii) the coordinates of every node on these 3D curves have a clear biological meaning. Two applications of these 3D curves are presented: (a) a simple formula is derived to calculate the content of the four bases (A, G, C and T) from the coordinates of nodes on the curves; and (b) a 12-component characteristic vector is constructed to compare similarity among DNA sequences from different species based on the geometrical centers of the 3D curves. As examples, we examine similarity among the coding sequences of the first exon of the beta-globin gene from eleven species and validate similarity of cDNA sequences of the beta-globin gene from eight species.
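
The three base classifications named in the abstract above can be illustrated with a minimal sketch. It assumes the standard IUPAC groupings (R/Y for purine/pyrimidine, M/K for amino/keto, S/W for strong/weak hydrogen bonding) and does not reproduce the authors' curve coordinates, which the abstract does not give. The function names `characteristic_sequences` and `decode` are hypothetical. The sketch only shows why no information is lost: any two of the three encodings determine each base.

```python
# Three two-class groupings of the DNA bases (IUPAC codes).
# Assumption: these are the classifications behind the RY-, MK- and SW-curves.
CLASSES = {
    "RY": {"A": "R", "G": "R", "C": "Y", "T": "Y"},  # purine / pyrimidine
    "MK": {"A": "M", "C": "M", "G": "K", "T": "K"},  # amino / keto
    "SW": {"G": "S", "C": "S", "A": "W", "T": "W"},  # strong / weak H-bond
}


def characteristic_sequences(dna):
    """Return the RY, MK and SW two-letter encodings of a DNA string."""
    dna = dna.upper()
    return {name: "".join(table[base] for base in dna)
            for name, table in CLASSES.items()}


def decode(ry, mk):
    """Recover the bases from the RY and MK encodings alone."""
    lookup = {("R", "M"): "A", ("R", "K"): "G",
              ("Y", "M"): "C", ("Y", "K"): "T"}
    return "".join(lookup[pair] for pair in zip(ry, mk))


seqs = characteristic_sequences("ATGGTGCAC")
restored = decode(seqs["RY"], seqs["MK"])  # same string as the input
```

Because each base has a unique (RY, MK) pair, the round trip through two encodings is lossless; the third encoding is redundant but gives a symmetric set of three views.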

2.
We introduce a novel 2D graphical representation of DNA sequences based on the pairs of neighboring nucleotides (PNNs). Then we get the PNNs' distributions and obtain a y-M. The construction of the PNN-curve has some important advantages: (1) it avoids loss of information, and the PNN-curve representing a DNA sequence does not overlap or intersect itself; (2) the novel 2D representation is more sensitive. The utility of this method is illustrated by examining the similarities/dissimilarities among the coding sequences of the first exon of the beta-globin gene of eleven different species in Table 2.
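
As a rough illustration of the first step described above, the distribution of neighboring nucleotide pairs can be tabulated as follows. This is only a sketch under stated assumptions: pairs are taken as overlapping, the input contains only A, C, G and T, and the function name `pnn_distribution` is hypothetical. The construction of the PNN-curve itself is not specified in the abstract and is not attempted here.

```python
from collections import Counter
from itertools import product


def pnn_distribution(dna):
    """Relative frequency of each of the 16 overlapping neighboring pairs.

    Assumes the sequence contains only the letters A, C, G and T.
    """
    dna = dna.upper()
    if len(dna) < 2:
        raise ValueError("need at least two nucleotides to form a pair")
    counts = Counter(dna[i:i + 2] for i in range(len(dna) - 1))
    total = sum(counts.values())
    # Report all 16 pairs, including those that never occur (frequency 0).
    return {a + b: counts[a + b] / total for a, b in product("ACGT", repeat=2)}


dist = pnn_distribution("ATGGTG")  # pairs: AT, TG, GG, GT, TG
```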

3.
4.
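To extract more of the information hidden in protein sequences, this study arranges the 20 amino acids uniformly on the unit circle to obtain a two-dimensional coordinate for each amino acid, combines these coordinates with six physicochemical indices of the amino acids, and finally characterizes a protein sequence by an eight-dimensional vector. To prevent differences in data range from affecting the analysis, the eight-dimensional vectors are normalized. Based on the normalized vector representation, a neural network is used to classify protein sequences, and the Euclidean distance between vectors is used to quantify the similarity between sequences. Finally, taking the ND5 protein sequences of 9 species and the ND6 protein sequences of 8 species as examples, and using Clustal W sequence alignment as the benchmark, the proposed method is validated and compared with the 5-letter method; the results show that the proposed method is effective.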
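
The geometric and distance steps described above might be sketched as follows. Several details are assumptions, because the abstract does not state them: the amino acids are ordered alphabetically by one-letter code, normalization is taken to be min-max scaling per component, and the six physicochemical indices are left out entirely since their values are not given. All function names are hypothetical.

```python
import math

# Assumption: alphabetical one-letter codes; the study's ordering is not stated.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def circle_coordinates(alphabet=AMINO_ACIDS):
    """Place each residue at an equally spaced angle on the unit circle."""
    n = len(alphabet)
    return {aa: (math.cos(2 * math.pi * i / n), math.sin(2 * math.pi * i / n))
            for i, aa in enumerate(alphabet)}


def min_max_normalize(vectors):
    """Rescale every component to [0, 1] across the given vectors."""
    columns = list(zip(*vectors))
    lows = [min(col) for col in columns]
    # A constant column has span 0; use 1.0 so the division is safe.
    spans = [(max(col) - min(col)) or 1.0 for col in columns]
    return [[(x - lo) / span for x, lo, span in zip(vec, lows, spans)]
            for vec in vectors]


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


coords = circle_coordinates()
scaled = min_max_normalize([[1, 10], [3, 30], [2, 20]])
distance = euclidean(scaled[0], scaled[1])
```

Normalizing before measuring distance keeps a component with a large numeric range from dominating the Euclidean distance, which is the concern the abstract raises about data range.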

5.
DNA sequencing has resulted in an abundance of data on DNA sequences for various species. Hence, the characterization and comparison of sequences have become more important but remain difficult tasks. In this paper, we first give a 2-D ladderlike graphical representation for the characteristic sequences of a DNA sequence, and then construct a 3-component vector, in which the normalized ALE-indices extracted from these three 2-D graphs via D/D matrices are the individual components, to characterize the DNA sequence. The examination of similarities/dissimilarities among sequences of the beta-globin genes of different species illustrates the utility of the approach.

6.
Summary: Part of the beta-globin genes of Macaca cynomolgus and Gorilla gorilla has been cloned and sequenced. Ten putatively neutral nucleotide polymorphisms have been described at the beta-globin locus in humans. They are associated in seven combinations, which define seven different haplotypes of the beta-globin gene: four major frameworks (1, 2, 3, and 3*) and three minor frameworks, which we term KI1, KA1, and OR1. The nucleotide sequences of these frameworks are compared with those of homologous sequences in chimpanzee, colobus, macaque, and gorilla. This comparison provides strong evidence that framework 2 was the earliest framework in the human lineage. From framework 2, a rooted parsimonious tree for the six other frameworks is constructed. This phylogenetic tree is discussed in terms of the evolution of nucleotide polymorphisms as well as in terms of genetic affinities between human populations. For each position at which there is a base difference in comparing human, gorilla, and chimpanzee beta-globin genes, the phyletic lineage where the corresponding substitution occurred has been identified using the maximum parsimony procedure. The data provide evidence that polymorphisms may represent a significant component of differences between closely related species. If so, nucleotide polymorphisms may strongly bias estimates of small evolutionary distances.

7.
We have analysed beta-globin mRNA sequences in total RNA extracted from embryos and tadpoles of Xenopus laevis at different stages of development and we have identified the most abundantly transcribed beta-globin mRNA (beta T1). The entire nucleotide sequence of a cDNA clone corresponding to this mRNA is known. We have now identified the gene corresponding to this mRNA and we have determined the nucleotide sequences of its immediate 5'-flanking region. Using a DNA fragment from within the coding region of the cloned beta T1 cDNA we show, by primer extension analysis, that beta T1 mRNA is first detectable at stage 28-32 of development. This is the time at which the first presumptive erythropoietic tissue, the ventral blood island, becomes observable histologically. We show that two minor beta-globin genes, distinct from beta T1, are expressed during early stages of development, and that their expression ceases shortly after the beginning of the feeding stage. We term these two early larval genes beta E1 and beta E2. A third minor beta-globin gene is expressed during early development but, unlike beta E1 and beta E2, it is also expressed throughout subsequent larval development. We term this gene beta T2 and show that it corresponds to a gene previously termed beta LII. Finally, using a primer derived from the major adult beta-globin gene (beta 1), we have analysed the accumulation of the major adult beta-globin mRNA during larval development, and we show that this sequence does not accumulate to any significant level before metamorphosis.

8.
Primary structure of the goat beta-globin locus control region   Cited by: 6 (self-citations: 0; citations by others: 6)
The goat beta-globin cluster is composed of a triplicated four-gene set. A locus control region (LCR) containing elements homologous to 5'DNase I hypersensitive sites (HS) 1, 2, and 3 of the human beta-globin LCR has been identified at the 5' end of this locus. We determined 10.2 kb of nucleotide sequence from the goat beta-globin locus control region. Self-comparison of this sequence by dot matrix analysis revealed the presence of six complete and three incomplete artiodactyl repeats. A novel repeated element, termed D repeat, was also identified. Southern blotting analysis demonstrated that these elements exist in the goat genome as a low to medium frequency interspersed repeat family. The absence of any other large region of self-homology (direct or inverted) in the goat LCR suggests that 5'HSs 1, 2, and 3 did not arise through duplication, but rather evolved independently. By comparing goat 5'HS 1 to those of human, rabbit, and mouse, we show a greater than 80% conservation in sequence between the four species. This level of evolutionary conservation suggests that 5'HS 1 plays an important role in the regulation of beta-globin loci.

9.
To produce transgenic mice carrying human beta-globin genes, we introduced the following two gene constructs into the male pronuclei of fertilized mouse eggs: 4.4 kb Pst I/Pst I sequences of the human beta-globin gene (experiment 1) and the human beta-globin gene cluster (cosHG 28) containing G gamma, A gamma, delta and beta-globin genes and cosmid vector pJB8 (37.5 kb, experiment 2). In experiment 1, 25 mice were born, and four (one female and three males) carrying the injected gene sequences were identified. One of these mice carried the entire sequence of the human beta-globin gene but the three others appeared to carry only a part of the entire sequence. The mouse with the entire sequence showed a slight increase in the minor component of the mouse beta-globin chain in the same position as the human beta-globin chain. In experiment 2, 61 mice were born, and nine (three females and six males) carried the sequences of the injected gene. However, from DNA analysis, no appropriate sequences present within the A gamma- or beta-globin gene were identified in any of the founder mice. In this case, DNA fragments of the gene cluster that were digested in the mouse nucleus after microinjection of the gene might be integrated into host DNA.

10.
11.
MOTIVATION: Many proposed statistical measures can efficiently compare biological sequences to further infer their structures, functions and evolutionary information. They are related in spirit because all the ideas for sequence comparison try to use the information on the k-word distributions, a Markov model, or both. Motivated by adding k-word distributions to the Markov model directly, we investigated two novel statistical measures for sequence comparison, called wre.k.r and S2.k.r. RESULTS: The proposed measures were tested by similarity search, evaluation on functionally related regulatory sequences and phylogenetic analysis. This offers a systematic and quantitative experimental assessment of our measures. Moreover, we compared our results with those of alignment-based and alignment-free methods. We grouped our experiments into two sets. The first one, performed via ROC (receiver operating characteristic) analysis, aims at assessing the intrinsic ability of our statistical measures to search for similar sequences from a database and discriminate functionally related regulatory sequences from unrelated sequences. The second one aims at assessing how well our statistical measures perform in phylogenetic analysis. The experimental assessment demonstrates that our similarity measures, which incorporate k-word distributions into the Markov model, are more efficient.

12.
Single-copy human beta-globin transgenes are very susceptible to suppression by position effects of surrounding closed chromatin. However, these position effects are overcome by a 20 kbp DNA fragment containing the locus control region (LCR). Here we show that the 6.5 kbp microlocus LCR cassette reproducibly directs full expression from independent single-copy beta-globin transgenes. By testing individual DNase I-hypersensitive sites (HS) present in the microlocus cassette, we demonstrate that the 1.5 kbp 5'HS2 enhancer fragment does not direct beta-globin expression from single-copy transgenes. In contrast, the 1.9 kbp 5'HS3 fragment directs beta-globin expression in five independent single-copy transgenic mouse lines. Moreover, the 5'HS3 core element and beta-globin proximal promoter sequences are DNase I hypersensitive in fetal liver nuclei of these expressing transgenic lines. Taken together, these results demonstrate that LCR activity is the culmination of at least two separable functions including: (i) a novel activity located in 5'HS3 that dominantly opens and remodels chromatin structure; and (ii) a recessive enhancer activity residing in 5'HS2. We postulate that the different elements of the LCR form a 'holocomplex' that interacts with the individual globin genes.

13.
Saccharomyces uvarum is proposed as a proper species within the complex Saccharomyces sensu stricto. Molecular characteristics including the similarity of the restriction profile of the non-transcribed spacer 2 (NTS2) and of the D1/D2 sequences of the rDNA, as well as other genotypic and phenotypic characteristics confirm that this group of strains is highly homogeneous and distinguishable from other species of the Saccharomyces sensu stricto group.

14.
Three methanol-assimilating yeast strains representing a hitherto undescribed species were isolated from rotten wood and freshwater samples collected in Hungary. Analysis of the D1/D2 large subunit rRNA gene sequences placed the strains in the Kuraishia clade; however, no ascospore formation was observed. These strains differ from Candida hungarica, the genetically most closely related recognized species, by four and five substitutions in D1/D2 and by >1% and 4% differences in the internal transcribed spacer and in the mitochondrial small subunit rRNA gene regions, respectively. Some phenotypic differences were also observed. Candida ogatae, a novel yeast species, is proposed to accommodate these isolates. The type culture is NCAIM Y.01845T (CBS 10924, NRRL Y-48474).

15.
16.
The degree of similarity of DNA sequences can be determined by comparing the sequences, which helps to infer their relationships with respect to structure, function and evolution. In this paper, we introduce the fundamentals of a weighted relative entropy based on a 2-step Markov model for comparing DNA sequences. A DNA sequence, consisting of the four characters A, T, C, G, can be considered as a Markov chain. By taking the state space I = {A, T, C, G} and describing a DNA sequence with its 2-step transition probability matrix, we can get the eigenvalue of the DNA sequence to define the similarity metric. This gives a new method for comparing DNA sequences, which is used to classify chromosome DNA sequences obtained from 30 species. The phylogenetic tree built by this alignment-free method from the distance matrix derived from the weighted relative entropy has a clearer and more accurate division.
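
The Markov-chain view described above can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' method: the 2-step matrix is taken to be the square of the one-step transition matrix, a pseudocount is added so that no probability is zero, and an unweighted row-wise relative entropy stands in for the weighted measure, whose weights and eigenvalue step the abstract does not specify. The function names are hypothetical.

```python
import math

STATES = "ATCG"  # state space I = {A, T, C, G}


def transition_matrix(dna, pseudocount=1.0):
    """One-step transition probabilities estimated from adjacent bases.

    Assumes an upper-case sequence over A, T, C, G. The pseudocount
    (an assumption of this sketch) keeps every probability positive.
    """
    index = {s: i for i, s in enumerate(STATES)}
    counts = [[pseudocount] * 4 for _ in range(4)]
    for a, b in zip(dna, dna[1:]):
        counts[index[a]][index[b]] += 1
    return [[c / sum(row) for c in row] for row in counts]


def two_step(p):
    """Two-step transition matrix, computed as the matrix product P * P."""
    return [[sum(p[i][k] * p[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def relative_entropy(p, q):
    """Sum of row-wise Kullback-Leibler divergences between two matrices."""
    return sum(x * math.log(x / y)
               for row_p, row_q in zip(p, q)
               for x, y in zip(row_p, row_q))


p2 = two_step(transition_matrix("ATATATATGC"))
q2 = two_step(transition_matrix("GGGCCCGGGCCC"))
divergence = relative_entropy(p2, q2)
```

Relative entropy is not symmetric, so a distance matrix for tree building would typically use a symmetrized form such as the sum of both directions; whether the original work does so is not stated in the abstract.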

17.
18.
Endogenous retrovirus-related sequences exist within the normal genomic DNA of all eukaryotes, and these endogenous sequences have been shown to be important to the nature and biology of related exogenous retroviruses and may also play a role in cellular functions. To date, no endogenous sequences related to human immunodeficiency virus type 1 (HIV-1) have been reported. Herein we describe the first report of the presence of nucleotide sequences related to HIV-1 in human, chimpanzee, and rhesus monkey DNAs from normal uninfected individuals. We also present the isolation and characterization of two of these endogenous HIV-1-related sequences, EHS-1 and EHS-2. With use of low-stringency Southern blot hybridization, complex banding patterns were detected in human DNA with 5' and 3' HIV-1-derived probes. When an HIV-1 env region probe was used, we detected a less complex, conserved banding pattern in human DNA as well as a related but distinct banding pattern in chimpanzee and rhesus monkey DNAs. EHS-1 and -2 were cloned from normal human genomic DNA libraries by using the env region probe. Clone EHS-1 shows sequence similarity with the domain of the envelope cellular protease cleavage site of HIV-1, while EHS-2 has sequence similarity to the overlapping reading frame for Rev and gp41. Stringent hybridization of EHS-1 back to primate genomic DNA indicates two distinct EHS-1 loci in normal human DNA, an identical band pattern in chimpanzee DNA, and a single locus in rhesus monkey DNA. Likewise, EHS-2 is present as a single highly conserved locus in all three species. An oligonucleotide derived from EHS-2 across a region of near identity to HIV-1 detects a complex banding pattern in all primates tested similar to that seen with the 3' HIV-1 probe. 
These data suggest that most of the HIV-1-related sequences identified in primate DNA share a common core of nucleic acid sequence found in both EHS-2 and rev and that some of these HIV-1-related sequences have additional larger regions of sequence similarity to HIV-1.

19.
20.
In this paper, we first present a new concept of ‘weight’ for the 64 triplets and define a different weight for each kind of triplet. Then, we give a novel 2D graphical representation for DNA sequences, which transforms a DNA sequence into a plot set to facilitate quantitative comparisons of DNA sequences. Thereafter, together with a newly designed measure of similarity, we introduce a novel approach to analyzing the similarities/dissimilarities of DNA sequences. Finally, an application to the complete coding sequences of the β-globin genes of 11 species illustrates the utility of our newly proposed method.
