Similar articles
 Found 20 similar articles (search time: 31 ms)
1.
The analysis of functionally related sequences for conserved patterns is important for further research on different functional regions. This paper presents an analysis of genes and intergenic sequences from the point of view of linguistic analysis, where gene and intergenic regions are regarded as two different subjects written in the four-letter alphabet [A, C, G, T] and high-frequency simple sequences are taken as keywords. A measurement alpha[l(tau)] was introduced to describe the relative repeat ratio of simple sequences. Cutoff values were found for keyword selection. After eliminating "noise," 87 short sequences were selected as keywords for intergenic regions and 76 for gene regions.

2.
Repetitive DNA sequences represent a substantial component of eukaryotic genomes. These sequences have been described and characterized in many mammalian species. However, little information about repetitive DNA sequences is available in bat species. Here we describe an EcoRI family of repetitive DNA sequences present in the species Miniopterus schreibersi. These repetitive sequences are 57.85% A-T rich, organized in tandem, and have a monomer unit length of 904 bp. Methylation analysis using the isoschizomer pair MspI and HpaII indicates that the cytosines present in the sequences CCGG are partially methylated. Furthermore, Southern blot analysis demonstrated that these DNA sequences are absent in the genomes of four related microbat species, suggesting that they could be specific to the M. schreibersi genome.

3.
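Construction and application of a Linux-based sequence analysis platform for cDNA libraries   Cited by: 1 (self-citations: 0, citations by others: 1)
This study built a Linux-based sequence analysis platform for cDNA libraries. The platform automatically processes large batches of sequenced reads, including vector sequence removal, sequence format conversion, automated sequence assembly, similarity searches against databases, and prediction of full-length ORFs, which can accelerate the analysis and use of large-scale sequencing data. The platform was used to analyze partial sequencing results from a full-length cDNA library constructed from salt-stressed wild soybean; good results were obtained, and several new gene sequences of potential value have been identified.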
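
As a rough illustration of the last processing step listed above (full-length ORF prediction), the following Python sketch scans the three forward reading frames for the longest ATG-to-stop stretch. The function name `longest_orf` and the simple start/stop rule are assumptions made for illustration; the abstract does not say which tool or criteria the platform uses.

```python
# Hypothetical helper, not taken from the platform described in the abstract.
STOP_CODONS = {"TAA", "TAG", "TGA"}

def longest_orf(seq):
    """Return the longest ATG...stop ORF found in the three forward frames."""
    seq = seq.upper()
    best = ""
    for frame in range(3):
        start = None  # index of the first ATG of the ORF currently open in this frame
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                orf = seq[start:i + 3]  # include the stop codon
                if len(orf) > len(best):
                    best = orf
                start = None
    return best

print(longest_orf("CCATGAAATGACC"))
```

A production pipeline would normally also scan the reverse complement and apply a minimum-length cutoff; both are left out here to keep the sketch short.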

4.
Here we propose a weighted measure for the similarity analysis of DNA sequences. It is based on LZ complexity and (0,1) characteristic sequences of DNA sequences. This weighted measure enables biologists to extract similarity information from biological sequences according to their requirements. For example, with this weighted measure, one can obtain either the full similarity information or a similarity analysis from a given biological aspect. Moreover, the length of the DNA sequence is not problematic. The application of the weighted measure to the similarity analysis of β-globin genes from nine species shows its flexibility.
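
The two building blocks named in this abstract, (0,1) characteristic sequences and LZ complexity, can be sketched in Python as follows. The three base groupings below are a common convention and are assumed here, since the abstract does not list them; the weighted measure itself is not defined in the abstract and is therefore not reproduced.

```python
# Assumed (0,1) encodings: bases in the set map to 1, all others to 0.
ENCODINGS = {
    "purine/pyrimidine": set("AG"),
    "amino/keto": set("AC"),
    "weak/strong": set("AT"),
}

def characteristic_sequence(dna, ones):
    """Map a DNA string to a 0/1 string according to one encoding."""
    return "".join("1" if base in ones else "0" for base in dna.upper())

def lz_complexity(s):
    """Number of components in the Lempel-Ziv (1976) exhaustive history of s."""
    i, count, n = 0, 0, len(s)
    while i < n:
        length = 1
        # Extend the candidate while it can still be copied from the text
        # that precedes its last symbol.
        while i + length <= n and s[i:i + length] in s[:i + length - 1]:
            length += 1
        count += 1
        i += length
    return count

bits = characteristic_sequence("ACGTACGTTTGA", ENCODINGS["purine/pyrimidine"])
print(bits, lz_complexity(bits))
```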

5.
6.
Aims:  Some Geobacillus species have highly similar 16S rRNA gene sequences, making 16S rDNA sequence analysis-based identification problematic. To overcome this limitation, recA and rpoB sequence analysis was evaluated as an alternative for distinguishing Geobacillus species.
Methods and Results:  The phylogram of 16S rRNA gene sequences inferred from the neighbour-joining method showed that nine clusters of Geobacillus species were characterized with bootstrap values >90%. The recA and rpoB sequences of 10 reference strains in clusters V, VIb and VIc were amplified and sequenced using consensus primers. Alignment of recA sequences in clusters V, VIb and VIc revealed three types of recA genes, consistent with the putative amino acid sequences and in vivo recA splicing analysis. The phylogram constructed from rpoB sequences showed more divergence than that constructed from 16S rRNA gene sequences.
Conclusions:  recA and rpoB sequence analysis differentiated closely related Geobacillus species and provided direct evidence for reclassifying some species dubiously categorized as Geobacilli. Additionally, this study revealed three types of recA genes in the different Geobacillus species.
Significance and Impact of the Study:  This study highlights the advantage of recA and rpoB sequence analysis to supplement 16S rRNA gene sequence analysis for efficient and convenient determination of Geobacillus species.

7.
Cereal centromeres commonly contain many repetitive sequences that are derived from Ty3/gypsy retrotransposons. FISH analysis using a large DNA insert library of wheat identified a 67-kb clone (R11H) that showed strong hybridization signals on the centromeres. The R11H clone contains Ty3/gypsy retrotransposon-related sequences; both integrase and CCS1 family sequences were identified. Subsequently, we isolated 23 additional large-insert clones that also contained the integrase and CCS1 sequences. Based on the number of integrase repeats in the clones, as determined by DNA gel blot analysis, we concluded that the retrotransposon-like sequences are tandemly repeated in wheat centromeres at intervals of ca. 55 kb on average. This conclusion is consistent with the results of FISH analysis on extended DNA fibers.

8.
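A preliminary study of coding sequences lacking 3-base periodicity   Cited by: 4 (self-citations: 0, citations by others: 4)
Fourier spectrum analysis of 120 relatively short coding sequences (<1,200 bp) shows that 3-base periodicity is not always present in short coding sequences. Statistical analysis suggests that whether a coding sequence shows 3-base periodicity is related to the base composition and distribution of the sequence, the choice and order of amino acids in the encoded protein, and synonymous codon usage. In general, A+U content is higher than G+C content in non-period-3 sequences, whereas the opposite holds for period-3 sequences; bases are distributed more evenly across the three codon positions in non-period-3 sequences than in period-3 sequences; and codon and amino acid usage bias is weaker in non-period-3 sequences than in period-3 sequences. These phenomena should be taken fully into account when Fourier analysis is used to predict genes and exons in DNA sequences.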
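
To make the Fourier criterion concrete, the sketch below computes the spectral power of a sequence at a chosen frequency by summing the squared magnitudes of the Fourier coefficients of the four base-indicator sequences. This is a widely used formulation and is assumed here; the abstract does not specify the exact spectrum definition or any threshold, and the function name `spectral_power` is hypothetical.

```python
import cmath

def spectral_power(seq, freq):
    """Summed power of the A, C, G, T indicator sequences at `freq` (cycles per base)."""
    seq = seq.upper().replace("U", "T")  # accept RNA as well as DNA
    total = 0.0
    for base in "ACGT":
        coeff = sum(
            cmath.exp(-2j * cmath.pi * freq * n)
            for n, b in enumerate(seq)
            if b == base
        )
        total += abs(coeff) ** 2
    return total

# A perfectly codon-periodic toy sequence versus a homopolymer.
print(spectral_power("ACG" * 40, 1 / 3))
print(spectral_power("A" * 120, 1 / 3))
```

In practice the power at frequency 1/3 is usually compared with the power at neighbouring frequencies or with the spectrum average, so that sequence length and base composition do not dominate the result.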

9.
Hümbelin M  Thomas A  Lin J  Li J  Jore J  Berry A 《Gene》2002,300(1-2):129-139
Three statistical/mathematical analyses are carried out on isochore sequences: spectral analysis, analysis of variance, and segmentation analysis. Spectral analysis shows that there are GC content fluctuations at different length scales in isochore sequences. The analysis of variance shows that the null hypothesis (the mean value of a group of GC contents remains the same along the sequence) may or may not be rejected for an isochore sequence, depending on the subwindow sizes at which GC contents are sampled, and the window size within which group members are defined. The segmentation analysis shows that there are stronger indications of GC content changes at isochore borders than within an isochore. These analyses support the notion of isochore sequences, but reject the assumption that isochore sequences are homogeneous at the base level. An isochore sequence may pass a homogeneity test when GC content fluctuations at smaller length scales are ignored or averaged out.
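
The dependence on window size described above can be illustrated with a small Python sketch that samples GC content in non-overlapping windows and reports the variance across windows. The function names and the toy sequence are assumptions for illustration only; the paper's actual analysis-of-variance procedure is not given in the abstract.

```python
import statistics

def gc_content(seq):
    """Fraction of G and C bases in a sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)

def windowed_gc(seq, window):
    """GC content of consecutive non-overlapping windows (trailing remainder dropped)."""
    return [
        gc_content(seq[i:i + window])
        for i in range(0, len(seq) - window + 1, window)
    ]

def gc_variance(seq, window):
    """Population variance of GC content across windows of the given size."""
    return statistics.pvariance(windowed_gc(seq, window))

toy = "GCATGCAT"
print(gc_variance(toy, 2))  # fluctuations visible at the 2-base scale
print(gc_variance(toy, 4))  # averaged out at the 4-base scale
```

The toy sequence looks heterogeneous at the smaller window size and homogeneous at the larger one, which mirrors the abstract's point that a homogeneity verdict depends on the scale at which GC content is sampled.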

10.
Comparing DNA or protein sequences plays an important role in the functional analysis of genomes. Although many methods are available for sequence comparison, few retain the information content of sequences. We propose a new approach, the Yau-Hausdorff method, which considers all translations and rotations when seeking the best match between graphical curves of DNA or protein sequences. The complexity of this method is lower than that of any other two-dimensional minimum Hausdorff algorithm. The Yau-Hausdorff method can be used to measure the similarity of DNA sequences based on two important tools: the Yau-Hausdorff distance and the graphical representation of DNA sequences. The graphical representations of DNA sequences conserve all sequence information, and the Yau-Hausdorff distance is mathematically proved to be a true metric. Therefore, the proposed distance can precisely measure the similarity of DNA sequences. Phylogenetic analyses of DNA sequences using the Yau-Hausdorff distance show the accuracy and stability of our approach in similarity comparison of DNA or protein sequences. This study demonstrates that the Yau-Hausdorff distance is a natural metric for DNA and protein sequences with a high level of stability. The approach can also be applied to similarity analysis of protein sequences by graphical representations, as well as to general two-dimensional shape matching.
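
For orientation, the sketch below implements only the classical Hausdorff distance between two finite sets of 2D points. It is not the Yau-Hausdorff distance: the minimization over translations and rotations that the abstract describes is omitted, and the graphical representation that turns a sequence into a curve is not shown because the abstract does not define it.

```python
import math

def directed_hausdorff(a, b):
    """Largest distance from a point of `a` to its nearest point in `b`."""
    return max(
        min(math.hypot(ax - bx, ay - by) for bx, by in b)
        for ax, ay in a
    )

def hausdorff(a, b):
    """Classical (symmetric) Hausdorff distance between two point sets."""
    return max(directed_hausdorff(a, b), directed_hausdorff(b, a))

curve_a = [(0, 0), (1, 0)]
curve_b = [(0, 0), (1, 0), (1, 3)]
print(hausdorff(curve_a, curve_b))
```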

11.
We describe a new computer program that identifies conserved secondary structures in aligned nucleotide sequences of related single-stranded RNAs. The program employs a series of hash tables to identify and sort common base paired helices that are located in identical positions in more than one sequence. The program gives information on the total number of base paired helices that are conserved between related sequences and provides detailed information about common helices that have a minimum of one or more compensating base changes. The program is useful in the analysis of large biological sequences. We have used it to examine the number and type of complementary segments (potential base paired helices) that can be found in common among related random sequences similar in base composition to 16S rRNA from Escherichia coli. Two types of random sequences were analyzed. One set consisted of sequences that were independent but they had the same mononucleotide composition as the 16S rRNA. The second set contained sequences that were 80% similar to one another. Different results were obtained in the analysis of these two types of random sequences. When 5 sequences that were 80% similar to one another were analyzed, significant numbers of potential helices with two or more independent base changes were observed. When 5 independent sequences were analyzed, no potential helices were found in common. The results of the analyses with random sequences were compared with the number and type of helices found in the phylogenetic model of the secondary structure of 16S ribosomal RNA. Many more helices are conserved among the ribosomal sequences than are found in common among similar random sequences. In addition, conserved helices in the 16S rRNAs are, on the average, longer than the complementary segments that are found in comparable random sequences. The significance of these results and their application in the analysis of long non-ribosomal nucleotide sequences is discussed. 

12.
DNA sequences related to the endogenous retrovirus of chickens, Rous-associated virus-O (RAV-O), have been examined using site-specific DNA endonuclease analysis of cellular DNA derived from line 15 and line 100 chickens. Individual embryos from both inbred lines were used as a source of embryonic fibroblasts from which cellular DNA was isolated. Analysis of DNA containing either endogenous RAV-O sequences alone or both endogenous and exogenous RAV-O sequences produced identical patterns of RAV-O-specific DNA fragments after digestion with the endonucleases Eco RI, Hind III, Bgl II, Bam HI or Xho I. Similar analysis with endonucleases Hinc II or Hha I, however, produced several RAV-O-specific DNA fragments which were derived from cellular DNA containing both endogenous and exogenous RAV-O sequences but not from cellular DNA containing only endogenous sequences. Although some differences exist between the DNA fragments specific for the endogenous viral sequences of line 15 and line 100 cellular DNA, the DNA fragments specific for the exogenous viral sequences were identical between the two inbred lines. Cleavage of an unintegrated linear RAV-O DNA molecule with Hinc II or Hha I produced DNA fragments identical to those specific for the exogenously acquired RAV-O provirus. This suggests that these characteristic fragments contain no cellular DNA. The potential DNA junction fragments containing both viral and cellular DNA, identified after analysis of DNA that contains both endogenous and exogenous viral sequences, were identical to those observed after analysis of DNA containing only endogenous viral sequences. These results support the following conclusions. First, exogenous proviral sequences are integrated into chicken cell DNA following an interaction between viral and cellular DNA that is specific with respect to the virus and nonspecific with respect to the cell. Second, both the free linear RAV-O DNA intermediate and the newly integrated exogenous provirus contain specific endonuclease sites that are not found in endogenous RAV-O DNA sequences. These results suggest that the formation of the exogenous DNA provirus involves specific alteration of the endogenous viral DNA sequences before reinsertion of the sequences as the exogenous RAV-O DNA provirus. It is possible that newly integrated exogenous RAV-O sequences are characterized by specific differences in the pattern of base methylation and a limited sequence arrangement.

13.
14.
MOTIVATION: The programs currently available for the analysis of nucleic acid and protein sequences suffer from a variety of problems: Web-based programs often require inconvenient reformatting of sequences when proceeding from one analysis to the next, and commercial console-based programs are cost-prohibitive. Here, we report the development of DNASSIST, an inexpensive multiple-document interface program for the fully integrated editing and analysis of nucleic acid and protein sequences in the familiar environment of Microsoft Windows.

15.
Research on male courtship behavior of moths has focused on documenting stereotyped sequences for successful copulation. We characterized successful male courtship behavior among 126 virgin mating pairs of Ostrinia nubilalis. Using Markov analysis, stereotypy indices, and a novel application of ecological network analysis, we found high variability in these sequences. Fifteen courtship behaviors were described and 96 behavioral transitions were observed, 39 of which occurred only once. The number of courtship bouts ranged from one to ten, the number of behavioral transitions ranged from four to 41, and the number of copulation attempts ranged from one to 29. Only 23% of males used a common, simple behavioral sequence. Females exhibited acceptance or rejection behaviors in 40% of the sequences, but these did not explain the high variability in male courtship sequences. About half of the transitions occurred non-randomly, and stereotypy was low. Network analysis revealed that the courtship sequences started and ended with stereotyped behaviors and the high variability occurred in the middle of the sequences. Whole system analysis showed that the courtship sequences were more variable than for optimal transfer of information. Overall, these results suggest that the sequence of behaviors may be less important than the occurrence of certain behavioral elements for successful mating.
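
The first step of a Markov analysis of behavioral sequences, estimating first-order transition probabilities from observed sequences, might look like the sketch below. The behavior labels are invented for illustration and are not the fifteen behaviors described in the study; the stereotypy indices and network analysis are not reproduced.

```python
from collections import Counter, defaultdict

def transition_probabilities(sequences):
    """Estimate P(next behavior | current behavior) from observed sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(zip(seq, seq[1:]))  # consecutive (current, next) pairs
    totals = defaultdict(int)
    for (src, _dst), n in counts.items():
        totals[src] += n
    return {(src, dst): n / totals[src] for (src, dst), n in counts.items()}

# Hypothetical courtship records, one list per male.
observed = [
    ["approach", "wing_fan", "copulate"],
    ["approach", "wing_fan", "wing_fan"],
]
for pair, p in sorted(transition_probabilities(observed).items()):
    print(pair, p)
```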

16.
The RDP (Ribosomal Database Project) continues   Cited by: 56 (self-citations: 0, citations by others: 56)
The Ribosomal Database Project (RDP-II), previously described by Maidak et al., continued during the past year to add new rRNA sequences to the aligned data and to improve the analysis commands. Release 7.1 (September 17, 1999) included more than 10 700 small subunit rRNA sequences. More than 850 type strain sequences were identified and added to the prokaryotic alignment, bringing the total number of type sequences to 3324, representing 2460 different species. An RDP-II mirror site in Japan is also near completion. RDP-II provides aligned and annotated rRNA sequences, derived phylogenetic trees and taxonomic hierarchies, and analysis services through its WWW server (http://rdp.cme.msu.edu/). Analysis services include rRNA probe checking, approximate phylogenetic placement of user sequences, screening user sequences for possible chimeric rRNA sequences, automated alignment, production of similarity matrices and services to plan and analyze terminal restriction fragment length polymorphism (T-RFLP) experiments.

17.
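Microsatellites are a class of highly variable repetitive DNA sequences made up of many tandem repeats of a 2-6 nucleotide unit (Schlotterer and Tautz, 1992). They segregate in a Mendelian fashion, mutate rapidly, are rich in polymorphism information, and are inherited codominantly; their core sequences are conserved within a species, so suitable primers can be designed from the flanking sequences of a microsatellite …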
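
The definition above (tandem repeats of a 2-6 nucleotide unit) translates directly into a regular-expression search. In the sketch below, the function name `find_microsatellites` and the default threshold of three repeats are assumptions for illustration; published screens typically use motif-specific minimum repeat counts.

```python
import re

def find_microsatellites(seq, min_repeats=3):
    """Return (start, unit, repeat_count) for tandem repeats of 2-6 nt units."""
    # Group 1 is the shortest 2-6 nt unit; the backreference requires at
    # least (min_repeats - 1) further copies immediately after it.
    pattern = re.compile(r"([ACGT]{2,6}?)\1{%d,}" % (min_repeats - 1))
    hits = []
    for match in pattern.finditer(seq.upper()):
        unit = match.group(1)
        hits.append((match.start(), unit, len(match.group(0)) // len(unit)))
    return hits

print(find_microsatellites("TTACACACACGG"))
```

Note that this simple pattern also reports homopolymer runs as repeats of a two-base unit such as "AA"; a real screen would filter those out.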

18.
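Applications of signal processing techniques to biomolecular sequence analysis mainly include periodicity analysis, gene prediction, analysis of similar and repetitive sequences, and prediction of protein molecular structure. The methods involved include the Fourier transform, the wavelet transform, correlation analysis, fractal techniques, and nonlinear signal processing techniques. This paper gives a comprehensive review of these applications.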

19.
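Construction and application of a PC/Linux-based nucleic acid sequence analysis system   Cited by: 13 (self-citations: 2, citations by others: 11)
A system for large-scale automated analysis of nucleic acid sequences was built on a PC running the Linux operating system, using the Phred/Phrap/Consed and BLAST software. The system automatically converts sequencing chromatograms into nucleic acid sequences, removes vector sequences, assembles sequences, identifies repetitive sequences, and performs sequence similarity analysis, which can accelerate the analysis and use of large-scale sequencing data.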

20.
The diversity of serine proteases secreted from Chrysomya bezziana larvae was investigated biochemically and by PCR and sequence analysis. Cation-exchange chromatography of purified larval serine proteases resolved four trypsin-like activities and three chymotrypsin-like activities as discerned by kinetic studies with benzoyl-Arg-p-nitroanilide and succinyl-Ala-Ala-Pro-Phe-p-nitroanilide. Amino-terminal sequencing of the three most abundant fractions gave two sequences, which were homologous to other Dipteran trypsins and chymotrypsins. Analysis of products generated by PCR of cDNA from whole larvae using specific primers based on the amino-terminal sequences and generic serine protease primers identified 22 different sequences, while phylogenetic analysis of the deduced amino acid sequences differentiated two trypsin-like and four chymotrypsin-like families. Phylogenetic comparisons with Dipteran and mammalian serine protease sequences showed that all the Chrysomya bezziana sequences clustered with Dipteran sequences. The Chrysomya bezziana chymotrypsin-like sequences segregated within a Dipteran cluster of chymotrypsin sequences, but were well dispersed amongst these sequences. The largest Chrysomya bezziana serine protease family, the trypB family, clustered tightly as a group, and was closely related to a Lucilia cuprina trypsin but distinct from Drosophila melanogaster alpha and beta trypsins. The trypB family contains ten highly homologous sequences and probably represents an example of concerted evolution of a trypsin gene in Chrysomya bezziana.
