首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
A computer software package called 'FasParser' was developed for manipulating sequence data.It can be used on personal computers to perform series of analyses,including counting and viewing differences between two sequences at both DNA and codon levels,identifying overlapping regions between two alignments,sorting of sequences according to their IDs or lengths,concatenating sequences of multiple loci for a particular set of samples,translating nucleotide sequences to amino acids,and constructing alignments in several different formats,as well as some extracting and filtrating of data for a particular FASTA file.Majority of these functions can be run in a batch mode,which is very useful for analyzing large data sets.This package can be used by a broad audience,and is designed for researchers that do not have programming experience in sequence analyses.The GUI version of FasParser can be downloaded from https://github.com/Sun-Yanbo/FasParser,free of charge.  相似文献   

2.
3.
Analysis for free: comparing programs for sequence analysis   总被引:4,自引:0,他引:4  
Programs to import, manage and align sequences and to analyse the properties of DNA, RNA and proteins are essential for every biological laboratory. This review describes two different freeware (BioEdit and pDRAW for MS Windows) and a commercial program (Sequencher for MS Windows and Apple MacOS). Bioedit and Sequencher offer functions such as sequence alignment and editing plus reading of sequence trace files. pDRAW is a very comfortable visualisation tool with a variety of analysis functions. While Sequencher impresses with a very user-friendly interface and easy-to-use tools, BioEdit offers the largest and most customisable variety of tools. The strength of pDRAW is drawing and analysis of single sequences for priming and restriction sites and virtual cloning. It has a database function for user-specific oligonucleotides and restriction enzymes.  相似文献   

4.
Vector NTI, a balanced all-in-one sequence analysis suite   总被引:6,自引:0,他引:6  
Vector NTI is a well-balanced desktop application integrated for molecular sequence analysis and biological data management. It has a centralised database and five application modules: Vector NTI, AlignX, BioAnnotator, ContigExpress and GenomBench. In this review, the features and functions available in this software are examined. These include database management, primer design, virtual cloning, alignments, sequence assembly, 3D molecular viewer and internet tools. Some problems encountered when using this software are also discussed. It is hoped that this review will introduce this software to more molecular biologists so they can make better-informed decisions when choosing computational tools to facilitate their everyday laboratory work. This tool can save time and enhance analysis but it requires some learning on the user's part and there are some issues that need to be addressed by the developer.  相似文献   

5.
大头蛙Sox基因的克隆与序列分析   总被引:2,自引:0,他引:2  
参考人SRY基因HMC—box的保守区序列,设计一对特异引物,采用PCR技术扩增了大头蛙的Sox基因,并对扩增产物进行了克隆与测序。结果在雌雄个体中均扩增出217bp的基因片段,与人对照相同。序列分析表明,大头蛙雌雄个体之间Sox序列没有差异,与人SOX基因的同源性达到88%,与其它各类动物的Sox基因也都有非常高的相似性。结果支持Sox基因在进化上十分保守的结论。  相似文献   

6.
High-Cot sequence analysis of the maize genome   总被引:10,自引:0,他引:10  
Higher eukaryotic genomes, including those from plants, contain large amounts of repetitive DNA that complicate genome analysis. We have developed a technique based on DNA renaturation which normalizes repetitive DNA, and thereby allows a more efficient outcome for full genome shotgun sequencing. The data indicate that sequencing the unrenatured outcome of a Cot experiment, otherwise known as High-Cot DNA, enriches genic sequences by more than fourfold in maize, from 5% for a random library to more than 20% for a High-Cot library. Using this approach, we predict that gene discovery would be greater than 95% and that the number of sequencing runs required to sequence the full gene space in maize would be at least fourfold lower than that required for full-genome shotgun sequencing.  相似文献   

7.
目的 探讨Unev无毛小鼠的无毛性状与无毛基因(hairless gene,hr)的相关性.方法 参照Gen-Bank上公布的小鼠的hr序列,设计5对引物,用RT-PCR方法对本单位培育的Unev无毛小鼠hr的编码区序列进行了克隆与分析.结果 获得了Unev无毛小鼠及野生型hr的全部编码区序列(3546 bp).Unev无毛小鼠hr基因与野生型小鼠hr基因的长度及序列完全一致,同源性为100%.与GenBank上发表的国外小鼠hr基因序列(Z32675)相比,同源性为99.7%,共10个碱基发生了突变,其中2个碱基突变导致了相应的氨基酸突变;和昆明小鼠的hr(AY547391)相比,同源性为99.6%,共12个碱基发生了突变,其中3个碱基突变导致了相应的氨基酸突变;但这些突变是由种属差异造成的.结论 Uncv无毛小鼠的无毛性状产生与hr基因无关.  相似文献   

8.
It is often possible to identify sequence motifs that characterize a protein family in terms of its fold and/or function from aligned protein sequences. Such motifs can be used to search for new family members. Partitioning of sequence alignments into regions of similar amino acid variability is usually done by hand. Here, I present a completely automatic method for this purpose: one that is guaranteed to produce globally optimal solutions at all levels of partition granularity. The method is used to compare the tempo of sequence diversity across reliable three-dimensional (3D) structure-based alignments of 209 protein families (HOMSTRAD) and that for 69 superfamilies (CAMPASS). (The mean alignment length for HOMSTRAD and CAMPASS are very similar.) Surprisingly, the optimal segmentation distributions for the closely related proteins and distantly related ones are found to be very similar. Also, optimal segmentation identifies an unusual protein superfamily. Finally, protein 3D structure clues from the tempo of sequence diversity across alignments are examined. The method is general, and could be applied to any area of comparative biological sequence and 3D structure analysis where the constraint of the inherent linear organization of the data imposes an ordering on the set of objects to be clustered.  相似文献   

9.
Myristoylation by the myristoyl-CoA:protein N-myristoyltransferase (NMT) is an important lipid anchor modification of eukaryotic and viral proteins. Automated prediction of N-terminal N-myristoylation from the substrate protein sequence alone is necessary for large-scale sequence annotation projects but it requires a low rate of false positive hits in addition to a sufficient sensitivity.Our previous analysis of substrate protein sequence variability, NMT sequences and 3D structures has revealed motif properties in addition to the known PROSITE motif that are utilized in a new predictor described here. The composite prediction function (with separate ad hoc parameterization (a) for queries from non-fungal eukaryotes and their viruses and (b) for sequences from fungal species) consists of terms evaluating amino acid type preferences at sequences positions close to the N terminus as well as terms penalizing deviations from the physical property pattern of amino acid side-chains encoded in multi-residue correlation within the motif sequence. The algorithm has been validated with a self-consistency and two jack-knife tests for the learning set as well as with kinetic data for model substrates. The sensitivity in recognizing documented NMT substrates is above 95 % for both taxon-specific versions. The corresponding rate of false positive prediction (for sequences with an N-terminal glycine residue) is close to 0.5 %; thus, the technique is applicable for large-scale automated sequence database annotation. The predictor is available as public WWW-server with the URL http://mendel.imp.univie.ac.at/myristate/. Additionally, we propose a version of the predictor that identifies a number of proteolytic protein processing sites at internal glycine residues and that evaluates possible N-terminal myristoylation of the protein fragments.A scan of public protein databases revealed new potential NMT targets for which the myristoyl modification may be of critical importance for biological function. Among others, the list includes kinases, phosphatases, proteasomal regulatory subunit 4, kinase interacting proteins KIP1/KIP2, protozoan flagellar proteins, homologues of mitochondrial translocase TOM40, of the neuronal calcium sensor NCS-1 and of the cytochrome c-type heme lyase CCHL. Analyses of complete eukaryote genomes indicate that about 0.5 % of all encoded proteins are apparent NMT substrates except for a higher fraction in Arabidopsis thaliana ( approximately 0.8 %).  相似文献   

10.
目的 为确定猪链球菌2型(SS2)江苏分离株9801是否具有纤连蛋白/血纤维蛋白原结合蛋白基因(fbps)。方法 根据已发表的SS2 fbps序列,设计并合成1对引物,以SS2江苏分离株的基因组为模板,采用聚合酶链反应(PER)方法扩增fbps,克隆于pMD-T18载体。测定阳性质粒插入序列。利用DNAstar软件,比较所测序列与不同来源SS2的fbps和FBPS与链球菌属其他种同源蛋白的同源性。结果 扩增的fbps为1 938 bp,江苏分离株与猪源致病性SS2荷兰分离株的fbps序列有5个碱基不同,与ATCCA3765株有3个碱基不同,同源性均达99.99%以上。推导的氨基酸序列比较,分别有4个和2个氨基酸不同,同源性均为99%以上。纤连蛋白结合蛋白(FBPS)无前导序列,无锚定序列,与链球菌属其他种的同源Fn结合蛋白的同源性为68.8%~76.0%。结论 FBPS是一种无锚的黏附素。  相似文献   

11.
Residue determinants and sequence analysis of cold-adapted trypsins   总被引:3,自引:0,他引:3  
The digestive enzyme trypsin is among the most extensively studied proteins, and its structure has been reported from a large number of organisms. This article focuses on the trypsins from vertebrates adapted to life at low temperatures. Cold-adapted organisms seem to have compensated for the reduced reaction rates at low temperatures by evolving more active and less temperature-stable enzymes. We have analyzed 27 trypsin sequences from a variety of organisms to find unique attributes for the cold-adapted trypsins, comparing trypsins from salmon, Antarctic fish, cod, and pufferfish to other vertebrate trypsins. Both the "cold" and the "warm" active trypsins have about 50 amino acids that are unique and conserved within each class. The main unique features of the cold-adapted trypsins attributable to low-temperature adaptation seem to be (1) reduced hydrophobicity and packing density of the core, mainly because of a lower (Ile + Leu)/(Ile + Leu + Val) ratio, (2) reduced stability of the C-terminal, (3) lack of one warm trypsin conserved proline residue and one proline tyrosine stacking, (4) difference in charge and flexibility of loops extending the binding pocket, and (5) different conformation of the "autolysis" loop that is likely to be involved in substrate binding. Received: January 14, 1999 / Accepted: March 31, 1999  相似文献   

12.
目的初步研究巴尔通体在恒河猴体内的存在情况,并分析其柠檬酸合成酶(gltA)的基因序列,判断巴尔通体的种属。方法 16只来自福建的恒河猴,用血琼脂培养基分离可能存在的巴尔通体。根据NCBI数据库上的巴尔通体gltA的基因序列,设计一对引物,以巴尔通体菌落为模板进行扩增,将获得的序列进行克隆测序。结果从3只恒河猴体内成功分离到了巴尔通体病原,并获得了巴尔通体gltA全长基因序列,测序结果表明该序列与五日热巴尔通体同源性99%。结论福建来源的恒河猴种群携带巴尔通体病原,巴尔通体流行地区可能存在鼠与灵长类动物之间病原体的流行和传播。  相似文献   

13.
中生菌素生物合成基因zpsA的克隆及序列分析   总被引:1,自引:0,他引:1  
利用PCR技术,首次从淡紫灰链霉菌海南变种(Streptomyces Lavendulae var. hainanensis)的基因组克隆出了中生菌素生物合成基因簇中的负责β-赖氨酸活化的zpsA基因。根据国外已报道的诺尔丝菌素肽合成酶基因(npsA)和链丝菌素肽合成酶基因(sttA)的保守序列以及链霉菌基因密码子的偏爱性,设计简并引物;以中生菌素高产菌株UV-69的总DNA作为模板进行PCR,先后得到长度分别为O.284kb、0.611kb、0.34kb和0.371kb的PCR产物,经测序后拼接得到1.025kb的zpsA基因片段。在GenBank中的进行比对分析,发现ZasA与NpsA、SttA的功能相同,都具有非核糖体肽合成醇腺苷酸化域(A域)的功能。  相似文献   

14.
15.
16.
从泰安地区送检的疑是细小病毒感染的蓝狐粪便中分离到一株病毒。经理化特性鉴定、血凝谱鉴定、人
工感染蓝狐等鉴定,表明所分离病毒为细小病毒。并且根据GenBank 上发表的犬细小病毒(Canine parvovirus,
CPV)、猫细小病毒(Feline parvovirus,FPV)核酸序列,设计扩增VP1 基因的引物,采用PCR 技术扩增所分离
细小病毒的VP1 全基因,将PCR 产物克隆入pMD18 - T 载体,进行测序分析。结果,所分离细小病毒的VP1 基
因全长2 256 bp,编码727 个氨基酸,与CPV 和FPV 参照株的VP1 基因同源性在98. 7% ~ 99. 5% 。VP1 基因的
系统发生分析表明所分离病毒与FPV 的亲源关系最为密切。所分离病毒VP1 蛋白375 位氨基酸残基与CPV 的
VP1 蛋白氨基酸残基一致,但其223 位、236 位、246 位、466 位、707 位、711 位氨基酸残基与FPV VP1 蛋白的
氨基酸残基一致,该病毒VP1 蛋白序列表现出了过渡型序列特征,介于FPV 与CPV 间的过渡类型,这说明所分离病毒为蓝狐细小病毒(Blue fox parvovirus,BFPV),命名为BFPV - TA,蓝狐可能在CPV 的起源过程起到重要
的作用。  相似文献   

17.
根据http://www.tigr.org中与小麦几丁质酶基因相关的序列TC187877,设计引物,分别从小麦品种Gamenya和苏麦3号中扩增到大小约为1 000bp的片段。经序列测定和软件分析比较,发现这些片段所编码的蛋白质氨基酸序列,都有CHITINASE-19.1和CHITINASE-19.2的基序,为第II类几丁质酶基因。扩增的核酸序列在GenBank上发表,登录号分别为AY973229和AY973230。  相似文献   

18.
19.
An open reading frame (termed ORF-PR) encoding a metallothionein-like domain-including protein was found upstream of a previously identified Streptomyces chymotrypsin-type protease gene (sam-P20). Promoter and terminator activities of ORF-PR were detected using the promoterless Streptomyces tyrosinase gene as a reporter gene and expression of ORF-PR was supposed to occur before that of sam-P20 gene. Frameshift mutation anaysis showed that the ORF-PR product might act as a repressive regulator of the sam-P20 gene.  相似文献   

20.
德国小蠊泛素基因的克隆及序列分析   总被引:7,自引:2,他引:7  
设计一对简并引物,从德国小蠊Blattellager manica细胞中克隆了泛素基因的编码区,GenBank登录号为AY501003。序列分析表明,该编码区的长度为228 bp,编码的多肽由76个氨基酸残基组成,相对分子质量为8.47 Kd ,其等电点为5.73。同源性比较发现,德国小蠊泛素基因与其他真核生物泛素基因在氨基酸水平上具有94%以上的相似性。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号